curiousdesk.com
robots.txt
Robots Exclusion Standard data for curiousdesk.com
Resource Scan
Scan Details
Site Domain | curiousdesk.com |
Base Domain | curiousdesk.com |
Scan Status | Ok |
Last Scan | 2024-09-13T06:58:16+00:00 |
Next Scan | 2024-09-20T06:58:16+00:00 |
Last Scan
Scanned | 2024-09-13T06:58:16+00:00 |
URL | https://curiousdesk.com/robots.txt |
Domain IPs | 104.21.69.35, 172.67.203.156, 2606:4700:3033::ac43:cb9c, 2606:4700:3035::6815:4523 |
Response IP | 172.67.203.156 |
Found | Yes |
Hash | 9939ba850acfe65a1ede5b86f1bad4b6573732ecab38fb3bd3a69f34a5f04c63 |
SimHash | 753899618db2 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /wp-admin/ |
Disallow | */wp-json |
Disallow | /? |
Disallow | *?s= |
Disallow | *%26s%3D |
Disallow | */embed$ |
Disallow | */xmlrpc.php |
Disallow | *utm*%3D |
Disallow | *openstat%3D |
Disallow | /feed/ |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
sitemap | https://curiousdesk.com/sitemap.xml |