Robots Exclusion Standard data for josephcornellbox.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | josephcornellbox.com |
Base Domain | josephcornellbox.com |
Scan Status | Ok |
Last Scan | 2025-09-13T16:25:19+00:00 |
Next Scan | 2025-10-13T16:25:19+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-13T16:25:19+00:00 |
URL | https://josephcornellbox.com/robots.txt |
Domain IPs | 104.21.1.60, 172.67.128.180, 2606:4700:3032::ac43:80b4, 2606:4700:3034::6815:13c |
Response IP | 104.21.1.60 |
Found | Yes |
Hash | 80a620d116c695bb247a69a751da089a3fe0edc0b35d709e5fb6689bf4e2f448 |
SimHash | 6bc40c5c0112 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /readme.html |
Disallow | /license.txt |
Disallow | /search/?q=* |
Disallow | /s/ |
Disallow | /?s= |
Disallow | *?replytocom |
Disallow | */attachment/* |
Disallow | /refer/ |
Disallow | /wp-login.php* |
Disallow | /component/* |
Allow | /*.js$ |
Allow | /*.css$ |
Allow | /wp-admin/admin-ajax.php |
Allow | /wp-admin/images/* |
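
To see how a crawler might apply the rules in this group, here is a minimal Python sketch. It assumes the longest-match semantics described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie), with `*` as a wildcard and a trailing `$` as an end anchor. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan; real crawlers may differ in details such as percent-encoding.

```python
import re

# Rules transcribed from the "*" group in the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/search/?q=*"),
    ("disallow", "/s/"),
    ("disallow", "/?s="),
    ("disallow", "*?replytocom"),
    ("disallow", "*/attachment/*"),
    ("disallow", "/refer/"),
    ("disallow", "/wp-login.php*"),
    ("disallow", "/component/*"),
    ("allow", "/*.js$"),
    ("allow", "/*.css$"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-admin/images/*"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under the given rules."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            candidate = (len(path), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/?s=cornell", "/about/"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under this reading, `/wp-admin/admin-ajax.php` is crawlable because the Allow pattern is longer than `/wp-admin/`, while a path such as `/wp-includes/js/jquery.js` stays blocked: `/wp-includes/` (13 characters) is longer than `/*.js$` (6 characters), so the shorter Allow does not override it.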
Other Records
Field | Value |
---|---|
sitemap | https://josephcornellbox.com/sitemap_index.xml |