josephcornellbox.com
robots.txt

Robots Exclusion Standard data for josephcornellbox.com

Resource Scan

Scan Details

Site Domain josephcornellbox.com
Base Domain josephcornellbox.com
Scan Status Ok
Last Scan2025-09-13T16:25:19+00:00
Next Scan 2025-10-13T16:25:19+00:00

Last Scan

Scanned2025-09-13T16:25:19+00:00
URL https://josephcornellbox.com/robots.txt
Domain IPs 104.21.1.60, 172.67.128.180, 2606:4700:3032::ac43:80b4, 2606:4700:3034::6815:13c
Response IP 104.21.1.60
Found Yes
Hash 80a620d116c695bb247a69a751da089a3fe0edc0b35d709e5fb6689bf4e2f448
SimHash 6bc40c5c0112

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /search/?q=*
Disallow /s/
Disallow /?s=
Disallow *?replytocom
Disallow */attachment/*
Disallow /refer/
Disallow /wp-login.php*
Disallow /component/*
Allow /*.js$
Allow /*.css$
Allow /wp-admin/admin-ajax.php
Allow /wp-admin/images/*

Other Records

Field Value
sitemap https://josephcornellbox.com/sitemap_index.xml