drchrisharlan.com
robots.txt

Robots Exclusion Standard data for drchrisharlan.com

Resource Scan

Scan Details

Site Domain drchrisharlan.com
Base Domain drchrisharlan.com
Scan Status Ok
Last Scan 2024-11-02T22:11:37+00:00
Next Scan 2024-11-16T22:11:37+00:00

Last Scan

Scanned 2024-11-02T22:11:37+00:00
URL https://drchrisharlan.com/robots.txt
Redirect https://www.drchrisharlan.com/robots.txt
Redirect Domain www.drchrisharlan.com
Redirect Base drchrisharlan.com
Domain IPs 13.33.30.110, 13.33.30.123, 13.33.30.25, 13.33.30.83
Redirect IPs 13.33.30.110, 13.33.30.123, 13.33.30.25, 13.33.30.83
Response IP 13.33.30.83
Found Yes
Hash d207a67087cdc4771966f48daddc1cf4102f669e00682b3fbc5f88db889368ea
SimHash 78499c40a389

Groups

*

Rule Path
Disallow /dash/
Disallow /pp*
Disallow /trackback
Disallow /cgi-bin
Disallow /search
Disallow /rss
Disallow /comments/feed
Disallow /*/trackback/$
Disallow /hero_slides
Disallow /banners
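
As a rough illustration of how the Disallow paths above might be evaluated, the sketch below converts each rule to a regular expression, treating `*` as "any sequence of characters" and a trailing `$` as "end of path", with everything else matched as a prefix. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sample paths are made up for the example. It only considers Disallow rules (the group lists no Allow rules), so it is not a complete robots.txt evaluator.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW_RULES = [
    "/dash/", "/pp*", "/trackback", "/cgi-bin", "/search", "/rss",
    "/comments/feed", "/*/trackback/$", "/hero_slides", "/banners",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one rule path into a regex (hypothetical helper)."""
    anchored = rule.endswith("$")          # trailing "$" pins the end of the path
    body = rule[:-1] if anchored else rule
    # Escape literal text and let each "*" match any run of characters.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + (r"\Z" if anchored else ""))


COMPILED_RULES = [rule_to_regex(rule) for rule in DISALLOW_RULES]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rx.match(path) for rx in COMPILED_RULES)


if __name__ == "__main__":
    # Sample paths are illustrative only, not taken from the site.
    for sample in ["/dash/settings", "/dashboard", "/blog/post/trackback/", "/about"]:
        print(sample, is_disallowed(sample))
```

Note that `/dash/` ends with a slash, so under this reading it covers `/dash/settings` but not `/dashboard`, while `/pp*` covers any path that begins with `/pp`.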

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.drchrisharlan.com/sitemap.xml
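
For readers who want to pull the crawl-delay and sitemap records programmatically, here is a minimal sketch using Python's standard `urllib.robotparser` (the `site_maps()` method needs Python 3.8 or later). The `ROBOTS_TXT` string is an assumption: it is reconstructed from the fields in this report, shortened to two Disallow lines, and is not a verbatim copy of the live file, whose line order and comments are unknown. Crawl-delay is a non-standard extension, so individual crawlers may ignore it.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the report fields above, NOT the live file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /dash/
Disallow: /search
Crawl-delay: 10

Sitemap: https://www.drchrisharlan.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())   # parse in memory, no network request

delay = parser.crawl_delay("*")         # seconds to wait between requests
sitemaps = parser.site_maps()           # list of sitemap URLs, or None

if __name__ == "__main__":
    print("crawl-delay:", delay)
    print("sitemaps:", sitemaps)
```

A crawler honoring these records would wait `delay` seconds between requests to the host and could use the sitemap URL to discover pages.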