blog.bearclausepublications.com
robots.txt

Robots Exclusion Standard data for blog.bearclausepublications.com

Resource Scan

Scan Details

Site Domain blog.bearclausepublications.com
Base Domain bearclausepublications.com
Scan Status Ok
Last Scan2025-11-23T06:47:15+00:00
Next Scan 2025-12-23T06:47:15+00:00

Last Scan

Scanned2025-11-23T06:47:15+00:00
URL https://blog.bearclausepublications.com/robots.txt
Domain IPs 192.185.149.216
Response IP 192.185.149.216
Found Yes
Hash a3d20aa85f334a7a99f73de307bd3105961cab142d2fc1a2df9bf949c649dfe9
SimHash 2a4458408483

Groups

googlebot

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

twiceler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

nutch

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

asterias

Rule Path
Disallow /

*

Rule Path
Disallow
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/

Other Records

Field Value
sitemap http://www.yoursite.com/sitemap.gz