blog.bearclausepublications.com
robots.txt

Robots Exclusion Standard data for blog.bearclausepublications.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	blog.bearclausepublications.com
Base Domain	bearclausepublications.com
Scan Status	Ok
Last Scan	2025-11-23T06:47:15+00:00
Next Scan	2025-12-23T06:47:15+00:00

Last Scan

Scanned	2025-11-23T06:47:15+00:00
URL	https://blog.bearclausepublications.com/robots.txt
Domain IPs	192.185.149.216
Response IP	192.185.149.216
Found	Yes
Hash	a3d20aa85f334a7a99f73de307bd3105961cab142d2fc1a2df9bf949c649dfe9
SimHash	2a4458408483

Groups

googlebot

Rule	Path
Disallow

Rule

Path

Disallow

msnbot

Rule	Path
Disallow

Rule

Path

Disallow

slurp

Rule	Path
Disallow

Rule

Path

Disallow

teoma

Rule	Path
Disallow

Rule

Path

Disallow

twiceler

Rule	Path
Disallow	/

Rule

Path

Disallow

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

scrubby

Rule	Path
Disallow	/

Rule

Path

Disallow

nutch

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

naverbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

asterias

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow
Disallow	/cgi-bin/
Disallow	/wp-admin/
Disallow	/wp-includes/

Rule

Path

Disallow

/cgi-bin/

Disallow

/wp-admin/

Disallow

/wp-includes/

Other Records

Field	Value
sitemap	http://www.yoursite.com/sitemap.gz

Field

Value

sitemap

http://www.yoursite.com/sitemap.gz

blog.bearclausepublications.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot

msnbot

slurp

teoma

twiceler

gigabot

scrubby

nutch

baiduspider

naverbot

yeti

asterias

*

Other Records

blog.bearclausepublications.com
robots.txt