sparrowblogs.com
robots.txt

Robots Exclusion Standard data for sparrowblogs.com

Resource Scan

Scan Details

Site Domain sparrowblogs.com
Base Domain sparrowblogs.com
Scan Status Ok
Last Scan2025-07-22T13:03:15+00:00
Next Scan 2025-08-21T13:03:15+00:00

Last Scan

Scanned2025-07-22T13:03:15+00:00
URL https://sparrowblogs.com/robots.txt
Domain IPs 208.91.197.132
Response IP 208.91.197.132
Found Yes
Hash f746e6bb107cdcef4d19a5903b1491098eedfcedd21f68bcdbc12311206c8195
SimHash 28187013c193

Groups

*

Rule Path
Disallow /fcmedianet.js
Disallow /__media__/js/templates.js
Disallow /cmedianet
Disallow /cmdynet
Disallow /mediamainlog.php

googlebot

Rule Path
Disallow

slurp

Rule Path
Disallow

msnbot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

*

Rule Path
Disallow /