thecranberryeagle.com
robots.txt

Robots Exclusion Standard data for thecranberryeagle.com

Resource Scan

Scan Details

Site Domain thecranberryeagle.com
Base Domain thecranberryeagle.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-07-05T01:10:04+00:00
Next Scan 2024-10-03T01:10:04+00:00

Last Successful Scan

Scanned 2023-06-04T19:17:20+00:00
URL http://www.thecranberryeagle.com/robots.txt
Domain IPs 3.221.203.58, 52.5.20.244
Response IP 52.5.20.244
Found Yes
Hash 3639f719703b3f3a94a2355148d503a13569abab32bf4a83cf7f3fb7273cd215
SimHash 8c5155612d91

Groups

msiecrawler

Rule Path
Disallow /

*

Rule Path
Disallow /apps/pbcs.dll/classifieds
Disallow /apps/pbcs.dll/events
Disallow /apps/pbcs.dll/index
Disallow /apps/pbcs.dll/news
Disallow /apps/pbcs.dll/temaoversikt
Disallow /apps/pbcs.dll/related
Disallow /apps/pbcs.dll/misc
Disallow /apps/pbcs.dll/error
Disallow /apps/pbcs.dll/search
Disallow /apps/pbcsi.dll
Disallow /apps/pbcsad.dll
Disallow /apps/rub.dll
Disallow /tmp/
Disallow /logs/
Disallow /images/
Disallow /.cache/
Disallow /mal/
Disallow /apps/pbcs.dll/section?Category=ARCHIVES30
Disallow /apps/pbcs.dll/section?Kategori=ARCHIVES30
Disallow /apps/pbcs.dll/oversikt?Kategori=ARCHIVES30
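
The two groups above can be checked against sample paths with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own logic: the robots.txt text is reconstructed from the listed rules (the original file's exact layout is not shown here), and the names ROBOTS_TXT, BASE_URL, build_parser, is_allowed, and the "ExampleBot" user agent are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups listed in this report.
# The original file's ordering, spacing, and comment placement are unknown.
ROBOTS_TXT = """\
User-agent: msiecrawler
Disallow: /

User-agent: *
Disallow: /apps/pbcs.dll/classifieds
Disallow: /apps/pbcs.dll/events
Disallow: /apps/pbcs.dll/index
Disallow: /apps/pbcs.dll/news
Disallow: /apps/pbcs.dll/temaoversikt
Disallow: /apps/pbcs.dll/related
Disallow: /apps/pbcs.dll/misc
Disallow: /apps/pbcs.dll/error
Disallow: /apps/pbcs.dll/search
Disallow: /apps/pbcsi.dll
Disallow: /apps/pbcsad.dll
Disallow: /apps/rub.dll
Disallow: /tmp/
Disallow: /logs/
Disallow: /images/
Disallow: /.cache/
Disallow: /mal/
Disallow: /apps/pbcs.dll/section?Category=ARCHIVES30
Disallow: /apps/pbcs.dll/section?Kategori=ARCHIVES30
Disallow: /apps/pbcs.dll/oversikt?Kategori=ARCHIVES30
"""

# Host taken from the "URL" field of the last successful scan.
BASE_URL = "http://www.thecranberryeagle.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch path under the parsed rules."""
    return parser.can_fetch(user_agent, BASE_URL + path)


PARSER = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    # "ExampleBot" is a made-up agent that falls through to the "*" group.
    for agent, path in [
        ("msiecrawler", "/"),
        ("ExampleBot", "/"),
        ("ExampleBot", "/images/logo.png"),
        ("ExampleBot", "/apps/pbcs.dll/news"),
    ]:
        print(agent, path, is_allowed(PARSER, agent, path))
```

urllib.robotparser matches rules by simple path prefix, so individual crawlers may interpret edge cases (such as the query-string rules) differently.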

Comments

  • Robots.txt
  • Be nice.