collegehockeynews.com
robots.txt

Robots Exclusion Standard data for collegehockeynews.com

Resource Scan

Scan Details

Site Domain collegehockeynews.com
Base Domain collegehockeynews.com
Scan Status Ok
Last Scan2024-09-27T18:28:41+00:00
Next Scan 2024-10-04T18:28:41+00:00

Last Scan

Scanned2024-09-27T18:28:41+00:00
URL https://collegehockeynews.com/robots.txt
Domain IPs 69.16.239.22
Response IP 69.16.239.22
Found Yes
Hash 3969e6e206ed31332f8a61650e104b034a481130a92a4ade557bceb81fa0e50d
SimHash 3808f8c75685

Groups

*

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 600

googlebot
googlebot-image
mediapartners-google
msnbot
msnbot-media
slurp
yahoo-blogs
yahoo-mmcrawler
facebookexternalhit

Rule Path
Allow /images
Disallow /images/logos
Disallow /images/bg
Disallow /images/ads
Disallow /images/chncom
Disallow /images/design
Disallow /images/hs
Disallow /images/hs-small
Disallow /images/lids
Disallow /images/special
Disallow /images/watch
Disallow /users
Disallow /php

Other Records

Field Value
crawl-delay 10

Comments

  • allow only important bots

Warnings

  • `request-rate` is not a known field.