orleansminorhockey.ca
robots.txt

Robots Exclusion Standard data for orleansminorhockey.ca

Resource Scan

Scan Details

Site Domain orleansminorhockey.ca
Base Domain orleansminorhockey.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-13T03:55:43+00:00
Next Scan 2024-11-27T03:55:43+00:00

Last Successful Scan

Scanned2024-10-29T02:01:25+00:00
URL http://orleansminorhockey.ca/robots.txt
Domain IPs 23.21.248.137, 54.243.53.0
Response IP 23.21.248.137
Found Yes
Hash 0a5d0bfb7b4d46427b48b87637a283e7977b969a821f9565edfd3dfad17c2e0f
SimHash 8e44d400c561

Groups

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow /sports-management-software/signup.php*

linguee

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

nutch

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

*

Rule Path
Disallow /images/gla/
Disallow /printable_gamesheet.php
Disallow /printable_gamesheet_landscape.php*
Disallow /printable_gamesheet_cmsa.php
Disallow /*.wmv$
Disallow /*.zip$
Disallow /files/
Disallow /mobile/

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.