campbelltontigers.ca
robots.txt

Robots Exclusion Standard data for campbelltontigers.ca

Resource Scan

Scan Details

Site Domain campbelltontigers.ca
Base Domain campbelltontigers.ca
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-08-24T13:14:42+00:00
Next Scan 2024-11-22T13:14:42+00:00

Last Successful Scan

Scanned 2022-11-02T12:07:17+00:00
URL http://campbelltontigers.ca/robots.txt
Response IP 23.21.248.137, 54.243.53.0
Found Yes
Hash 0a5d0bfb7b4d46427b48b87637a283e7977b969a821f9565edfd3dfad17c2e0f
SimHash 8e44d400c561

Groups

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow /sports-management-software/signup.php*

linguee

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

nutch

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /
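
The agent-specific groups above use two forms of the rule: an empty Disallow path (mediapartners-google), which under the Robots Exclusion Standard blocks nothing, and "Disallow /" (linguee and the others), which blocks the whole site. The sketch below checks that reading with Python's standard urllib.robotparser. The robots.txt lines are reconstructed from this report rather than copied from the original file, so their layout is an assumption, and "/schedule" is a hypothetical path used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report; the original file's
# exact layout and casing are not shown, so treat these lines as an assumption.
ROBOTS_LINES = [
    "User-agent: mediapartners-google",
    "Disallow:",
    "",
    "User-agent: linguee",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(agent, path):
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "http://campbelltontigers.ca" + path)


# "/schedule" is a hypothetical path, not one taken from the scan.
print(allowed("mediapartners-google", "/schedule"))  # empty Disallow: nothing blocked
print(allowed("linguee", "/schedule"))               # Disallow /: everything blocked
```

urllib.robotparser only does plain prefix matching, so it is suitable for these groups but not for the wildcard rules in the "*" group.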

*

Rule Path
Disallow /images/gla/
Disallow /printable_gamesheet.php
Disallow /printable_gamesheet_landscape.php*
Disallow /printable_gamesheet_cmsa.php
Disallow /*.wmv$
Disallow /*.zip$
Disallow /files/
Disallow /mobile/
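
Several rules in the "*" group rely on the wildcard extensions described in RFC 9309: "*" matches any run of characters and a trailing "$" anchors the pattern to the end of the URL path. A minimal sketch of that matching, assuming only the Disallow rules listed above (so no Allow precedence is needed), might look like the following. The helper names and the sample paths are hypothetical and not part of the scanned file.

```python
import re

# Disallow paths copied from the "*" group in this report.
STAR_GROUP_DISALLOW = [
    "/images/gla/",
    "/printable_gamesheet.php",
    "/printable_gamesheet_landscape.php*",
    "/printable_gamesheet_cmsa.php",
    "/*.wmv$",
    "/*.zip$",
    "/files/",
    "/mobile/",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if any Disallow pattern in the group matches `path`."""
    return any(pattern_to_regex(p).match(path) for p in STAR_GROUP_DISALLOW)


# Sample paths are illustrative only.
for sample in ["/files/report.pdf", "/downloads/rules.zip",
               "/downloads/rules.zip?x=1", "/schedule"]:
    print(sample, is_disallowed(sample))
```

Because "/*.zip$" is anchored, a URL such as "/downloads/rules.zip?x=1" is not covered by it, whereas the unanchored "/printable_gamesheet.php" rule also covers any query string appended to that path.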

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.