clsmba.ca
robots.txt

Robots Exclusion Standard data for clsmba.ca

Resource Scan

Scan Details

Site Domain clsmba.ca
Base Domain clsmba.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-04-08T15:20:49+00:00
Next Scan 2024-07-07T15:20:49+00:00

Last Successful Scan

Scanned2022-05-09T00:18:54+00:00
URL http://clsmba.ca/robots.txt
Response IP 23.21.248.137
Found Yes
Hash 0a5d0bfb7b4d46427b48b87637a283e7977b969a821f9565edfd3dfad17c2e0f
SimHash 8e44d400c561

Groups

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow /sports-management-software/signup.php*

linguee

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

nutch

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

*

Rule Path
Disallow /images/gla/
Disallow /printable_gamesheet.php
Disallow /printable_gamesheet_landscape.php*
Disallow /printable_gamesheet_cmsa.php
Disallow /*.wmv$
Disallow /*.zip$
Disallow /files/
Disallow /mobile/

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.