site1759.goalline.ca
robots.txt

Robots Exclusion Standard data for site1759.goalline.ca

Resource Scan

Scan Details

Site Domain site1759.goalline.ca
Base Domain goalline.ca
Scan Status Ok
Last Scan 2024-05-04T01:01:29+00:00
Next Scan 2024-06-03T01:01:29+00:00

Last Scan

Scanned 2024-05-04T01:01:29+00:00
URL https://site1759.goalline.ca/robots.txt
Domain IPs 23.21.248.137, 54.243.53.0
Response IP 54.243.53.0
Found Yes
Hash 0a5d0bfb7b4d46427b48b87637a283e7977b969a821f9565edfd3dfad17c2e0f
SimHash 8e44d400c561
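
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a minimal sketch for checking whether a freshly fetched robots.txt still matches the scanned copy could look like this. The helper names (body_digest, matches_scan) are hypothetical, and the fetch itself is left to the caller.

```python
import hashlib

# Hash value reported for the 2024-05-04 scan (copied from the report above).
SCANNED_HASH = "0a5d0bfb7b4d46427b48b87637a283e7977b969a821f9565edfd3dfad17c2e0f"

def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the report
    does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()

def matches_scan(body: bytes, expected: str = SCANNED_HASH) -> bool:
    """True if the body hashes to the digest recorded by the scan."""
    return body_digest(body) == expected.lower()
```

If the digests differ, either the file has changed since the scan or the scanner uses a different algorithm or normalization than assumed here.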

Groups

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow /sports-management-software/signup.php*

linguee

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

nutch

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

*

Rule Path
Disallow /images/gla/
Disallow /printable_gamesheet.php
Disallow /printable_gamesheet_landscape.php*
Disallow /printable_gamesheet_cmsa.php
Disallow /*.wmv$
Disallow /*.zip$
Disallow /files/
Disallow /mobile/
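
Several rules in the * group use the * wildcard and the $ end-of-path anchor, which Python's standard urllib.robotparser does not interpret. The following is a minimal sketch of how such patterns could be evaluated, assuming the common convention that * matches any run of characters, $ anchors the end of the URL path, and every other pattern is a prefix match. The function names are hypothetical. Because this group has no Allow rules, no longest-match precedence is needed.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW_PATTERNS = [
    "/images/gla/",
    "/printable_gamesheet.php",
    "/printable_gamesheet_landscape.php*",
    "/printable_gamesheet_cmsa.php",
    "/*.wmv$",
    "/*.zip$",
    "/files/",
    "/mobile/",
]

def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text, then join the pieces with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # Without "$" the pattern is a prefix match; with it, the path must end there.
    return re.compile("^" + body + (r"\Z" if anchored else ""))

def is_disallowed(path, patterns=DISALLOW_PATTERNS):
    """True if any Disallow pattern matches the given URL path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, is_disallowed("/files/report.pdf") and is_disallowed("/downloads/archive.zip") are both true under these assumptions, while is_disallowed("/schedule.php") is false.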

Other Records

Field Value
crawl-delay 10
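
The crawl-delay field is not part of the Robots Exclusion Protocol as standardized in RFC 9309, and crawlers differ in whether they honor it. A crawler that chooses to respect the value above could space its requests at least 10 seconds apart. The sketch below is one way to do that; the class name and its injectable clock and sleep parameters are hypothetical and exist only to make the behavior easy to test.

```python
import time

class CrawlDelayThrottle:
    """Enforce a minimum pause between successive requests to one host."""

    def __init__(self, delay_seconds=10.0, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Block until the delay has elapsed; return the seconds slept."""
        pause = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            pause = max(0.0, self.delay - elapsed)
            if pause > 0:
                self._sleep(pause)
        self._last = self._clock()
        return pause
```

Calling wait() before each request keeps the crawler at or below one request per 10 seconds without sleeping longer than necessary.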

Warnings

  • 2 invalid lines.