sports-central.org
robots.txt

Robots Exclusion Standard data for sports-central.org

Resource Scan

Scan Details

Site Domain sports-central.org
Base Domain sports-central.org
Scan Status Ok
Last Scan2024-10-12T19:15:26+00:00
Next Scan 2024-10-19T19:15:26+00:00

Last Scan

Scanned2024-10-12T19:15:26+00:00
URL https://sports-central.org/robots.txt
Redirect https://www.sports-central.org/robots.txt
Redirect Domain www.sports-central.org
Redirect Base sports-central.org
Domain IPs 104.21.73.39, 172.67.140.27, 2606:4700:3031::6815:4927, 2606:4700:3033::ac43:8c1b
Redirect IPs 104.21.73.39, 172.67.140.27, 2606:4700:3031::6815:4927, 2606:4700:3033::ac43:8c1b
Response IP 172.67.140.27
Found Yes
Hash 3e4162b2429c4d0dbdae1e52c03bba1f7f91d68c592c4a5ab189dd5acaa824de
SimHash c2cedb469213

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /scripts/adserver/

*

Rule Path
Disallow /scripts/openxads/

*

Rule Path
Disallow /images/
Disallow /photos/

jetbot/1.0

Rule Path
Disallow /

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

yandex

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

sch-fast-se-crawl02.osl.basefarm.net

Rule Path
Disallow /

sch-fast-se-crawl04.osl.basefarm.net

Rule Path
Disallow /

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Comments

  • disallow OSL.basefarm.net

Warnings

  • 1 invalid line.