athletics.hn.psu.edu
robots.txt

Robots Exclusion Standard data for athletics.hn.psu.edu

Resource Scan

Scan Details

Site Domain athletics.hn.psu.edu
Base Domain psu.edu
Scan Status Ok
Last Scan2024-06-19T11:17:07+00:00
Next Scan 2024-07-03T11:17:07+00:00

Last Scan

Scanned2024-06-19T11:17:07+00:00
URL http://athletics.hn.psu.edu/robots.txt
Domain IPs 34.215.18.43, 35.165.68.27, 50.112.22.155, 52.40.168.138
Response IP 50.112.22.155
Found Yes
Hash 5384c90a8f1903a1e18b6ec686452644462cf557df08cadced82f4127e1e6a72
SimHash 6875d002ceb9

Groups

american-univ-crawler (enterprise; s5-dwrrj5kwb2naa; nguyen@american.edu)

Rule Path
Disallow /

cstv search crawler

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /_private/
Disallow /_vti_bin/
Disallow /_vti_cnf/
Disallow /_vti_log/
Disallow /_vti_pvt/
Disallow /_vti_txt/
Disallow /reports/
Disallow /admin/
Disallow /action/

Other Records

Field Value
crawl-delay 5

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

twitterbot

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baidu

Rule Path
Disallow /

Comments

  • Managed by PrestoSports sysadmin@prestosports.com