www.commmedia.psu.edu
robots.txt

Robots Exclusion Standard data for www.commmedia.psu.edu

Resource Scan

Scan Details

Site Domain www.commmedia.psu.edu
Base Domain psu.edu
Scan Status Ok
Last Scan2024-05-19T14:48:52+00:00
Next Scan 2024-06-18T14:48:52+00:00

Last Scan

Scanned2024-05-19T14:48:52+00:00
URL https://www.commmedia.psu.edu/robots.txt
Domain IPs 44.210.17.14, 44.217.26.174
Response IP 44.217.26.174
Found Yes
Hash 5e7ea7be1c633e5d0d6cd9a63470b435c4709766ee5f55d08c10c733359c4dd9
SimHash 6f1a6200cedb

Groups

seekport crawler

Rule Path
Disallow /

*

Rule Path
Disallow *.axd
Disallow /cgi-bin/
Disallow /member

Other Records

Field Value
crawl-delay 30

bingbot

Rule Path
Disallow /