www.uphs.upenn.edu
robots.txt

Robots Exclusion Standard data for www.uphs.upenn.edu

Resource Scan

Scan Details

Site Domain www.uphs.upenn.edu
Base Domain upenn.edu
Scan Status Ok
Last Scan2025-12-06T09:13:59+00:00
Next Scan 2026-01-05T09:13:59+00:00

Last Scan

Scanned2025-12-06T09:13:59+00:00
URL https://www.uphs.upenn.edu/robots.txt
Domain IPs 45.60.75.182
Response IP 45.60.75.182
Found Yes
Hash 4f8888d3c2ea9f7809a6691eabf7f626bffdf53ccab09694be29894fd0e33d08
SimHash ab4a536484d9

Groups

*

Rule Path
Disallow /abc/
Disallow /addiction2/
Disallow /antibiotics/
Disallow /autosuggest
Disallow /barryg/
Disallow /cgi-bin/
Disallow /chowh/
Disallow /danssite/
Disallow /dept/
Disallow /employeeselfservice/
Disallow /encyclopedia/
Disallow /fritze/
Disallow /heart/
Disallow /includes/
Disallow /includes_shared/
Disallow /jobsj/
Disallow /jon2/
Disallow /jonsite9/
Disallow /jontest2/
Disallow /kirkest/
Disallow /lehmann/
Disallow /luih/
Disallow /lung/
Disallow /mcadamsd/
Disallow /mcadamsd2/
Disallow /mcadamsd3/
Disallow /personalized-diagnostics/services.html
Disallow /rescue/
Disallow /strategic-plan/
Disallow /tavaresh/
Disallow /testdgim/
Disallow /testinternet/
Disallow /testneuro/
Disallow /testsites/
Disallow /wagtemplate/
Disallow /web-requests/
Disallow /webhelp/
Disallow /webresource/
Disallow /wernej/

Warnings

  • 1 invalid line.