www-scf.usc.edu
robots.txt

Robots Exclusion Standard data for www-scf.usc.edu

Resource Scan

Scan Details

Site Domain www-scf.usc.edu
Base Domain usc.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-08T08:27:10+00:00
Next Scan 2025-12-07T08:27:10+00:00

Last Successful Scan

Scanned2023-04-26T21:18:50+00:00
URL https://www-scf.usc.edu/robots.txt
Domain IPs 68.181.201.23
Response IP 68.181.201.23
Found Yes
Hash 616847a09eaf343ed5f6573b124cff007a81b29657cfe21717aa941d5da90c31
SimHash e880ab68e473

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /RCS/
Disallow /~millilgan/

vdkwebi

Rule Path
Disallow /lib/

Comments

  • /robots.txt for http://www-scf.usc.edu/