www.cs.usc.edu
robots.txt

Robots Exclusion Standard data for www.cs.usc.edu

Resource Scan

Scan Details

Site Domain www.cs.usc.edu
Base Domain usc.edu
Scan Status Ok
Last Scan 2025-08-02T09:33:42+00:00
Next Scan 2025-09-01T09:33:42+00:00

Last Scan

Scanned 2025-08-02T09:33:42+00:00
URL https://www.cs.usc.edu/robots.txt
Domain IPs 141.193.213.10, 141.193.213.11
Response IP 141.193.213.10
Found Yes
Hash c74eb116d94b0d1e763ead62f050b9373fef952c34ae6bcc320c127bb4449a3a
SimHash 486cc880e49a
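
The Hash value above is 64 hexadecimal characters, which matches the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 of the response body is an assumption. Under that assumption, a minimal sketch of how such a fingerprint could be computed and compared between scans (the `fingerprint` helper and the sample bodies are hypothetical):

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


# Illustrative bodies only, not the file fetched in this scan.
previous = fingerprint(b"User-agent: *\nDisallow:\n")
current = fingerprint(b"User-agent: *\nDisallow: /private/\n")

# Any byte-level difference changes the digest, signalling a changed file.
changed = previous != current
```

An exact digest only answers "did the bytes change"; it says nothing about how similar two versions are.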

Groups

*

Rule Path
Disallow (empty)

Other Records

Field Value
sitemap https://www.cs.usc.edu/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK
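
The group, sitemap record, and comments listed above are consistent with a file like the one reconstructed below. This is a sketch, not the fetched file: the report does not show the exact line order or the text of the one invalid line, so `ROBOTS_TXT` is an assumption. It uses Python's standard `urllib.robotparser` to illustrate that an empty `Disallow` path in the `*` group excludes nothing.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the report's fields; the real file's
# line order and its one invalid line are not given in the scan.
ROBOTS_TXT = """\
# START YOAST BLOCK
# ---------------------------
User-agent: *
Disallow:

Sitemap: https://www.cs.usc.edu/sitemap_index.xml
# ---------------------------
# END YOAST BLOCK
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# An empty Disallow value excludes no paths, so any URL is fetchable.
allowed = parser.can_fetch("*", "https://www.cs.usc.edu/any/page")

# site_maps() (Python 3.8+) returns the Sitemap records, or None if absent.
sitemaps = parser.site_maps()

print(allowed, sitemaps)
```

Under these assumptions, every crawler falls into the `*` group and no path is excluded, which matches the single `Disallow` rule with an empty path shown under Groups.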

Warnings

  • 1 invalid line.