csuchico.edu
robots.txt

Robots Exclusion Standard data for csuchico.edu

Resource Scan

Scan Details

Site Domain csuchico.edu
Base Domain csuchico.edu
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-08-22T19:04:57+00:00
Next Scan 2024-11-20T19:04:57+00:00

Last Successful Scan

Scanned2023-10-05T15:43:25+00:00
URL https://www.csuchico.edu/robots.txt
Domain IPs 52.143.79.173
Response IP 52.143.79.173
Found Yes
Hash 6e9ef444db32a11732ff8c0e6c132409204342103e7db969f0bb67379bd958b5
SimHash 2f0d8f420097

Groups

*

Rule Path Comment
Disallow /cwis/ This is local info only
Disallow /index/ This is local info only
Disallow /catalog/cat93/ -
Disallow /catalog/cat95/ -
Disallow /catalog/cat97/ -
Disallow /catalog/cat99/ -
Disallow /catalog/cat01/ -
Disallow /stcp/backup/ -
Disallow /stcp/gfx/ -
Disallow /stcp/inc/ -
Disallow /stcp/labrat/ -
Disallow /stcp/private/ -
Disallow /stcp/style/ -
Disallow /photos_index/old_pics -
Disallow /newweb/ -
Disallow /gst/cgi-bin/ -
Disallow /~kschenk/cgi-bin/ -
Disallow /attic/ -
Disallow /directory/students/ -
Disallow /url/ -
Disallow /inf/ -
Disallow /manual/ -
Disallow /pa/2008/04/ -
Disallow /recsports/sports_clubs -
Disallow /as/ -
Disallow /count.d/ -
Disallow /cms/ -
Disallow /phieta/ -
Disallow /portal-content/ -
Disallow /mbainfo/ -
Disallow /cob/mba-join-us/ -
Disallow /cob/join-us/ -
Disallow /jose-land/ -
Disallow /jordan-land/ -
Disallow /megan-land/ -
Disallow /scott-land/ -
Disallow /francie-land/ -

sitebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.csuchico.edu/sitemap.xml

Comments

  • robots.txt for https://www.csuchico.edu/