www.pierce.ctc.edu
robots.txt

Robots Exclusion Standard data for www.pierce.ctc.edu

Resource Scan

Scan Details

Site Domain www.pierce.ctc.edu
Base Domain ctc.edu
Scan Status Ok
Last Scan2025-08-31T12:51:03+00:00
Next Scan 2025-09-30T12:51:03+00:00

Last Scan

Scanned2025-08-31T12:51:03+00:00
URL https://www.pierce.ctc.edu/robots.txt
Domain IPs 3.209.26.193, 34.236.193.193, 52.73.2.219
Response IP 34.236.193.193
Found Yes
Hash b7de49374bac6cec896cacdb51c844b482e6843e95ce143e036809bfc9db329a
SimHash 69091552d7c6

Groups

thunderstonesa

Rule Path
Disallow

*

Rule Path
Disallow /_resources/
Disallow /_dev/
Disallow /_showcase/
Disallow /documents/

Other Records

Field Value
sitemap https://www.pierce.ctc.com/sitemap.xml

Comments

  • allow modern campus cms search to index the site
  • protected resources