collin.edu
robots.txt

Robots Exclusion Standard data for collin.edu

Resource Scan

Scan Details

Site Domain collin.edu
Base Domain collin.edu
Scan Status Ok
Last Scan 2025-12-26T21:51:01+00:00
Next Scan 2026-01-25T21:51:01+00:00

Last Scan

Scanned 2025-12-26T21:51:01+00:00
URL https://collin.edu/robots.txt
Response IP 3.209.26.193
Found Yes
Hash 0728ab49eb1193854d77342678cb41bc81b6a1bcf4473e3e3b5a6ae6df3b7de8
SimHash 73245a5881d0

Groups

*

Rule Path
Disallow /_dev
Disallow /_resources
Disallow /_qa
Disallow /_acalog_test
Disallow /_showcase
Disallow /_uat
Disallow /*_nav.inc
Disallow /*_nav.ounav
Disallow /*_props.html
Allow /sitemap.xml
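
The rules above can be checked against a URL path with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), treats `*` as "any run of characters", and uses hypothetical helper names (`pattern_to_regex`, `is_allowed`). Python's standard `urllib.robotparser` is not used here because it does not expand `*` wildcards inside paths.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/_dev"),
    ("disallow", "/_resources"),
    ("disallow", "/_qa"),
    ("disallow", "/_acalog_test"),
    ("disallow", "/_showcase"),
    ("disallow", "/_uat"),
    ("disallow", "/*_nav.inc"),
    ("disallow", "/*_nav.ounav"),
    ("disallow", "/*_props.html"),
    ("allow", "/sitemap.xml"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: the longest matching pattern wins; on a tie,
    'allow' wins. A path matched by no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, giving prefix semantics.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/_dev/index.html")` and `is_allowed("/about/_nav.inc")` are both false, while `is_allowed("/sitemap.xml")` and `is_allowed("/admissions/")` are true. Because the directory rules have no trailing slash, they are plain prefixes, so a path such as `/_developers` would also be blocked by `/_dev`.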

Other Records

Field Value
sitemap https://www.collin.edu/sitemap.xml

Comments

  • Blocks all robots (the * group) from specific directories and from files matching wildcard path patterns