kccd.edu
robots.txt

Robots Exclusion Standard data for kccd.edu

Resource Scan

Scan Details

Site Domain kccd.edu
Base Domain kccd.edu
Scan Status Ok
Last Scan2024-11-18T15:11:28+00:00
Next Scan 2024-12-18T15:11:28+00:00

Last Scan

Scanned2024-11-18T15:11:28+00:00
URL https://kccd.edu/robots.txt
Redirect https://www.kccd.edu/robots.txt
Redirect Domain www.kccd.edu
Redirect Base kccd.edu
Domain IPs 199.83.129.33, 199.83.131.33
Redirect IPs 103.28.249.33
Response IP 103.28.249.33
Found Yes
Hash be16ce07e463957072b3425f4fde534eb362735231910ee404cdba8925b7f1c6
SimHash 2a07d8b48412

Groups

*

Rule Path
Disallow /_kccd-debug/
Disallow /_sample_migration/
Disallow /_qa/
Disallow /_resources/
Disallow /_showcase/
Disallow /include-test/

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.kccd.edu/sitemap.xml