cpec.org
robots.txt

Robots Exclusion Standard data for cpec.org

Resource Scan

Scan Details

Site Domain cpec.org
Base Domain cpec.org
Scan Status Ok
Last Scan 2025-09-13T19:43:24+00:00
Next Scan 2025-10-13T19:43:24+00:00

Last Scan

Scanned 2025-09-13T19:43:24+00:00
URL https://cpec.org/robots.txt
Redirect https://www.cpec.org/robots.txt
Redirect Domain www.cpec.org
Redirect Base cpec.org
Domain IPs 199.34.228.70
Redirect IPs 199.34.228.70
Response IP 199.34.228.70
Found Yes
Hash dfe67684d8eb3757c1c9cb956ac1958a2e83ba4640e4f01255bf6c71d3f60e1f
SimHash 7640d0703b13
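
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch, assuming the digest is SHA-256 over the raw response body, a later fetch could be compared against the recorded value like this. The names `RECORDED_HASH`, `content_digest`, and `has_changed` are hypothetical and exist only for illustration.

```python
import hashlib

# Hash value copied from the scan record above.
# ASSUMPTION: it is a SHA-256 digest of the raw robots.txt body;
# the report does not confirm the algorithm.
RECORDED_HASH = "dfe67684d8eb3757c1c9cb956ac1958a2e83ba4640e4f01255bf6c71d3f60e1f"


def content_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True when the digest of a freshly fetched body differs from the recorded one."""
    return content_digest(body) != recorded.lower()
```

If the assumption holds, `has_changed` returning `False` for a new fetch would mean the file is byte-for-byte identical to the scanned copy. The SimHash field serves a different purpose (near-duplicate detection) and is not covered by this sketch.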

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /staff-toolbox.html
Disallow /2021-2022-pictures.html
Disallow /2020-2021-pictures.html
Disallow /pictures-2019-2020.html
Disallow /2018-2019-photos.html
Disallow /2017-2018-photos.html
Disallow /https%3A//outlook.office365.com
Disallow /accidents.html
Disallow /new-employee-training.html
Disallow /the-reality-of-food-allergies.html
Disallow /suicide-prevention-training.html
Disallow /it-support-request.html
Disallow /wellness.html
Disallow /january-2021-inservice.html
Disallow /february-2021-inservice.html
Disallow /march-2021-inservice.html

Other Records

Field Value
sitemap https://www.cpec.org/sitemap.xml
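
The groups listed above can be checked with Python's standard `urllib.robotparser`. The following is a sketch only: `ROBOTS_TXT` is a reconstruction from the parsed groups in this report, not the original file (its exact layout is not shown here), and the wildcard group is abbreviated to three of its rules. The names `load_rules`, `BASE`, and the user agent `examplebot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the groups in this report, not the real file.
# The "*" group is abbreviated to three of the listed Disallow rules.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /staff-toolbox.html

Sitemap: https://www.cpec.org/sitemap.xml
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


BASE = "https://www.cpec.org"
rules = load_rules(ROBOTS_TXT)

if __name__ == "__main__":
    # nerdybot is blocked from the whole site.
    print(rules.can_fetch("nerdybot", BASE + "/"))
    # dotbot matches its own group, which has no path rules, only a crawl delay.
    print(rules.can_fetch("dotbot", BASE + "/ajax/"), rules.crawl_delay("dotbot"))
    # Any other crawler falls back to the "*" group.
    print(rules.can_fetch("examplebot", BASE + "/staff-toolbox.html"))
    print(rules.site_maps())
```

Note that a crawler matching a named group uses only that group, so in this sketch dotbot is not bound by the Disallow rules of the wildcard group.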