cuny.edu
robots.txt

Robots Exclusion Standard data for cuny.edu

Resource Scan

Scan Details

Site Domain cuny.edu
Base Domain cuny.edu
Scan Status Ok
Last Scan2024-05-18T10:28:31+00:00
Next Scan 2024-06-17T10:28:31+00:00

Last Scan

Scanned2024-05-18T10:28:31+00:00
URL https://cuny.edu/robots.txt
Domain IPs 128.228.254.200
Response IP 128.228.254.200
Found Yes
Hash 24aaa4ca0f27a8042523fc8ffc36273a84c893aad0b9260598a8c1f049916acf
SimHash 635c50154523

Groups

googlebot
bingbot
msnbot
twitterbot
slurp
duckduckbot

Rule Path
Disallow /global-components/
Disallow /d-i/
Disallow /cgi-bin/
Disallow /search/
Disallow /wp-admin
Disallow /alumni-test/
Disallow /role-template/
Disallow /home-preview/
Disallow /about/administration/offices/sa/
Disallow /homepage/digital-displays/
Disallow /employment/search-jobs/
Disallow /old-version/
Disallow /policyimport/

Other Records

Field Value
crawl-delay 30

semanticscholarbot

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cuny.edu/sitemap.xml
sitemap https://policy.cuny.edu/sitemap.xml