cuny.edu
robots.txt
Robots Exclusion Standard data for cuny.edu
Resource Scan
Scan Details
Site Domain | cuny.edu |
Base Domain | cuny.edu |
Scan Status | Ok |
Last Scan | 2024-05-18T10:28:31+00:00 |
Next Scan | 2024-06-17T10:28:31+00:00 |
Last Scan
Scanned | 2024-05-18T10:28:31+00:00 |
URL | https://cuny.edu/robots.txt |
Domain IPs | 128.228.254.200 |
Response IP | 128.228.254.200 |
Found | Yes |
Hash | 24aaa4ca0f27a8042523fc8ffc36273a84c893aad0b9260598a8c1f049916acf |
SimHash | 635c50154523 |
Groups
googlebot
bingbot
msnbot
twitterbot
slurp
duckduckbot
Rule | Path |
---|---|
Disallow | /global-components/ |
Disallow | /d-i/ |
Disallow | /cgi-bin/ |
Disallow | /search/ |
Disallow | /wp-admin |
Disallow | /alumni-test/ |
Disallow | /role-template/ |
Disallow | /home-preview/ |
Disallow | /about/administration/offices/sa/ |
Disallow | /homepage/digital-displays/ |
Disallow | /employment/search-jobs/ |
Disallow | /old-version/ |
Disallow | /policyimport/ |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
*
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.cuny.edu/sitemap.xml |
sitemap | https://policy.cuny.edu/sitemap.xml |