www.cs.columbia.edu
robots.txt

Robots Exclusion Standard data for www.cs.columbia.edu

Resource Scan

Scanned	2025-04-18T06:09:31+00:00
URL	https://www.cs.columbia.edu/robots.txt
Domain IPs	128.59.11.206
Response IP	128.59.11.206
Found	Yes
Hash	ad46993b25f01113f1f071244450fcd6927c0c853dc7a0a8b5f7789e540f0c40
SimHash	410154604793

Rule	Path
Allow	/

Rule	Path
Disallow	/wp/wp-admin/
Allow	/wp/wp-admin/admin-ajax.php
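
The Disallow/Allow pair above overlaps: the allowed path sits under the disallowed directory. A minimal sketch of how a crawler might resolve this, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The function name `is_allowed` and the `WP_RULES` list are hypothetical, introduced only for illustration. Wildcards (`*`, `$`) and user-agent groups are not handled, since the scan does not show which user agents these rules apply to.

```python
def is_allowed(path, rules):
    """Return True if `path` may be crawled under `rules`.

    `rules` is a list of (kind, prefix) tuples, where kind is
    "allow" or "disallow". Plain prefix matching only (an assumption
    of this sketch; real robots.txt rules may contain wildcards).
    """
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # Longer prefix wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


# Rules copied from the scan above (hypothetical container name).
WP_RULES = [
    ("disallow", "/wp/wp-admin/"),
    ("allow", "/wp/wp-admin/admin-ajax.php"),
]

if __name__ == "__main__":
    for p in ("/wp/wp-admin/admin-ajax.php", "/wp/wp-admin/options.php", "/wp/"):
        print(p, is_allowed(p, WP_RULES))
```

Not every parser resolves overlapping rules this way; some apply the first matching rule in file order, which would block `admin-ajax.php` here because the Disallow line comes first.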

Field	Value
sitemap	https://www.cs.columbia.edu/wp-sitemap.xml
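
The sitemap field above comes from a `Sitemap:` line in the robots.txt file. A minimal sketch of extracting such lines from raw robots.txt text follows. The function name `extract_sitemaps` is hypothetical, and `SAMPLE` is a reconstruction from the scan data, not the verbatim file: in particular the `User-agent: *` lines are an assumption, since the scan does not list user agents.

```python
def extract_sitemaps(robots_txt):
    """Collect the values of all Sitemap lines in robots.txt text."""
    sitemaps = []
    for line in robots_txt.splitlines():
        # Drop trailing comments, then split on the first colon only,
        # so the colon inside "https://" stays in the value.
        line = line.split("#", 1)[0].strip()
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps


# Reconstructed sample; the User-agent lines are assumed.
SAMPLE = """\
User-agent: *
Allow: /

User-agent: *
Disallow: /wp/wp-admin/
Allow: /wp/wp-admin/admin-ajax.php

Sitemap: https://www.cs.columbia.edu/wp-sitemap.xml
"""

if __name__ == "__main__":
    print(extract_sitemaps(SAMPLE))
```

The field name is matched case-insensitively, so `sitemap:` and `Sitemap:` are treated the same.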
