Robots Exclusion Standard data for www.cs.columbia.edu

Resource Scan

Scan Details

Site Domain www.cs.columbia.edu
Base Domain columbia.edu
Scan Status Ok
Last Scan 2025-04-18T06:09:31+00:00
Next Scan 2025-05-18T06:09:31+00:00

Last Scan

Scanned 2025-04-18T06:09:31+00:00
URL https://www.cs.columbia.edu/robots.txt
Domain IPs 128.59.11.206
Response IP 128.59.11.206
Found Yes
Hash ad46993b25f01113f1f071244450fcd6927c0c853dc7a0a8b5f7789e540f0c40
SimHash 410154604793
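A content hash like the one above lets a scanner detect whether robots.txt changed between scans. The report does not say which algorithm produced it; the 64 hex digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `body_hash` is hypothetical and not part of the scanner.

```python
import hashlib


def body_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256. The report
    only shows a 64-hex-digit value and does not name the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored hash against a freshly fetched body."""
    return body_hash(body) != previous_hash
```

Comparing the stored value with the digest of a newly fetched body is enough to decide whether the rules need to be re-parsed. The SimHash field is a separate similarity fingerprint and is not reproduced here.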

Groups

termlybot

Rule Path
Allow /

*

Rule Path
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php
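
The two groups above can be evaluated with the longest-match rule described in RFC 9309: the group whose user-agent token matches the crawler is selected (falling back to `*`), and within that group the rule with the longest matching path wins, with Allow winning ties. The following is a minimal sketch under those assumptions. It handles plain path prefixes only (no `*` or `$` wildcards, which these rules do not use), and the names `GROUPS`, `select_group`, and `is_allowed` are illustrative, not taken from the scanner.

```python
from typing import Dict, List, Tuple

Rule = Tuple[str, str]  # (kind, path prefix), kind is "allow" or "disallow"

# Rules transcribed from the Groups listing in this report.
GROUPS: Dict[str, List[Rule]] = {
    "termlybot": [("allow", "/")],
    "*": [
        ("disallow", "/wp/wp-admin/"),
        ("allow", "/wp/wp-admin/admin-ajax.php"),
    ],
}


def select_group(user_agent: str) -> List[Rule]:
    """Pick the group whose token appears in the user-agent, else '*'."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS["*"]


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in select_group(user_agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under this sketch, a generic crawler is blocked from `/wp/wp-admin/` but may fetch `/wp/wp-admin/admin-ajax.php`, while a crawler identifying as termlybot may fetch any path. Parsers that apply the first matching rule instead of the longest one can give a different answer for `admin-ajax.php`, since the Disallow line comes first in this group.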

Other Records

Field Value
sitemap https://www.cs.columbia.edu/wp-sitemap.xml

Comments

  • Termly scanner