tc.edu
robots.txt
Robots Exclusion Standard data for tc.edu
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | tc.edu |
| Base Domain | tc.edu |
| Scan Status | Ok |
| Last Scan | 2025-10-23T04:01:09+00:00 |
| Next Scan | 2025-11-22T04:01:09+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-10-23T04:01:09+00:00 |
| URL | http://tc.edu/robots.txt |
| Redirect | https://www.tc.columbia.edu/robots.txt |
| Redirect Domain | www.tc.columbia.edu |
| Redirect Base | columbia.edu |
| Domain IPs | 34.196.195.67 |
| Redirect IPs | 3.215.252.197, 44.213.234.56, 98.86.125.226 |
| Response IP | 44.213.234.56 |
| Found | Yes |
| Hash | 25982ad93a870b73b1b0688b2afa74bf20c0460e40516a2b871ffab056cf0c0a |
| SimHash | 89b01c8d2792 |
Groups
googlebot
| Rule | Path |
|---|---|
| Disallow | /i/a/* |
| Disallow | /*/includes |
| Disallow | /*/scripts |
| Disallow | /*/styles |
| Disallow | /pulled-content |
| Disallow | /*/pulled-content |
| Disallow | /*/embeds |
*
| Rule | Path |
|---|---|
| Disallow | /i/a/* |
| Disallow | /*/includes |
| Disallow | /*/scripts |
| Disallow | /*/styles |
| Disallow | /pulled-content |
| Disallow | /*/pulled-content |
| Disallow | /*/embeds |
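The wildcard rules above can be checked against a concrete URL path with a short sketch. It assumes the pattern semantics described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, and a rule otherwise matches as a prefix. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are hypothetical helpers for illustration and are not part of the scan data. Python's built-in `urllib.robotparser` is not used here because it does not interpret `*` wildcards.

```python
import re

# Disallow paths copied from the groups above (identical for googlebot and *).
DISALLOW = [
    "/i/a/*",
    "/*/includes",
    "/*/scripts",
    "/*/styles",
    "/pulled-content",
    "/*/pulled-content",
    "/*/embeds",
]


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex (assumed RFC 9309 semantics)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches the path from its start."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for sample in ["/i/a/photo.jpg", "/about/scripts/app.js", "/includes", "/academics/"]:
        print(sample, is_disallowed(sample))
```

Since this file contains only Disallow rules, the sketch skips the longest-match precedence between Allow and Disallow that a full parser would need. Note that `/*/includes` requires at least one path segment before `/includes`, so a top-level `/includes` path is not blocked under this reading.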