t4dev.tc.columbia.edu
robots.txt
Robots Exclusion Standard data for t4dev.tc.columbia.edu
Resource Scan
Scan Details
| Site Domain | t4dev.tc.columbia.edu |
| Base Domain | columbia.edu |
| Scan Status | Ok |
| Last Scan | 2025-10-13T10:09:05+00:00 |
| Next Scan | 2025-11-12T10:09:05+00:00 |
Last Scan
| Scanned | 2025-10-13T10:09:05+00:00 |
| URL | https://t4dev.tc.columbia.edu/robots.txt |
| Domain IPs | 52.44.155.110 |
| Response IP | 52.44.155.110 |
| Found | Yes |
| Hash | 25982ad93a870b73b1b0688b2afa74bf20c0460e40516a2b871ffab056cf0c0a |
| SimHash | 89b01c8d2792 |
Groups
googlebot
| Rule | Path |
|---|---|
| Disallow | /i/a/* |
| Disallow | /*/includes |
| Disallow | /*/scripts |
| Disallow | /*/styles |
| Disallow | /pulled-content |
| Disallow | /*/pulled-content |
| Disallow | /*/embeds |
*
| Rule | Path |
|---|---|
| Disallow | /i/a/* |
| Disallow | /*/includes |
| Disallow | /*/scripts |
| Disallow | /*/styles |
| Disallow | /pulled-content |
| Disallow | /*/pulled-content |
| Disallow | /*/embeds |
Comments