t4dev.tc.columbia.edu
robots.txt

Robots Exclusion Standard data for t4dev.tc.columbia.edu

Resource Scan

Scan Details

Site Domain t4dev.tc.columbia.edu
Base Domain columbia.edu
Scan Status Ok
Last Scan2025-10-13T10:09:05+00:00
Next Scan 2025-11-12T10:09:05+00:00

Last Scan

Scanned2025-10-13T10:09:05+00:00
URL https://t4dev.tc.columbia.edu/robots.txt
Domain IPs 52.44.155.110
Response IP 52.44.155.110
Found Yes
Hash 25982ad93a870b73b1b0688b2afa74bf20c0460e40516a2b871ffab056cf0c0a
SimHash 89b01c8d2792

Groups

googlebot

Rule Path
Disallow /i/a/*
Disallow /*/includes
Disallow /*/scripts
Disallow /*/styles
Disallow /pulled-content
Disallow /*/pulled-content
Disallow /*/embeds

*

Rule Path
Disallow /i/a/*
Disallow /*/includes
Disallow /*/scripts
Disallow /*/styles
Disallow /pulled-content
Disallow /*/pulled-content
Disallow /*/embeds

Comments

  • Robots.txt file