tc.edu
robots.txt

Robots Exclusion Standard data for tc.edu

Resource Scan

Scan Details

Site Domain tc.edu
Base Domain tc.edu
Scan Status Ok
Last Scan2025-10-23T04:01:09+00:00
Next Scan 2025-11-22T04:01:09+00:00

Last Scan

Scanned2025-10-23T04:01:09+00:00
URL http://tc.edu/robots.txt
Redirect https://www.tc.columbia.edu/robots.txt
Redirect Domain www.tc.columbia.edu
Redirect Base columbia.edu
Domain IPs 34.196.195.67
Redirect IPs 3.215.252.197, 44.213.234.56, 98.86.125.226
Response IP 44.213.234.56
Found Yes
Hash 25982ad93a870b73b1b0688b2afa74bf20c0460e40516a2b871ffab056cf0c0a
SimHash 89b01c8d2792

Groups

googlebot

Rule Path
Disallow /i/a/*
Disallow /*/includes
Disallow /*/scripts
Disallow /*/styles
Disallow /pulled-content
Disallow /*/pulled-content
Disallow /*/embeds

*

Rule Path
Disallow /i/a/*
Disallow /*/includes
Disallow /*/scripts
Disallow /*/styles
Disallow /pulled-content
Disallow /*/pulled-content
Disallow /*/embeds

Comments

  • Robots.txt file