tc.columbia.edu
robots.txt

Robots Exclusion Standard data for tc.columbia.edu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	tc.columbia.edu
Base Domain	columbia.edu
Scan Status	Ok
Last Scan	2025-10-05T23:30:56+00:00
Next Scan	2025-11-04T23:30:56+00:00

Last Scan

Scanned	2025-10-05T23:30:56+00:00
URL	http://tc.columbia.edu/robots.txt
Redirect	https://www.tc.columbia.edu/robots.txt
Redirect Domain	www.tc.columbia.edu
Redirect Base	columbia.edu
Domain IPs	34.196.195.67
Redirect IPs	34.237.188.241, 52.73.63.111, 54.92.134.63
Response IP	54.92.134.63
Found	Yes
Hash	25982ad93a870b73b1b0688b2afa74bf20c0460e40516a2b871ffab056cf0c0a
SimHash	89b01c8d2792

Groups

googlebot

Rule	Path
Disallow	/i/a/*
Disallow	/*/includes
Disallow	/*/scripts
Disallow	/*/styles
Disallow	/pulled-content
Disallow	/*/pulled-content
Disallow	/*/embeds

Rule

Path

Disallow

/i/a/*

Disallow

/*/includes

Disallow

/*/scripts

Disallow

/*/styles

Disallow

/pulled-content

Disallow

/*/pulled-content

Disallow

/*/embeds

*

Rule	Path
Disallow	/i/a/*
Disallow	/*/includes
Disallow	/*/scripts
Disallow	/*/styles
Disallow	/pulled-content
Disallow	/*/pulled-content
Disallow	/*/embeds

Rule

Path

Disallow

/i/a/*

Disallow

/*/includes

Disallow

/*/scripts

Disallow

/*/styles

Disallow

/pulled-content

Disallow

/*/pulled-content

Disallow

/*/embeds

Back to top

Comments

Robots.txt file

Back to top

tc.columbia.edurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot

*

Comments

tc.columbia.edu
robots.txt