t4dev.tc.columbia.edu
robots.txt

Robots Exclusion Standard data for t4dev.tc.columbia.edu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	t4dev.tc.columbia.edu
Base Domain	columbia.edu
Scan Status	Ok
Last Scan	2025-10-13T10:09:05+00:00
Next Scan	2025-11-12T10:09:05+00:00

Last Scan

Scanned	2025-10-13T10:09:05+00:00
URL	https://t4dev.tc.columbia.edu/robots.txt
Domain IPs	52.44.155.110
Response IP	52.44.155.110
Found	Yes
Hash	25982ad93a870b73b1b0688b2afa74bf20c0460e40516a2b871ffab056cf0c0a
SimHash	89b01c8d2792

Groups

googlebot

Rule	Path
Disallow	/i/a/*
Disallow	/*/includes
Disallow	/*/scripts
Disallow	/*/styles
Disallow	/pulled-content
Disallow	/*/pulled-content
Disallow	/*/embeds

Rule

Path

Disallow

/i/a/*

Disallow

/*/includes

Disallow

/*/scripts

Disallow

/*/styles

Disallow

/pulled-content

Disallow

/*/pulled-content

Disallow

/*/embeds

*

Rule	Path
Disallow	/i/a/*
Disallow	/*/includes
Disallow	/*/scripts
Disallow	/*/styles
Disallow	/pulled-content
Disallow	/*/pulled-content
Disallow	/*/embeds

Rule

Path

Disallow

/i/a/*

Disallow

/*/includes

Disallow

/*/scripts

Disallow

/*/styles

Disallow

/pulled-content

Disallow

/*/pulled-content

Disallow

/*/embeds

Back to top

Comments

Robots.txt file

Back to top

t4dev.tc.columbia.edurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot

*

Comments

t4dev.tc.columbia.edu
robots.txt