www.tacc.utexas.edu
robots.txt

Robots Exclusion Standard data for www.tacc.utexas.edu

Resource Scan

Scan Details

Site Domain www.tacc.utexas.edu
Base Domain utexas.edu
Scan Status Ok
Last Scan2025-06-07T06:56:16+00:00
Next Scan 2025-07-07T06:56:16+00:00

Last Scan

Scanned2025-06-07T06:56:16+00:00
URL https://www.tacc.utexas.edu/robots.txt
Redirect https://tacc.utexas.edu/robots.txt
Redirect Domain tacc.utexas.edu
Redirect Base utexas.edu
Domain IPs 129.114.4.127
Redirect IPs 129.114.4.127
Response IP 129.114.4.127
Found Yes
Hash e41dbc0d3b0821b3c3a86123e4d74f5e1e2d0fb99729005f0d7ba71508725a62
SimHash 600449488037

Groups

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claudebot/1.0

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

*

Rule Path
Disallow /test