originstexas.com
robots.txt

Robots Exclusion Standard data for originstexas.com

Resource Scan

Scan Details

Site Domain originstexas.com
Base Domain originstexas.com
Scan Status Ok
Last Scan2025-10-28T11:56:30+00:00
Next Scan 2025-11-27T11:56:30+00:00

Last Scan

Scanned2025-10-28T11:56:30+00:00
URL https://originstexas.com/robots.txt
Redirect https://www.originstexas.com/robots.txt
Redirect Domain www.originstexas.com
Redirect Base originstexas.com
Domain IPs 45.76.232.99
Redirect IPs 45.76.232.99
Response IP 45.76.232.99
Found Yes
Hash 8a2c75619f433b0594e1ad1bebbbe2e4dc6dbf39f2697014e3a4d1f96e5b8fcd
SimHash 62b078d5cff5

Groups

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot

Rule Path
Allow /

gptbot
ccbot
google-extended

Rule Path
Disallow /

googlebot
bingbot

Rule Path
Allow /

*

Rule Path
Disallow /admin/
Disallow /internal/

Other Records

Field Value
sitemap https://www.originstexas.com/sitemap_index.xml

Comments

  • Allow AI search and agent use
  • Disallow AI training data collection
  • Allow traditional search indexing
  • Disallow access to admin areas for all bots