Robots Exclusion Standard data for euci.com

Resource Scan

Scan Details

Site Domain euci.com
Base Domain euci.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-08-25T11:52:06+00:00
Next Scan 2025-09-08T11:52:06+00:00

Last Successful Scan

Scanned 2025-07-18T09:14:59+00:00
URL https://euci.com/robots.txt
Redirect https://www.euci.com/robots.txt
Redirect Domain www.euci.com
Redirect Base euci.com
Domain IPs 172.66.41.43, 172.66.42.213, 2606:4700:3108::ac42:292b, 2606:4700:3108::ac42:2ad5
Redirect IPs 172.66.41.43, 172.66.42.213, 2606:4700:3108::ac42:292b, 2606:4700:3108::ac42:2ad5
Response IP 172.66.42.213
Found Yes
Hash a9f776f06da625f1411c68a93ba5bdd9c00732432d16b2870512f7c1a2729c32
SimHash 62b17050af90

Groups

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot

Rule Path
Allow /

gptbot
ccbot
google-extended

Rule Path
Disallow /

googlebot
bingbot

Rule Path
Allow /

*

Rule Path
Disallow /wp-content/plugins/
Disallow /wp-admin/
Disallow /pdf/
Disallow /images/
Disallow /sg/

Other Records

Field Value
sitemap https://www.euci.com/sitemap_index.xml

Comments

  • Allow AI search and agent use
  • Disallow AI training data collection
  • Allow traditional search indexing
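
The groups above can be checked mechanically. The following is a minimal sketch using Python's standard urllib.robotparser. It assumes a robots.txt reconstructed from the listed groups; the original file's exact text, ordering, and whitespace are not shown in this report, so the reconstruction will not reproduce the recorded hash. The agent name SomeCrawler and the test paths are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (an assumption:
# the real file's exact text and ordering are not known).
ROBOTS_TXT = """\
User-agent: oai-searchbot
User-agent: chatgpt-user
User-agent: perplexitybot
User-agent: firecrawlagent
User-agent: andibot
User-agent: exabot
User-agent: phindbot
User-agent: youbot
Allow: /

User-agent: gptbot
User-agent: ccbot
User-agent: google-extended
Disallow: /

User-agent: googlebot
User-agent: bingbot
Allow: /

User-agent: *
Disallow: /wp-content/plugins/
Disallow: /wp-admin/
Disallow: /pdf/
Disallow: /images/
Disallow: /sg/

Sitemap: https://www.euci.com/sitemap_index.xml
"""

BASE_URL = "https://www.euci.com"


def build_parser(text):
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical (agent, path) pairs, one or more per group.
checks = [
    ("OAI-SearchBot", "/"),       # AI search and agent group
    ("GPTBot", "/"),              # AI training group
    ("Googlebot", "/pdf/"),       # traditional search group
    ("SomeCrawler", "/wp-admin/"),  # falls through to the * group
    ("SomeCrawler", "/"),
]

for agent, path in checks:
    allowed = parser.can_fetch(agent, BASE_URL + path)
    print(f"{agent:15} {path:12} {'allowed' if allowed else 'blocked'}")
```

Note that urllib.robotparser applies the first matching rule within a group and matches agent names by case-insensitive substring, which can differ from the longest-match behavior some crawlers implement. For a file this simple the results should agree, but treat the output as an approximation of how any particular crawler reads the rules.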