Robots Exclusion Standard data for tucia.com

Resource Scan

Scan Details

Site Domain tucia.com
Base Domain tucia.com
Scan Status Ok
Last Scan 2025-11-13T17:18:18+00:00
Next Scan 2025-12-13T17:18:18+00:00

Last Scan

Scanned 2025-11-13T17:18:18+00:00
URL https://tucia.com/robots.txt
Redirect https://www.tucia.com/robots.txt
Redirect Domain www.tucia.com
Redirect Base tucia.com
Domain IPs 104.21.78.240, 172.67.138.184, 2606:4700:3033::6815:4ef0, 2606:4700:3037::ac43:8ab8
Redirect IPs 104.21.78.240, 172.67.138.184, 2606:4700:3033::6815:4ef0, 2606:4700:3037::ac43:8ab8
Response IP 172.67.138.184
Found Yes
Hash 02f00df9ca0f5ddb6ae3af0100fa2841012d97af8e30cfe045cfc456c4d35e03
SimHash 62428d13e3b2

Groups

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.tucia.com/sitemap-index.xml

Comments

  • Allow all bots to access the entire site
  • This is good for SEO and discovery
  • Sitemap location
  • Crawl delay to prevent server overload
  • Most major search engines ignore this, but it's helpful for smaller bots
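
The records above can be checked with a short sketch that uses Python's standard urllib.robotparser. The robots.txt text below is a reconstruction from the scanned fields (one "*" group with Allow /, crawl-delay 10, and the sitemap record); it is not the byte-for-byte file, so it would not reproduce the Hash listed above. The agent name "ExampleBot" and the page path are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan fields above; the real file's ordering,
# comments, and whitespace are assumptions and may differ.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Crawl-delay: 10

Sitemap: https://www.tucia.com/sitemap-index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" and the URL path are hypothetical, used only for illustration.
can_fetch = parser.can_fetch("ExampleBot", "https://www.tucia.com/any/page")
delay = parser.crawl_delay("ExampleBot")
sitemaps = parser.site_maps()  # requires Python 3.8 or later

print(can_fetch, delay, sitemaps)
```

A crawler that honors crawl-delay would wait the returned number of seconds between requests; as the comments in the file note, many large search engines do not act on that field.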