truvia.com
robots.txt

Robots Exclusion Standard data for truvia.com

Resource Scan

Scan Details

Site Domain truvia.com
Base Domain truvia.com
Scan Status Ok
Last Scan2026-01-20T04:29:22+00:00
Next Scan 2026-02-19T04:29:22+00:00

Last Scan

Scanned2026-01-20T04:29:22+00:00
URL https://truvia.com/robots.txt
Domain IPs 104.17.124.41, 104.17.125.41
Response IP 104.17.124.41
Found Yes
Hash 45e81ab87cb548c700d9477b688fa456a83b1eeb365390ef326cd7efbe0a0d58
SimHash a60c19423a91

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.truvia.com/sitemaps-1-sitemap.xml
sitemap https://www.truvia.ca/sitemaps-1-sitemap.xml
sitemap https://www.truvia.ca/fr/sitemaps-1-sitemap.xml
sitemap https://www.truvia.co.uk/sitemaps-1-sitemap.xml
sitemap https://www.truvia.me/en/sitemaps-1-sitemap.xml
sitemap https://www.truvia.me/sitemaps-1-sitemap.xml
sitemap https://www.truvia.com.br/sitemaps-1-sitemap.xml
sitemap https://www.truvia.com.au/sitemaps-1-sitemap.xml
sitemap https://www.truvia.co.il/sitemaps-1-sitemap.xml
sitemap https://www.truvia.cn/sitemaps-1-sitemap.xml
sitemap https://www.truvia.co.za/sitemaps-1-sitemap.xml
sitemap https://www.truvia.es/sitemaps-1-sitemap.xml
sitemap https://www.truvia.it/sitemaps-1-sitemap.xml
sitemap https://www.truvia.ph/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.truvia.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/