montea.com
robots.txt

Robots Exclusion Standard data for montea.com

Resource Scan

Scan Details

Site Domain montea.com
Base Domain montea.com
Scan Status Ok
Last Scan2026-02-07T10:50:42+00:00
Next Scan 2026-02-14T10:50:42+00:00

Last Scan

Scanned2026-02-07T10:50:42+00:00
URL https://montea.com/robots.txt
Domain IPs 104.17.124.41, 104.17.125.41
Response IP 104.17.124.41
Found Yes
Hash 3fd0e21c4f023d55b5e80f7c817a5d9e496d1330395e0708ae52aa8dba4dc025
SimHash 61341b3a5d36

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://montea.com/en/sitemaps-1-sitemap.xml
sitemap https://montea.com/nl/sitemaps-1-sitemap.xml
sitemap https://montea.com/fr/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://montea.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • User-agent: *
  • Disallow: /cpresources/
  • Disallow: /vendor/
  • Disallow: /.env
  • Disallow: /cache/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
  • Disallow Perplexity bot, as there's no benefit to allowing it to index your site