Robots Exclusion Standard data for theorie.nl

Resource Scan

Scanned	2025-08-26T16:52:23+00:00
URL	https://theorie.nl/robots.txt
Redirect	https://www.theorie.nl/robots.txt
Redirect Domain	www.theorie.nl
Redirect Base	theorie.nl
Domain IPs	52.18.136.112
Redirect IPs	54.170.125.114
Response IP	54.170.125.114
Found	Yes
Hash	6e70e180d79b29248aa90f0b721e9d88ecafd8e7804eca39befe17e1e81d9785
SimHash	6194196a4d36

Rule	Path
Disallow	/cpresources/
Disallow	/vendor/
Disallow	/.env
Disallow	/cache/

Field	Value
sitemap	https://www.theorie.nl/sitemaps-1-sitemap.xml

# robots.txt for https://www.theorie.nl/
# live - don't allow web crawlers to index cpresources/ or vendor/
# Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
User-agent: GPTBot
Disallow: /
# Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
User-agent: Google-Extended
Disallow: /
# Disallow Perplexity bot, as there's no benefit to allowing it to index your site
User-agent: PerplexityBot
Disallow: /
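
To illustrate how a crawler might evaluate these rules, here is a minimal sketch using Python's standard `urllib.robotparser`. It parses an inline copy of the rules instead of fetching the live file. The scan does not show which user agent the four path rules belong to, so the `User-agent: *` group below is an assumption, and the `allowed` helper is a hypothetical name used only for this example.

```python
from urllib.robotparser import RobotFileParser

# Inline copy of the rules listed in this scan.
# ASSUMPTION: the four path rules sit under "User-agent: *";
# the scan lists the paths but not their user-agent group.
ROBOTS_LINES = [
    "sitemap: https://www.theorie.nl/sitemaps-1-sitemap.xml",
    "User-agent: *",
    "Disallow: /cpresources/",
    "Disallow: /vendor/",
    "Disallow: /.env",
    "Disallow: /cache/",
    "",
    "User-agent: GPTBot",
    "Disallow: /",
    "",
    "User-agent: Google-Extended",
    "Disallow: /",
    "",
    "User-agent: PerplexityBot",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on www.theorie.nl?"""
    return parser.can_fetch(agent, "https://www.theorie.nl" + path)


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/"),
        ("Googlebot", "/vendor/autoload.php"),
        ("GPTBot", "/"),
        ("PerplexityBot", "/cache/"),
    ]:
        print(agent, path, allowed(agent, path))
    # site_maps() requires Python 3.8 or later.
    print(parser.site_maps())
```

Under these assumptions, a generic crawler is blocked only from the four listed paths, while the three named AI crawlers are blocked from the entire site.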
