noest.be
robots.txt

Robots Exclusion Standard data for noest.be

Resource Scan

Scan Details

Site Domain noest.be
Base Domain noest.be
Scan Status Ok
Last Scan2025-09-11T09:01:45+00:00
Next Scan 2025-09-18T09:01:45+00:00

Last Scan

Scanned2025-09-11T09:01:45+00:00
URL https://noest.be/robots.txt
Domain IPs 2a04:3544:1000:1510:3cc8:64ff:fefa:5871, 94.237.125.171
Response IP 94.237.125.171
Found Yes
Hash 201a5e275c3b86a57aa1c1a32c2e23490a0352f8109a6621a0d42f63e732e377
SimHash 51180a52fdb1

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
crawl-delay 20

ahrefsbot
amazonbot
awariobot
barkrowler
blexbot
bw/1.1
bytespider
censysinspect
claudebot
coccocbot
dalvik/2.1.0
dataforseobot
dataprovider
dotbot
expanse
foregenix
geedoproductsearch
google-extended
gptbot
imagesiftbot
internet-measurement
ioncrawl
java
mj12bot
mozlila
orbbot
perplexitybot
petalbot
python-requests
scrapy
semrushbot
seznambot
wp_is_mobile
yandexbot
zoominfobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://noest.be/nl-be/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://noest.be/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow Bots