cardiowelt.de
robots.txt

Robots Exclusion Standard data for cardiowelt.de

Resource Scan

Scan Details

Site Domain cardiowelt.de
Base Domain cardiowelt.de
Scan Status Ok
Last Scan2024-06-21T12:50:56+00:00
Next Scan 2024-06-28T12:50:56+00:00

Last Scan

Scanned2024-06-21T12:50:56+00:00
URL https://cardiowelt.de/robots.txt
Domain IPs 104.21.18.38, 172.67.180.67, 2606:4700:3032::6815:1226, 2606:4700:3032::ac43:b443
Response IP 172.67.180.67
Found Yes
Hash 99d9101323210a492a05af60aa4f3029c1a08f70edc12b82d4bc3d38524357fc
SimHash e17419224736

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cardiowelt.de/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://cardiowelt.de/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site