myhealthbox.eu
robots.txt

Robots Exclusion Standard data for myhealthbox.eu

Resource Scan

Scan Details

Site Domain myhealthbox.eu
Base Domain myhealthbox.eu
Scan Status Ok
Last Scan2024-10-02T02:06:04+00:00
Next Scan 2024-10-09T02:06:04+00:00

Last Scan

Scanned2024-10-02T02:06:04+00:00
URL https://myhealthbox.eu/robots.txt
Domain IPs 95.110.227.120
Response IP 95.110.227.120
Found Yes
Hash 660a9a45e3c1942408ef418dec709ea7855eae527e377a3126f247e0eb86a1bf
SimHash 785e9851c4b2

Groups

lcc

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

omgili

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

*

Rule Path
Disallow /*/view/*
Disallow /*/download/*
Disallow /webmail
Disallow */leaflet_request.php?*

Other Records

Field Value
sitemap https://myhealthbox.eu/sitemaps/sitemap.xml

Comments

  • AI Data Scraper
  • All the others
  • Disallow: */search.php?q=*
  • Sitemap