the-health-site.com
robots.txt

Robots Exclusion Standard data for the-health-site.com

Resource Scan

Scan Details

Site Domain the-health-site.com
Base Domain the-health-site.com
Scan Status Ok
Last Scan2025-04-11T22:00:03+00:00
Next Scan 2025-04-18T22:00:03+00:00

Last Scan

Scanned2025-04-11T22:00:03+00:00
URL https://the-health-site.com/robots.txt
Domain IPs 104.21.21.135, 172.67.199.21, 2606:4700:3035::ac43:c715, 2606:4700:3036::6815:1587
Response IP 172.67.199.21
Found Yes
Hash 800424b86aec722bcbf18bef8ec058727bcccedf40d1b5da32d0d18a81906702
SimHash 6d14dae75791

Groups

*

Rule Path
Disallow

googlebot

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

gigabot

Rule Path
Disallow /

yottosbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

yandexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://the-health-site.com/sitemap.xml

Comments

  • Not bad robots
  • Маджестик go to fack!
  • LinkBot go to fack!

Warnings

  • `host` is not a known field.