detox.com
robots.txt

Robots Exclusion Standard data for detox.com

Resource Scan

Scan Details

Site Domain detox.com
Base Domain detox.com
Scan Status Ok
Last Scan2025-08-17T09:59:33+00:00
Next Scan 2025-08-24T09:59:33+00:00

Last Scan

Scanned2025-08-17T09:59:33+00:00
URL https://detox.com/robots.txt
Domain IPs 104.21.93.11, 172.67.202.67, 2606:4700:3035::ac43:ca43, 2606:4700:3036::6815:5d0b
Response IP 104.21.93.11
Found Yes
Hash 55a27dfa933002c6df681fc17380baa486efd067316a1b590a830924638708fd
SimHash e864cbcac52f

Groups

bytespider

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.detox.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • Block Bytespider, used by various services for crawling.
  • Block Diffbot, a bot for extracting structured data from websites.
  • Block ImagesiftBot, likely related to image processing or crawling.
  • Block Omgili, a bot related to web crawling by Omgili.
  • Block Omgilibot, a bot used for crawling by Omgili.
  • Block YouBot, another general bot.
  • ---------------------------
  • END YOAST BLOCK