blog.badbadrobot.tv
robots.txt

Robots Exclusion Standard data for blog.badbadrobot.tv

Resource Scan

Scan Details

Site Domain blog.badbadrobot.tv
Base Domain badbadrobot.tv
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-27T23:43:43+00:00
Next Scan 2026-01-25T23:43:43+00:00

Last Successful Scan

Scanned2023-12-31T17:46:15+00:00
URL https://blog.badbadrobot.tv/robots.txt
Domain IPs 74.114.154.18, 74.114.154.22
Response IP 74.114.154.18
Found Yes
Hash 30eccb999716ad6b9022a231e084e982898999f9414ab8ce61c9a7af572bb0e4
SimHash 6b94d8414526

Groups

*

Rule Path
Disallow /random
Disallow /day
Disallow /sticky-ad-iframe.html
Disallow /privacy/consent

Other Records

Field Value
crawl-delay 1

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://blog.badbadrobot.tv/sitemap.xml

Comments

  • OpenAI's crawler
  • Common Crawl's crawler
  • SentiBot's crawler
  • Google Bard's crawler
  • Facebook's crawler
  • webz.io's crawler
  • webz.io's crawler
  • Amazon's crawler
  • Bing's crawler