tubillete.com
robots.txt

Robots Exclusion Standard data for tubillete.com

Resource Scan

Scan Details

Site Domain tubillete.com
Base Domain tubillete.com
Scan Status Ok
Last Scan2024-09-25T01:28:20+00:00
Next Scan 2024-10-25T01:28:20+00:00

Last Scan

Scanned2024-09-25T01:28:20+00:00
URL https://tubillete.com/robots.txt
Domain IPs 3.251.19.98, 34.246.34.78, 54.74.24.248
Response IP 34.246.34.78
Found Yes
Hash cedc8f55b7475fae9dd648fd24c875132abe69a0ca4e3fc1d8a31ca81cb17875
SimHash 43567850c831

Groups

*

Rule Path
Disallow /npack/*
Disallow /analytics/*
Disallow /commons/*
Disallow /hoteles/autocomplete/*
Allow /hotel/js/*
Allow /hoteles/js/*

adnettrack
leiacrawler
mindcrawler
searchtone2.0
whatsup_gold/5.01
claude-web
claudebot
compspybot
curious george
cybeye.com
docomo
exb language crawler
ezooms
flamingo_searchengine
genieo
genio
lwnutch
lexxebot
openwebindex
rediffnewsbot
seoengworldbot
scanmine
screaming frog seo spider
shopwiki
showyoubot
sosospider
wocbot
yandex
yeti
youdaobot
anthropic-ai
daumoa
gsa-crawler
libcrawl
linkdex
magpie-crawler
repparser
rogerbot
sindice-site-manager
sogou
woriobot
yacybot
yolinkbot
chatgpt-user
gptbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.tubillete.com/sitemap.xml

Comments

  • Robots peligrosos o que consumen mucho ancho de banda

Warnings

  • 2 invalid lines.