cholloteca.com
robots.txt

Robots Exclusion Standard data for cholloteca.com

Resource Scan

Scan Details

Site Domain cholloteca.com
Base Domain cholloteca.com
Scan Status Ok
Last Scan2024-05-30T14:00:29+00:00
Next Scan 2024-06-06T14:00:29+00:00

Last Scan

Scanned2024-05-30T14:00:29+00:00
URL https://cholloteca.com/robots.txt
Domain IPs 104.21.24.180, 172.67.219.224, 2606:4700:3036::6815:18b4, 2606:4700:3037::ac43:dbe0
Response IP 104.21.24.180
Found Yes
Hash ab62ef3d3c8b555f79bdec99721c7651bead5a09a0d837f021efdb0640e177bf
SimHash 849cd75cccb3

Groups

*

Rule Path
Allow /
Disallow /links
Disallow /goto
Disallow /wp-admin/
Disallow /readme.html$
Disallow /?s=
Disallow /search
Allow /*.js$
Allow /*.css$

googlebot-image

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

yandex

Rule Path
Allow /

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mediapartners-google

Rule Path
Disallow /

proximic

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

awariobot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

ia_archiver disallow: /
addthis.com disallow: /
admantx disallow: /
ahrefsbot disallow: /
bdcbot disallow: /
bender disallow: /
bixocrawler disallow: /
bl.uk_lddc_bot disallow: /
blexbot disallow: /
bubing disallow: /
cliqzbot disallow: /
cncdialer disallow: /
crawler4j disallow: /
crystalsemanticsbot disallow: /
cyberalert disallow: /
digext disallow: /
discobot disallow: /
discoverybot disallow: /
dloader disallow: /
dloader(naverrobot) disallow: /
doc disallow: /
dotbot disallow: /
download ninja disallow: /
dts agent disallow: /
exabot disallow: /
ezooms disallow: /
fairshare disallow: /
fetch disallow: /
flamingo_searchengine disallow: /
genieo disallow: /
gigabot disallow: /
grub-client disallow: /
heritrix disallow: /
heritrix/3.3.0 disallow: /
httrack disallow: /
integromedb disallow: /
istellabot disallow: /
jikespider disallow: /
jyxobot disallow: /
k2spider disallow: /
kimengi disallow: /
kimengi/nineconnections.com disallow: /
larbin disallow: /
lexxebot/1.0 disallow: /
libwww disallow: /
linko disallow: /
livelapbot disallow: /
magpie-crawler disallow: /
maxthon disallow: /
metauri disallow: /
microsoft.url.control disallow: /
mj12bot disallow: /
moreover disallow: /
moreoverbot disallow: /
msiecrawler disallow: /
nabot disallow: /
naverbot disallow: /
nerdbynature.bot disallow: /
netestate ne crawler disallow: /
netseer crawler disallow: /
newscan disallow: /
nextgensearchbot disallow: /
npbot disallow: /
nutch disallow: /
offline explorer disallow: /
omgilibot disallow: /
orthogaffe disallow: /
piplbot disallow: /
pixray-seeker disallow: /
proximic disallow: /
psbot disallow: /
queryseekerspider disallow: /
rogerbot disallow: /
seokicks disallow: /
seokicks-robot disallow: /
sitebot disallow: /
sitebot/0.1 disallow: /
sitecheck.internetseer.com disallow: /
sitesnagger disallow: /
slurp disallow: /
sogou disallow: /
sosospider disallow: /
spbot disallow: /
spinn3r disallow: /
teleport disallow: /
teleportpro disallow: /
trendictionbot disallow: /
trovitbot disallow: /
turnitinbot disallow: /
ubicrawler disallow: /
umbot-ln disallow: /
unisterbot disallow: /
universalfeedparser disallow: /
wbsearchbot disallow: /
webcopier disallow: /
webreaper disallow: /
webstripper disallow: /
webzip disallow: /
wesee:search disallow: /
wget disallow: /
wotbot disallow: /
wotbox disallow: /
xenu disallow: /
yasni disallow: /
zao disallow: /
zealbot disallow: /
zyborg disallow: /
offlineexplorer disallow: /
chatgpt-user disallow: /
gptbot disallow: /
ccbot disallow: /
anthropic-ai disallow: /
cohere-ai disallow: /
omgili disallow: /
claritybot disallow: /
google-extended disallow: /

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://cholloteca.com/sitemap.xml

Comments

  • General
  • Evita bloqueos de CSS y JS.
  • Lista de bots que deberías permitir.
  • Slurp (Yahoo!), Noxtrum y el bot de MSN que suelen generar excesivas consultas.
  • Redes publicitarias
  • Analizadores de enlaces
  • Descargadores
  • ---------------------------
  • lista de bots y ia a bloquear

Warnings

  • 1 invalid line.