eldiariodecoahuila.com.mx
robots.txt

Robots Exclusion Standard data for eldiariodecoahuila.com.mx

Resource Scan

Scan Details

Site Domain eldiariodecoahuila.com.mx
Base Domain eldiariodecoahuila.com.mx
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-21T17:27:43+00:00
Next Scan 2024-09-28T17:27:43+00:00

Last Successful Scan

Scanned2023-11-27T11:49:53+00:00
URL https://eldiariodecoahuila.com.mx/robots.txt
Domain IPs 104.26.12.202, 104.26.13.202, 172.67.74.246, 2606:4700:20::681a:cca, 2606:4700:20::681a:dca, 2606:4700:20::ac43:4af6
Response IP 172.67.74.246
Found Yes
Hash 3b33063cc07445a2a07856a5d16bf8042801eebe01b015ad08c9bb96cf6679ab
SimHash 6a2452513c34

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /respaldo/
Disallow /u/

Other Records

Field Value
crawl-delay 60

facebookbot

Rule Path
Disallow /wp-includes/

Other Records

Field Value Comment
crawl-delay 15 1 page per 15 seconds

facebookexternalhit

Rule Path
Disallow /wp-includes/

Other Records

Field Value Comment
crawl-delay 15 1 page per 15 seconds

*

Rule Path
Disallow /favicon.ico

twitterbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 15

okhttp/2.5.0

Rule Path
Disallow /

yandexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /site.webmanifest

mediapartners-google

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

amazonbot

Rule Path
Disallow /do-not-crawl/

sentibot

Rule Path
Disallow /

bloomberg-net

Rule Path
Disallow /wp-includes/

Other Records

Field Value Comment
crawl-delay 10 1 page per 5 seconds

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

newspaper/0.2.8

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

yeti

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

showyoubot

Rule Path
Disallow /

gozaikbot

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

buck

Rule Path
Disallow /

wikido

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

bidswitchbot

Rule Path
Disallow /

g-i-g-a-b-o-t

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

caam

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

clickagy intelligence bot

Rule Path
Disallow /

jersey

Rule Path
Disallow /

libwww-perl

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

omgili

Rule Path
Disallow /

piplbot

Rule Path
Disallow /