miotraliga.com
robots.txt

Robots Exclusion Standard data for miotraliga.com

Resource Scan

Scan Details

Site Domain miotraliga.com
Base Domain miotraliga.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-19T20:20:56+00:00
Next Scan 2024-09-26T20:20:56+00:00

Last Successful Scan

Scanned2024-09-11T16:32:23+00:00
URL https://miotraliga.com/robots.txt
Domain IPs 104.21.7.129, 172.67.155.122, 2606:4700:3030::6815:781, 2606:4700:3036::ac43:9b7a
Response IP 172.67.155.122
Found Yes
Hash 4e0f9fdde572f18070f4bc568444f157112205b67e93db3e9cc0b991cdabbe33
SimHash 6844dc808051

Groups

*

Rule Path
Disallow /ustedpregunta/categoria/sexo/
Disallow /adconion_preroll/
Disallow /videos/
Disallow /noticia/admin/
Disallow /imagen/admin/
Disallow /encuesta/admin/
Disallow /video/admin/
Disallow /directo/admin/
Disallow /admin/elementos/
Disallow /admin/publicidad/
Disallow /alert/
Disallow /user/admin/

mediapartners-google

Rule Path
Allow /

googlebot

Rule Path
Allow /
Disallow /es_ES/resultados/noticias/?q=
Disallow /es_ES/resultados/*
Disallow /intranet/*
Disallow /resultadoEncuesta/*
Disallow /admin/
Disallow /donbalonrosa/
Disallow /555/
Allow /feed/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://panel.defensacentral.com/sitemap_index.xml
sitemap https://portal23backoffice.defensacentral.com/sitemaps/sitemap-index.xml
sitemap https://portal23backoffice.defensacentral.com/sitemaps/sitemap-news.xml
sitemap https://portal23backoffice.defensacentral.com/sitemaps/category-sitemap.xml