pressaorural.com.br
robots.txt

Robots Exclusion Standard data for pressaorural.com.br

Resource Scan

Scan Details

Site Domain pressaorural.com.br
Base Domain pressaorural.com.br
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-08-31T16:31:10+00:00
Next Scan 2024-09-30T16:31:10+00:00

Last Successful Scan

Scanned2024-07-10T16:30:10+00:00
URL https://pressaorural.com.br/robots.txt
Domain IPs 104.21.72.205, 172.67.187.83, 2606:4700:3030::6815:48cd, 2606:4700:3034::ac43:bb53
Response IP 104.21.72.205
Found Yes
Hash 0483f4094b19ed9dc1277dc0bdeaa92e3812157187d748c56f026092bf44be8f
SimHash 3f70c7a4ca11

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /busca*
Disallow /*%26sort%3D*
Disallow /*%26order%3D*
Disallow /*%26limit%3D*
Disallow /*%26q%3D*
Disallow /*?q=*
Disallow /*%26filter%3D*
Disallow /*?filter=*
Disallow /*%26size%3D*
Disallow /*?size=*
Disallow /login$
Disallow /carrinho$
Disallow /cadastro$
Disallow /checkout$
Disallow /vale-presente
Disallow /meus-pedidos
Disallow /minha-conta
Disallow /minha-conta-afiliado

domaincrawler/3.0

Rule Path
Disallow /

dirbuster-0.12

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

spbot

Rule Path
Disallow /

everyonesocialbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://files.irroba.com.br/pressaor/feeds/sitemap.xml