acervo.estadao.com.br
robots.txt

Robots Exclusion Standard data for acervo.estadao.com.br

Resource Scan

Scan Details

Site Domain acervo.estadao.com.br
Base Domain estadao.com.br
Scan Status Ok
Last Scan2024-09-26T21:50:11+00:00
Next Scan 2024-10-10T21:50:11+00:00

Last Scan

Scanned2024-09-26T21:50:11+00:00
URL https://acervo.estadao.com.br/robots.txt
Domain IPs 23.45.207.200, 23.45.207.202, 2600:1413:b000:13::b857:c194, 2600:1413:b000:13::b857:c19e
Response IP 23.49.60.64
Found Yes
Hash bc938288faef2bc16c0a86b0472c512164e9d0000cc8e0ffc495d0d68e2b0230
SimHash 2e14936051b2

Groups

*

Rule Path
Disallow /ewok
Disallow /logs
Disallow /robos-linux
Disallow /shared
Disallow /estadao
Disallow /newsletter/
Disallow /correcoes/enviar?url=*
Disallow /ext/
Disallow /eleicoes/2018/busca*
Disallow /eleicoes/2020/busca*
Disallow /feed/$
Disallow /politica/eleicoes/2024/candidatos-*/*/vice-prefeito/$
Disallow /politica/eleicoes/2024/candidatos-*/prefeito/$
Disallow /politica/eleicoes/2024/candidatos-*/vereador/$
Disallow /politica/eleicoes/2024/candidatos-*/vice-prefeito/$
Disallow /politica/eleicoes/2024/candidatos/*
Allow /politica/eleicoes/2024/candidatos-*/*/prefeito/$
Allow /politica/eleicoes/2024/candidatos-*/*/vereador/$
Allow /ext/.pdf
Allow /estadao-verifica

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.estadao.com.br/arc/outboundfeeds/sitemap-index-by-day/?outputType=xml
sitemap https://www.estadao.com.br/sitemaps/custom/www.estadao
sitemap https://www.estadao.com.br/sitemap/galerias/www.estadao/auto.xml

Comments

  • Support directories
  • Sitemaps