correiodoestado.com.br
robots.txt

Robots Exclusion Standard data for correiodoestado.com.br

Resource Scan

Scan Details

Site Domain correiodoestado.com.br
Base Domain correiodoestado.com.br
Scan Status Ok
Last Scan 2024-09-28T06:27:32+00:00
Next Scan 2024-10-05T06:27:32+00:00
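
The two timestamps above imply a scan interval, which can be checked with a small sketch. The variable names are illustrative, and the report does not say the interval is fixed, so treat the result as an observation about this one pair of values rather than a documented schedule.

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-09-28T06:27:32+00:00")
next_scan = datetime.fromisoformat("2024-10-05T06:27:32+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval.days)
```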

Last Scan

Scanned 2024-09-28T06:27:32+00:00
URL https://correiodoestado.com.br/robots.txt
Domain IPs 104.21.86.129, 172.67.220.47, 2606:4700:3032::6815:5681, 2606:4700:3034::ac43:dc2f
Response IP 172.67.220.47
Found Yes
Hash 3d463d23aeb86c2b359273d8aeb2d971bdd51211cf082317870f2e7a5c5a2fea
SimHash e90812448810

Groups

*

Rule Path
Disallow /noticia/versao_impressa/
Disallow /noticia/json/
Disallow /noticia/noticia_redirect/
Disallow /busca/
Disallow /frame_ultima_edicao/
Disallow /banner/clica_banner/
Disallow /cdn-cgi/
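
As a rough illustration of how a crawler might apply the rules in this group, the sketch below transcribes them into robots.txt syntax and checks a few paths with Python's standard urllib.robotparser. The exact text of the live file is not reproduced in this report, so the transcription is an assumption based on the table; the user agent name ExampleBot and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rules for the "*" group, transcribed from the table above into
# robots.txt syntax (assumed layout; the live file may differ in form).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /noticia/versao_impressa/",
    "Disallow: /noticia/json/",
    "Disallow: /noticia/noticia_redirect/",
    "Disallow: /busca/",
    "Disallow: /frame_ultima_edicao/",
    "Disallow: /banner/clica_banner/",
    "Disallow: /cdn-cgi/",
]

BASE_URL = "https://correiodoestado.com.br"


def build_parser(lines):
    """Parse robots.txt lines without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, path, agent="ExampleBot"):
    """Return True if the (hypothetical) agent may fetch the given path."""
    return parser.can_fetch(agent, BASE_URL + path)


parser = build_parser(ROBOTS_LINES)
# Sample paths are made up for illustration only.
for path in ("/", "/busca/termo", "/noticia/json/123", "/esportes/"):
    print(path, is_allowed(parser, path))
```

The standard-library parser matches rules by simple path prefix and does not implement wildcard patterns; that is sufficient here because none of the listed paths contain wildcards.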

Other Records

Field Value
sitemap https://correiodoestado.com.br/sitemap/
sitemap https://correiodoestado.com.br/sitemap/news/
sitemap https://correiodoestado.com.br/sitemap/categories/
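
The sitemap records above can be read back out of a robots.txt body with the same standard-library parser. This is a minimal sketch: the lines are rewritten in "Sitemap: URL" form as an assumption about the original file, the helper name list_sitemaps is illustrative, and site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records from the table above, in assumed robots.txt syntax.
SITEMAP_LINES = [
    "Sitemap: https://correiodoestado.com.br/sitemap/",
    "Sitemap: https://correiodoestado.com.br/sitemap/news/",
    "Sitemap: https://correiodoestado.com.br/sitemap/categories/",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap record is present.
    return parser.site_maps() or []


sitemaps = list_sitemaps(SITEMAP_LINES)
for url in sitemaps:
    print(url)
```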