progresso.com.br
robots.txt

Robots Exclusion Standard data for progresso.com.br

Resource Scan

Scan Details

Site Domain progresso.com.br
Base Domain progresso.com.br
Scan Status Ok
Last Scan2025-12-15T11:05:14+00:00
Next Scan 2025-12-22T11:05:14+00:00

Last Scan

Scanned2025-12-15T11:05:14+00:00
URL https://progresso.com.br/robots.txt
Redirect https://www.progresso.com.br/robots.txt
Redirect Domain www.progresso.com.br
Redirect Base progresso.com.br
Domain IPs 104.21.18.53, 172.67.180.106, 2606:4700:3035::6815:1235, 2606:4700:3036::ac43:b46a
Redirect IPs 104.21.18.53, 172.67.180.106, 2606:4700:3035::6815:1235, 2606:4700:3036::ac43:b46a
Response IP 104.21.18.53
Found Yes
Hash 0bec3b487f520ac909ef0429661ab41f09ae502b2d4f9ecf807932d1513ac9d7
SimHash 651d9e41b910

Groups

*

Rule Path
Disallow /noticia/versao_impressa/
Disallow /noticia/json/
Disallow /noticia/noticia_redirect/
Disallow /busca/
Disallow /frame_ultima_edicao/
Disallow /banner/clica_banner/
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://www.progresso.com.br/sitemap/
sitemap https://www.progresso.com.br/sitemap/news/
sitemap https://www.progresso.com.br/sitemap/categories/