progresso.com.br
robots.txt
Robots Exclusion Standard data for progresso.com.br
Resource Scan
Scan Details
| Site Domain | progresso.com.br |
| Base Domain | progresso.com.br |
| Scan Status | Ok |
| Last Scan | 2025-12-15T11:05:14+00:00 |
| Next Scan | 2025-12-22T11:05:14+00:00 |
Last Scan
| Scanned | 2025-12-15T11:05:14+00:00 |
| URL | https://progresso.com.br/robots.txt |
| Redirect | https://www.progresso.com.br/robots.txt |
| Redirect Domain | www.progresso.com.br |
| Redirect Base | progresso.com.br |
| Domain IPs | 104.21.18.53, 172.67.180.106, 2606:4700:3035::6815:1235, 2606:4700:3036::ac43:b46a |
| Redirect IPs | 104.21.18.53, 172.67.180.106, 2606:4700:3035::6815:1235, 2606:4700:3036::ac43:b46a |
| Response IP | 104.21.18.53 |
| Found | Yes |
| Hash | 0bec3b487f520ac909ef0429661ab41f09ae502b2d4f9ecf807932d1513ac9d7 |
| SimHash | 651d9e41b910 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /noticia/versao_impressa/ |
| Disallow | /noticia/json/ |
| Disallow | /noticia/noticia_redirect/ |
| Disallow | /busca/ |
| Disallow | /frame_ultima_edicao/ |
| Disallow | /banner/clica_banner/ |
| Disallow | /cdn-cgi/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.progresso.com.br/sitemap/ |
| sitemap | https://www.progresso.com.br/sitemap/news/ |
| sitemap | https://www.progresso.com.br/sitemap/categories/ |