jornaldocomercio.com.br
robots.txt
Robots Exclusion Standard data for jornaldocomercio.com.br
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | jornaldocomercio.com.br |
| Base Domain | jornaldocomercio.com.br |
| Scan Status | Ok |
| Last Scan | 2026-02-26T03:00:58+00:00 |
| Next Scan | 2026-03-05T03:00:58+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-26T03:00:58+00:00 |
| URL | https://jornaldocomercio.com.br/robots.txt |
| Redirect | https://www.jornaldocomercio.com/robots.txt |
| Redirect Domain | www.jornaldocomercio.com |
| Redirect Base | jornaldocomercio.com |
| Domain IPs | 104.21.52.56, 172.67.195.241, 2606:4700:3030::6815:3438, 2606:4700:3036::ac43:c3f1 |
| Redirect IPs | 104.26.8.35, 104.26.9.35, 172.67.72.107, 2606:4700:20::681a:823, 2606:4700:20::681a:923, 2606:4700:20::ac43:486b |
| Response IP | 172.67.72.107 |
| Found | Yes |
| Hash | 79d3234909523b705024af282387350f8ae549a747ac123e9f56aa73b79b1a61 |
| SimHash | 48847ab7a913 |
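
The Hash row can be used to tell whether the file changed between scans. The report does not state which algorithm produced it. The value is 64 hexadecimal digits, which is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function names `content_hash` and `has_changed` are hypothetical and do not come from the scanner.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against a freshly fetched body."""
    return content_hash(body) != previous_hash.lower()


if __name__ == "__main__":
    sample = b"User-agent: *\nAllow: /\n"
    stored = content_hash(sample)
    print(has_changed(stored, sample))            # unchanged body
    print(has_changed(stored, sample + b"\n"))    # one extra byte
```

Any byte-level difference, including trailing whitespace, yields a different digest, which is why a separate similarity measure such as the SimHash row is useful for near-duplicate detection.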
Groups
User-agent: *
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /_conteudos/ |
| Disallow | /*.json |
| Disallow | /*.php |
| Disallow | /webparts/ |
| Disallow | /search/ |
| Disallow | /tags/ |
| Disallow | /autor/ |
| Disallow | /site/* |
| Allow | /site/noticia.php |
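
Several of the rules above overlap, for example `/site/*`, `/*.php`, and `/site/noticia.php`. The following is a minimal sketch of how a crawler might resolve them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are illustrative only, and a given crawler may evaluate the rules differently.

```python
import re

# Rules copied from the table above, in file order.
RULES = [
    ("allow", "/"),
    ("disallow", "/_conteudos/"),
    ("disallow", "/*.json"),
    ("disallow", "/*.php"),
    ("disallow", "/webparts/"),
    ("disallow", "/search/"),
    ("disallow", "/tags/"),
    ("disallow", "/autor/"),
    ("disallow", "/site/*"),
    ("allow", "/site/noticia.php"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; Allow wins when lengths tie."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ("/site/noticia.php", "/site/outra.php", "/tags/economia", "/economia/"):
        print(path, is_allowed(path))
```

Under this assumption `/site/noticia.php` is allowed because its 17-character Allow pattern is longer than both `/site/*` and `/*.php`, while any other path under `/site/` stays blocked. A parser that applies the first matching rule in file order would instead stop at `Allow: /` and permit everything, so the precedence model matters for this file.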
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.jornaldocomercio.com/sitemap.xml |