jornalcruzeiro.com.br
robots.txt
Robots Exclusion Standard data for jornalcruzeiro.com.br
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | jornalcruzeiro.com.br |
Base Domain | jornalcruzeiro.com.br |
Scan Status | Ok |
Last Scan | 2024-06-26T09:44:09+00:00 |
Next Scan | 2024-07-03T09:44:09+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-26T09:44:09+00:00 |
URL | https://jornalcruzeiro.com.br/robots.txt |
Redirect | https://www.jornalcruzeiro.com.br/robots.txt |
Redirect Domain | www.jornalcruzeiro.com.br |
Redirect Base | jornalcruzeiro.com.br |
Domain IPs | 104.21.33.31, 172.67.140.190, 2606:4700:3034::ac43:8cbe, 2606:4700:3035::6815:211f |
Redirect IPs | 104.21.33.31, 172.67.140.190, 2606:4700:3034::ac43:8cbe, 2606:4700:3035::6815:211f |
Response IP | 104.21.33.31 |
Found | Yes |
Hash | ac5f1a2acc4c54e7194005a403837bc425971b855ba9b76fc55d8ecb373c1a9f |
SimHash | 08945af74993 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /_conteudos/ |
Disallow | /*.json |
Disallow | /*.php |
Disallow | /webparts/ |
Disallow | /search/ |
Disallow | /tags/ |
Disallow | /autor/ |
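
The rules above can be read with the matching behaviour described in RFC 9309: the longest matching pattern decides, `*` matches any run of characters, a trailing `$` anchors the end of the path, and Allow wins a tie. The following is a minimal sketch under those assumptions, not the scanner's own logic; the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and introduced only for illustration.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/_conteudos/"),
    ("disallow", "/*.json"),
    ("disallow", "/*.php"),
    ("disallow", "/webparts/"),
    ("disallow", "/search/"),
    ("disallow", "/tags/"),
    ("disallow", "/autor/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the longest-match rule."""
    best_len = -1
    best_allow = True  # with no matching rule, crawling is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow and not best_allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under this reading, `is_allowed("/tags/politica")` and `is_allowed("/api/data.json")` are false, because `/tags/` and `/*.json` are longer than the blanket `Allow: /`, while an ordinary article path that matches only `/` stays crawlable.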
Other Records
Field | Value |
---|---|
sitemap | https://www.jornalcruzeiro.com.br/sitemap/1.xml |
sitemap | https://www.jornalcruzeiro.com.br/sitemap/map/day/sitemap.xml |
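
Records such as the sitemap entries above sit outside any user-agent group and can be collected with a simple line scan. The sketch below assumes the usual `Field: value` line syntax with `#` comments; `sitemap_urls` is a hypothetical helper, and `SAMPLE` is an illustrative reconstruction built from the values in this report, not the verbatim file that was scanned.

```python
def sitemap_urls(robots_txt):
    """Collect the values of all Sitemap records in a robots.txt body."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Split at the first colon only, so the URL's own colons survive.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Illustrative reconstruction (assumption), not the scanned file itself.
SAMPLE = """User-agent: *
Allow: /
Disallow: /tags/
Sitemap: https://www.jornalcruzeiro.com.br/sitemap/1.xml
Sitemap: https://www.jornalcruzeiro.com.br/sitemap/map/day/sitemap.xml
"""
```

Calling `sitemap_urls(SAMPLE)` returns the two sitemap URLs in file order; field names are compared case-insensitively, so `sitemap:` and `Sitemap:` are treated alike.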