portalonorte.com.br
robots.txt

Robots Exclusion Standard data for portalonorte.com.br

Resource Scan

Scan Details

Site Domain portalonorte.com.br
Base Domain portalonorte.com.br
Scan Status Ok
Last Scan2024-09-18T19:47:31+00:00
Next Scan 2024-09-25T19:47:31+00:00

Last Scan

Scanned2024-09-18T19:47:31+00:00
URL https://portalonorte.com.br/robots.txt
Redirect https://www.portalonorte.com.br/robots.txt
Redirect Domain www.portalonorte.com.br
Redirect Base portalonorte.com.br
Domain IPs 104.21.26.124, 172.67.136.65, 2606:4700:3032::6815:1a7c, 2606:4700:3037::ac43:8841
Redirect IPs 104.21.26.124, 172.67.136.65, 2606:4700:3032::6815:1a7c, 2606:4700:3037::ac43:8841
Response IP 104.21.26.124
Found Yes
Hash bc8c47e0cde6bbb57f184992808e202b90e57bcb836c9ee070ac9cf421fed2f5
SimHash b9190e45ed12

Groups

*

Rule Path
Disallow /noticia/versao_impressa/
Disallow /noticia/json/
Disallow /noticia/noticia_redirect/
Disallow /busca/
Disallow /frame_ultima_edicao/
Disallow /banner/clica_banner/
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://www.portalonorte.com.br/sitemap/
sitemap https://www.portalonorte.com.br/sitemap/news/
sitemap https://www.portalonorte.com.br/sitemap/categories/