topmidianews.com.br
robots.txt

Robots Exclusion Standard data for topmidianews.com.br

Resource Scan

Scan Details

Site Domain topmidianews.com.br
Base Domain topmidianews.com.br
Scan Status Ok
Last Scan 2024-11-15T07:45:49+00:00
Next Scan 2024-11-22T07:45:49+00:00

Last Scan

Scanned 2024-11-15T07:45:49+00:00
URL https://topmidianews.com.br/robots.txt
Redirect https://www.topmidianews.com.br/robots.txt
Redirect Domain www.topmidianews.com.br
Redirect Base topmidianews.com.br
Domain IPs 104.21.66.247, 172.67.166.136, 2606:4700:3030::ac43:a688, 2606:4700:3032::6815:42f7
Redirect IPs 104.21.66.247, 172.67.166.136, 2606:4700:3030::ac43:a688, 2606:4700:3032::6815:42f7
Response IP 104.21.66.247
Found Yes
Hash fb1cd9b89c0261a5d2dfce8c552dcec1d99fbbdfe5df6f0155373ef36cdff193
SimHash e9541a55e910

Groups

*

Rule Path
Disallow /noticia/versao_impressa/
Disallow /noticia/json/
Disallow /noticia/noticia_redirect/
Disallow /busca/
Disallow /frame_ultima_edicao/
Disallow /banner/clica_banner/
Disallow /cdn-cgi/
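
The rules for the `*` group can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the robots.txt file can be reconstructed from the Rule/Path table above (the exact bytes of the original file are not shown in this report), and the user-agent name `example-bot` and the helper names are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Rule/Path table above (an assumption; the
# original file may contain comments or different ordering).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /noticia/versao_impressa/",
    "Disallow: /noticia/json/",
    "Disallow: /noticia/noticia_redirect/",
    "Disallow: /busca/",
    "Disallow: /frame_ultima_edicao/",
    "Disallow: /banner/clica_banner/",
    "Disallow: /cdn-cgi/",
]

BASE_URL = "https://www.topmidianews.com.br"


def build_parser(lines):
    """Parse robots.txt lines without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, path, agent="example-bot"):
    """Return True if `agent` (hypothetical name) may fetch `path`."""
    return parser.can_fetch(agent, BASE_URL + path)


parser = build_parser(ROBOTS_LINES)

if __name__ == "__main__":
    for path in ("/busca/", "/noticia/json/feed", "/noticia/"):
        print(path, is_allowed(parser, path))
```

Matching here is by path prefix, so `/noticia/` itself stays fetchable while the three listed subdirectories under it do not.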

Other Records

Field Value
sitemap https://www.topmidianews.com.br/sitemap/
sitemap https://www.topmidianews.com.br/sitemap/news/
sitemap https://www.topmidianews.com.br/sitemap/categories/
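
The sitemap records above can be read with the same standard-library parser. This is a sketch only: it assumes the records appear in the file as ordinary `Sitemap:` lines, and it relies on `RobotFileParser.site_maps()`, which is available in Python 3.8 and later.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the sitemap records in the original robots.txt.
SITEMAP_LINES = [
    "Sitemap: https://www.topmidianews.com.br/sitemap/",
    "Sitemap: https://www.topmidianews.com.br/sitemap/news/",
    "Sitemap: https://www.topmidianews.com.br/sitemap/categories/",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap records are present.
    return parser.site_maps() or []


sitemaps = list_sitemaps(SITEMAP_LINES)

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```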