rss.noticias.uol.com.br
robots.txt

Robots Exclusion Standard data for rss.noticias.uol.com.br

Resource Scan

Scan Details

Site Domain rss.noticias.uol.com.br
Base Domain uol.com.br
Scan Status Ok
Last Scan2024-04-19T00:55:12+00:00
Next Scan 2024-05-19T00:55:12+00:00

Last Scan

Scanned2024-04-19T00:55:12+00:00
URL https://rss.noticias.uol.com.br/robots.txt
Redirect https://noticias.uol.com.br/robots.txt
Redirect Domain noticias.uol.com.br
Redirect Base uol.com.br
Domain IPs 200.147.4.74, 2804:49c:3101:405:ffff:ffff:ffff:22, 2804:49c:3102:405:ffff:ffff:ffff:6
Redirect IPs 23.209.46.14, 23.209.46.7, 2600:1413:b000:1e::17d1:2e4a, 2600:1413:b000:1e::17d1:2e5c
Response IP 184.27.123.11
Found Yes
Hash 97b37e09bd7a1ca6e68bdaa8fa45841e5bde3c87f75f81ff4c2f90ed4b3e9b0c
SimHash 281ce8465597

Groups

*

Rule Path
Disallow */dev/
Disallow /*.jhtm
Disallow /*.shl
Disallow /ultnot/
Disallow /uolnews/
Disallow /pelenet/
Disallow /pelenet/album/
Disallow /mundodigital/
Disallow /fernandorodrigues/
Disallow /censo-2010/
Disallow /elpais/
Disallow /vestibuol/etes/
Disallow /arquivohome/
Disallow /busca?q=
Disallow /next%3D
Disallow .jhtm

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://noticias.uol.com.br/sitemap/index.xml
sitemap https://noticias.uol.com.br/sitemap/v2/news-01.xml
sitemap https://noticias.uol.com.br/sitemap/v3/web-stories/