educacao.uol.com.br
robots.txt

Robots Exclusion Standard data for educacao.uol.com.br

Resource Scan

Scan Details

Site Domain educacao.uol.com.br
Base Domain uol.com.br
Scan Status Ok
Last Scan 2024-05-09T00:14:59+00:00
Next Scan 2024-05-23T00:14:59+00:00

Last Scan

Scanned 2024-05-09T00:14:59+00:00
URL https://educacao.uol.com.br/robots.txt
Domain IPs 23.211.140.51, 23.211.140.74, 2600:1413:b000:1e::17d1:2e4a, 2600:1413:b000:1e::17d1:2e58
Response IP 184.27.123.67
Found Yes
Hash 8a51dbd5cb0c1175336ee018be9f91c108bfc6f9251a20a4ed0a0fed39ba8368
SimHash a83dc844e593
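
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not say which algorithm produced it or which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names `sha256_hex` and `matches_reported` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report.
REPORTED_HASH = "8a51dbd5cb0c1175336ee018be9f91c108bfc6f9251a20a4ed0a0fed39ba8368"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """Compare a fetched body against the reported hash.

    Assumption: the scanner hashed the unmodified response bytes with SHA-256.
    """
    return sha256_hex(body) == REPORTED_HASH
```

If a freshly fetched body does not match, either the file changed after the scan or the assumption about the hashing scheme is wrong.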

Groups

*

Rule Path
Disallow */dev/
Disallow /busca?q=
Disallow /*.jhtm
Disallow /next%3D

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
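
The groups above can be checked against concrete paths with a small matcher. This is a minimal sketch, assuming the common robots.txt pattern semantics in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and every other pattern is a prefix match. The names `GROUPS`, `pattern_to_regex`, and `is_disallowed` are hypothetical, and the group lookup is simplified to an exact, case-insensitive match on the user-agent token with a fallback to `*`.

```python
import re

# Disallow rules transcribed from the Groups tables in the scan.
GROUPS = {
    "*": ["*/dev/", "/busca?q=", "/*.jhtm", "/next%3D"],
    "gptbot": ["/"],
    "google-extended": ["/"],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(user_agent: str, path: str) -> bool:
    """Return True if any Disallow rule of the selected group matches the path.

    Simplification: the scan lists only Disallow rules, so no Allow
    precedence is implemented here.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, giving prefix semantics.
    return any(pattern_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("GPTBot", "/")` is true because the `gptbot` group blocks the whole site, while a crawler without its own group falls back to `*` and is only blocked from paths such as `/busca?q=enem` or anything ending in `.jhtm`.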

Other Records

Field Value
sitemap https://educacao.uol.com.br/sitemap/index.xml
sitemap https://educacao.uol.com.br/sitemap/v2/news-01.xml
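
Taken together, the groups and sitemap records describe a robots.txt file that can be approximately reconstructed. The sketch below renders one from the scanned data. The original casing, ordering, comments, and blank lines are not recoverable from the report, so the output is an approximation and should not be expected to reproduce the reported hash. `GROUPS`, `SITEMAPS`, and `render_robots_txt` are hypothetical names.

```python
# Data transcribed from the scan's Groups and Other Records tables.
GROUPS = [
    ("*", ["*/dev/", "/busca?q=", "/*.jhtm", "/next%3D"]),
    ("gptbot", ["/"]),
    ("google-extended", ["/"]),
]
SITEMAPS = [
    "https://educacao.uol.com.br/sitemap/index.xml",
    "https://educacao.uol.com.br/sitemap/v2/news-01.xml",
]


def render_robots_txt(groups, sitemaps) -> str:
    """Render groups and sitemap records as robots.txt text."""
    lines = []
    for agent, paths in groups:
        lines.append(f"User-agent: {agent}")
        lines.extend(f"Disallow: {path}" for path in paths)
        lines.append("")  # blank line separates groups
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, SITEMAPS), end="")
```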