www.folha.uol.com.br
robots.txt

Robots Exclusion Standard data for www.folha.uol.com.br

Resource Scan

Scan Details

Site Domain www.folha.uol.com.br
Base Domain uol.com.br
Scan Status Ok
Last Scan2024-11-08T20:47:35+00:00
Next Scan 2024-11-22T20:47:35+00:00

Last Scan

Scanned2024-11-08T20:47:35+00:00
URL https://www.folha.uol.com.br/robots.txt
Domain IPs 13.35.210.124, 13.35.210.25, 13.35.210.28, 13.35.210.89
Response IP 13.35.210.28
Found Yes
Hash a52d3d003409524b2df3d6a08f19c55790d4d3079515ba896e47472ec62b4141
SimHash 891e01a44551

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /folha/
Disallow /guia/
Disallow /logs/
Disallow /simulador/

googlebot-news

Rule Path
Allow *

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /virtual/

Comments

  • robots.txt for http://www.folha.com.br/
  • Contact webmaster@grupofolha.com.br if you have questions regarding this file