globoesportes.com.br
robots.txt

Robots Exclusion Standard data for globoesportes.com.br

Resource Scan

Scan Details

Site Domain globoesportes.com.br
Base Domain globoesportes.com.br
Scan Status Ok
Last Scan2024-05-17T05:23:30+00:00
Next Scan 2024-05-24T05:23:30+00:00

Last Scan

Scanned2024-05-17T05:23:30+00:00
URL http://globoesportes.com.br/robots.txt
Redirect https://ge.globo.com/robots.txt
Redirect Domain ge.globo.com
Redirect Base globo.com
Domain IPs 186.192.83.5
Redirect IPs 35.227.102.207
Response IP 35.227.102.207
Found Yes
Hash 684effb9e6f1b81de550858738bd368544c1cdfc897ce4876079581b8c548dc0
SimHash 2004d90ac5f1

Groups

*

Rule Path
Disallow /publieditorial
Disallow /eu-atleta/zcalendario/calendario.html
Disallow /servico
Disallow /dynamo
Disallow /beta
Disallow *globo-cdn-src/*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ge.globo.com/sitemap/ge/sitemap.xml