globoesporte.globo.com
robots.txt

Robots Exclusion Standard data for globoesporte.globo.com

Resource Scan

Scan Details

Site Domain globoesporte.globo.com
Base Domain globo.com
Scan Status Ok
Last Scan2024-05-07T20:38:19+00:00
Next Scan 2024-05-21T20:38:19+00:00

Last Scan

Scanned2024-05-07T20:38:19+00:00
URL https://globoesporte.globo.com/robots.txt
Redirect https://ge.globo.com/robots.txt
Redirect Domain ge.globo.com
Redirect Base globo.com
Domain IPs 186.192.81.25
Redirect IPs 34.160.91.32
Response IP 34.160.91.32
Found Yes
Hash 684effb9e6f1b81de550858738bd368544c1cdfc897ce4876079581b8c548dc0
SimHash 2004d90ac5f1

Groups

*

Rule Path
Disallow /publieditorial
Disallow /eu-atleta/zcalendario/calendario.html
Disallow /servico
Disallow /dynamo
Disallow /beta
Disallow *globo-cdn-src/*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ge.globo.com/sitemap/ge/sitemap.xml