sportv.globo.com
robots.txt

Robots Exclusion Standard data for sportv.globo.com

Resource Scan

Scan Details

Site Domain sportv.globo.com
Base Domain globo.com
Scan Status Ok
Last Scan2024-06-18T04:27:47+00:00
Next Scan 2024-07-02T04:27:47+00:00

Last Scan

Scanned2024-06-18T04:27:47+00:00
URL https://sportv.globo.com/robots.txt
Domain IPs 186.192.81.26, 2804:294:4000:8000::5
Response IP 186.192.81.26
Found Yes
Hash 684effb9e6f1b81de550858738bd368544c1cdfc897ce4876079581b8c548dc0
SimHash 2004d90ac5f1

Groups

*

Rule Path
Disallow /publieditorial
Disallow /eu-atleta/zcalendario/calendario.html
Disallow /servico
Disallow /dynamo
Disallow /beta
Disallow *globo-cdn-src/*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ge.globo.com/sitemap/ge/sitemap.xml