radioglobo.globo.com
robots.txt

Robots Exclusion Standard data for radioglobo.globo.com

Resource Scan

Scan Details

Site Domain radioglobo.globo.com
Base Domain globo.com
Scan Status Ok
Last Scan 2025-12-12T15:04:16+00:00
Next Scan 2025-12-19T15:04:16+00:00
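
The two timestamps above imply a weekly rescan. A minimal sketch of that arithmetic, using only the values shown in this report (the variable names are illustrative, and the scanner's real scheduling logic is not documented here):

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2025-12-12T15:04:16+00:00")
next_scan = datetime.fromisoformat("2025-12-19T15:04:16+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```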

Last Scan

Scanned 2025-12-12T15:04:16+00:00
URL https://radioglobo.globo.com/robots.txt
Domain IPs 186.192.81.189
Response IP 186.192.81.189
Found Yes
Hash a59d1032b336967c951ea2f17241371e5c9b15180062a85caa3907ce3245dd89
SimHash 2c010108e9f3
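
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not say which algorithm the scanner uses. Under that assumption, a change detector could be sketched as follows. The function name and the sample body are hypothetical, and the sample is not the live file, so its digest will not match the one listed:

```python
import hashlib

def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical sample body, NOT the actual file served by the site.
sample = b"User-agent: *\nDisallow: /busca/\nDisallow: /beta/\n"
digest = content_fingerprint(sample)

# Any byte-level change yields a different digest, which is what makes an
# exact hash useful for spotting edits between two scans.
changed = content_fingerprint(sample + b"Disallow: /x/\n")
print(digest != changed)
```

An exact hash only says that something changed. A SimHash, such as the one listed above, is normally used to estimate how similar two versions are, but its construction in this scanner is not described, so it is not reproduced here.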

Groups

*

Rule Path
Disallow /busca/
Disallow /beta/

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

ai-powered-bot

Rule Path
Disallow /

sabiábot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

grok

Rule Path
Disallow /

grokbot

Rule Path
Disallow /

grokai

Rule Path
Disallow /

xai

Rule Path
Disallow /

copilot-llm

Rule Path
Disallow /

copilot

Rule Path
Disallow /

copilotai

Rule Path
Disallow /

copilotbot

Rule Path
Disallow /
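
To illustrate how a crawler would interpret the groups above, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is an excerpt reconstructed from this report (only three of the listed groups), not the live file, and the helper name allowed is hypothetical:

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the groups listed above. The remaining
# AI-crawler groups follow the same "Disallow: /" pattern and are omitted.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /busca/
Disallow: /beta/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

def allowed(agent: str, path: str) -> bool:
    """Check whether `agent` may fetch `path` on radioglobo.globo.com."""
    return parser.can_fetch(agent, "https://radioglobo.globo.com" + path)

# Agents without their own group fall back to the "*" group.
print(allowed("Googlebot", "/"))        # only /busca/ and /beta/ are blocked
print(allowed("Googlebot", "/busca/"))  # blocked by the "*" group
print(allowed("GPTBot", "/"))           # blocked everywhere by its own group
```

Agent names are matched case-insensitively by this parser, so "GPTBot" matches the lowercase "gptbot" group shown in the report.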

Other Records

Field Value
sitemap https://radioglobo.globo.com/sitemap/radioglobo/news.xml
sitemap https://radioglobo.globo.com/sitemap/topic/radioglobo/sitemap.xml
sitemap https://radioglobo.globo.com/sitemap/radioglobo/sitemap.xml
sitemap https://radioglobo.globo.com/sitemap/home/radioglobo/sitemap.xml
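
The sitemap records above can be read back with the same standard-library parser. This is a sketch assuming Python 3.8 or later, where RobotFileParser.site_maps() is available. The input lines are rebuilt from this report rather than fetched from the site:

```python
from urllib.robotparser import RobotFileParser

# Sitemap records as they would appear in the file, rebuilt from the report.
SITEMAP_LINES = [
    "Sitemap: https://radioglobo.globo.com/sitemap/radioglobo/news.xml",
    "Sitemap: https://radioglobo.globo.com/sitemap/topic/radioglobo/sitemap.xml",
    "Sitemap: https://radioglobo.globo.com/sitemap/radioglobo/sitemap.xml",
    "Sitemap: https://radioglobo.globo.com/sitemap/home/radioglobo/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no Sitemap records exist, hence the fallback.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```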

Comments

  • robots.txt