oglobo.com
robots.txt

Robots Exclusion Standard data for oglobo.com

Resource Scan

Scan Details

Site Domain oglobo.com
Base Domain oglobo.com
Scan Status Ok
Last Scan 2024-05-08T10:11:21+00:00
Next Scan 2024-05-15T10:11:21+00:00

Last Scan

Scanned 2024-05-08T10:11:21+00:00
URL https://oglobo.com/robots.txt
Redirect https://oglobo.globo.com/robots.txt
Redirect Domain oglobo.globo.com
Redirect Base globo.com
Domain IPs 186.192.83.12
Redirect IPs 201.7.177.244
Response IP 201.7.177.244
Found Yes
Hash b51e08e5f83af8bbdd847613202b6700b6db4dd8f4278b38f777d6f911693a3b
SimHash cb4089f100f2
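
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a rough sketch under that assumption, a fingerprint of a fetched robots.txt body could be computed as follows. The function name `content_fingerprint` is hypothetical, and whether the scanner hashes the raw bytes or a normalized form is not known.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the robots.txt body.

    Assumption: SHA-256 over the raw response bytes. The report does not
    confirm the algorithm or any normalization step.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    sample = b"User-agent: *\nDisallow: /_inc/\n"
    print(content_fingerprint(sample))
```

Comparing the digest between two scans gives a cheap way to tell whether the file changed at all; the shorter SimHash is presumably intended for detecting near-duplicates instead.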

Groups

*

Rule Path
Disallow /_inc/
Disallow /_inc_novo/
Disallow /_img/
Disallow /skins/
Disallow /in/
Disallow /ece_incoming/
Disallow /inc/
Disallow /img/
Disallow /servicos/
Disallow /content/
Disallow /oglobo-mobile/
Disallow /config/
Disallow /email_marketing/
Disallow /email_mkt/
Disallow /rss.xml
Disallow /*.json
Disallow */ALTERNATES/*
Disallow */BINARY/*
Disallow /*?word*
Disallow /*?anyWord*
Disallow /*?noneWord*
Disallow /*?exactWord*
Disallow /*?decade*
Disallow /*?year*
Disallow /*?month*
Disallow /*?day*
Allow /in/*.jpg
Allow /in/*.JPG
Allow /in/*.jpeg
Allow /in/*.JPEG
Allow /in/*.png
Allow /in/*.PNG
Allow /in/*.gif
Allow /in/*.GIF
Allow /in/*.tif
Allow /in/*.TIF
Allow /in/*.bmp
Allow /in/*.BMP
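
The `*` group mixes a broad `Disallow /in/` with narrower `Allow /in/*.jpg` style rules, so the outcome for a given path depends on how a crawler resolves conflicts. The sketch below follows the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). It is a simplified illustration, not the matcher of any particular crawler; the helper names and the sample paths are assumptions, and only a handful of the rules above are included.

```python
import re


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Apply longest-match precedence; paths with no matching rule are allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow takes precedence.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


# A subset of the '*' group listed above.
RULES = [
    ("disallow", "/in/"),
    ("disallow", "/*.json"),
    ("disallow", "*/ALTERNATES/*"),
    ("disallow", "/*?word*"),
    ("allow", "/in/*.jpg"),
]

if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the rules.
    for path in ["/in/foto.jpg", "/in/page.html", "/busca?word=rio", "/rio/noticia.html"]:
        print(path, is_allowed(RULES, path))
```

Under this convention an image such as `/in/foto.jpg` stays crawlable because `/in/*.jpg` is longer than `/in/`, while other paths under `/in/` remain blocked.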

googlebot-news

Rule Path
Allow *

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
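
The groups above give named crawlers (googlebot-news, ccbot, google-extended, gptbot) their own rules, with `*` as the fallback for everyone else. A minimal sketch of how a crawler might pick its group is shown below. It assumes a case-insensitive substring match of the group name against the User-Agent string, preferring the longest name; real crawlers may match product tokens more strictly. The function name `select_group` and the sample User-Agent strings are hypothetical.

```python
# Group names as listed in this report, mapped to a one-line summary.
GROUPS = {
    "*": "path-specific Disallow/Allow rules",
    "googlebot-news": "Allow *",
    "ccbot": "Disallow /",
    "google-extended": "Disallow /",
    "gptbot": "Disallow /",
}


def select_group(groups, user_agent):
    """Return the name of the group that applies to user_agent.

    Assumption: case-insensitive substring matching, longest name wins,
    falling back to the '*' group when nothing more specific matches.
    """
    ua = user_agent.lower()
    candidates = [name for name in groups if name != "*" and name in ua]
    if candidates:
        return max(candidates, key=len)
    return "*"


if __name__ == "__main__":
    for ua in ["GPTBot", "Mozilla/5.0 (compatible; CCBot/2.0)", "Bingbot"]:
        name = select_group(GROUPS, ua)
        print(ua, "->", name, ":", GROUPS[name])
```

A crawler that matches a named group uses only that group's rules, so ccbot, google-extended, and gptbot are blocked site-wide rather than inheriting the path-level rules of the `*` group.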

Other Records

Field Value
sitemap https://oglobo.globo.com/sitemap/oglobo/news.xml
sitemap https://oglobo.globo.com/sitemap/topic/oglobo/sitemap.xml
sitemap https://oglobo.globo.com/sitemap/oglobo/sitemap.xml
sitemap https://oglobo.globo.com/sitemap/home/oglobo/sitemap.xml
sitemap https://oglobo.globo.com/sitemap/section/oglobo/sitemap.xml