oglobo.globo.com
robots.txt
Robots Exclusion Standard data for oglobo.globo.com
Resource Scan
Scan Details
Site Domain | oglobo.globo.com |
Base Domain | globo.com |
Scan Status | Ok |
Last Scan | 2024-06-26T09:06:54+00:00 |
Next Scan | 2024-07-03T09:06:54+00:00 |
Last Scan
Scanned | 2024-06-26T09:06:54+00:00 |
URL | https://oglobo.globo.com/robots.txt |
Domain IPs | 201.7.177.244 |
Response IP | 201.7.177.244 |
Found | Yes |
Hash | 684465a062b8c8034b07914dbe5376469049c3e78548c75528f2bcfb69930d2d |
SimHash | cb6089e100f2 |
Groups
*
Rule | Path |
---|---|
Disallow | /_inc/ |
Disallow | /_inc_novo/ |
Disallow | /_img/ |
Disallow | /skins/ |
Disallow | /in/ |
Disallow | /ece_incoming/ |
Disallow | /inc/ |
Disallow | /img/ |
Disallow | /servicos/ |
Disallow | /content/ |
Disallow | /oglobo-mobile/ |
Disallow | /config/ |
Disallow | /email_marketing/ |
Disallow | /email_mkt/ |
Disallow | /rss.xml |
Disallow | /*.json |
Disallow | */ALTERNATES/* |
Disallow | */BINARY/* |
Disallow | /*?word* |
Disallow | /*?anyWord* |
Disallow | /*?noneWord* |
Disallow | /*?exactWord* |
Disallow | /*?decade* |
Disallow | /*?year* |
Disallow | /*?month* |
Disallow | /*?day* |
Disallow | /busca/ |
Disallow | /beta/ |
Allow | /in/*.jpg |
Allow | /in/*.JPG |
Allow | /in/*.jpeg |
Allow | /in/*.JPEG |
Allow | /in/*.png |
Allow | /in/*.PNG |
Allow | /in/*.gif |
Allow | /in/*.GIF |
Allow | /in/*.tif |
Allow | /in/*.TIF |
Allow | /in/*.bmp |
Allow | /in/*.BMP |
Other Records
Field | Value |
---|---|
sitemap | https://oglobo.globo.com/sitemap/oglobo/news.xml |
sitemap | https://oglobo.globo.com/sitemap/topic/oglobo/sitemap.xml |
sitemap | https://oglobo.globo.com/sitemap/oglobo/sitemap.xml |
sitemap | https://oglobo.globo.com/sitemap/home/oglobo/sitemap.xml |
sitemap | https://oglobo.globo.com/sitemap/section/oglobo/sitemap.xml |