m.extra.globo.com
robots.txt
Robots Exclusion Standard data for m.extra.globo.com
Resource Scan
Scan Details
Site Domain | m.extra.globo.com |
Base Domain | globo.com |
Scan Status | Ok |
Last Scan | 2024-05-07T12:46:01+00:00 |
Next Scan | 2024-06-06T12:46:01+00:00 |
Last Scan
Scanned | 2024-05-07T12:46:01+00:00 |
URL | http://m.extra.globo.com/robots.txt |
Redirect | https://extra.globo.com/robots.txt |
Redirect Domain | extra.globo.com |
Redirect Base | globo.com |
Domain IPs | 186.192.81.177 |
Redirect IPs | 186.192.81.177 |
Response IP | 186.192.81.177 |
Found | Yes |
Hash | 7b2b4846f3f53d38e3cfe4fb65fbcf22d2633ffea74afb23e2f6e5c14791d5a1 |
SimHash | a10589600711 |
Groups
*
Rule | Path |
---|---|
Disallow | /busca/ |
Disallow | /beta/ |
Other Records
Field | Value |
---|---|
sitemap | https://extra.globo.com/sitemap/extra/news.xml |
sitemap | https://extra.globo.com/sitemap/topic/extra/sitemap.xml |
sitemap | https://extra.globo.com/sitemap/extra/sitemap.xml |
sitemap | https://extra.globo.com/sitemap/home/extra/sitemap.xml |
Comments