gq.globo.com
robots.txt
Robots Exclusion Standard data for gq.globo.com
Resource Scan
Scan Details
Site Domain | gq.globo.com |
Base Domain | globo.com |
Scan Status | Ok |
Last Scan | 2024-04-27T00:48:01+00:00 |
Next Scan | 2024-05-04T00:48:01+00:00 |
Last Scan
Scanned | 2024-04-27T00:48:01+00:00 |
URL | https://gq.globo.com/robots.txt |
Domain IPs | 201.7.177.252 |
Response IP | 201.7.177.252 |
Found | Yes |
Hash | c1eba5b50c01ad14f93d3c9e9a946b86bea05015cb77d8504d07998b91b208c9 |
SimHash | a42d0844c113 |
Groups
*
Rule | Path |
---|---|
Disallow | /busca/ |
Disallow | /beta/ |
Other Records
Field | Value |
---|---|
sitemap | https://gq.globo.com/sitemap/last-news.xml |
sitemap | https://gq.globo.com/sitemap/gq/news.xml |
sitemap | https://gq.globo.com/sitemap/gq/sitemap.xml |
sitemap | https://gq.globo.com/sitemap/home/gq/sitemap.xml |
sitemap | https://gq.globo.com/sitemap/videos/gq/sitemap.xml |
Comments