globoads.globo.com
robots.txt

Robots Exclusion Standard data for globoads.globo.com

Resource Scan

Scan Details

Site Domain globoads.globo.com
Base Domain globo.com
Scan Status Ok
Last Scan 2025-08-30T20:22:49+00:00
Next Scan 2025-09-29T20:22:49+00:00

Last Scan

Scanned 2025-08-30T20:22:49+00:00
URL https://globoads.globo.com/robots.txt
Domain IPs 34.111.146.228
Response IP 34.111.146.228
Found Yes
Hash 43de352bced2bfeecd35b626d33029194665ad829cfa912b3e6cc9319ae35a08
SimHash 2a2c8944a1b0
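
The Hash field can be used to check whether a freshly fetched copy of the file is unchanged since the scan. The report does not name the algorithm. The 64 hexadecimal digits are consistent with a 256-bit digest such as SHA-256, so the sketch below assumes SHA-256 over the raw response body. Both the algorithm and the choice of raw bytes are assumptions, and the helper names are hypothetical.

```python
import hashlib

# Value copied from the "Hash" field of the scan report.
REPORTED_HASH = "43de352bced2bfeecd35b626d33029194665ad829cfa912b3e6cc9319ae35a08"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the reported value.

    Assumes the report hashed the raw bytes with SHA-256; a mismatch may
    mean the file changed or that this assumption is wrong.
    """
    return sha256_hex(body) == reported.lower()
```

Passing the bytes of a downloaded robots.txt to `matches_report` gives a quick changed/unchanged signal under those assumptions.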

Groups

*

Rule Path
Disallow /busca/
Disallow /beta/
Disallow /protocol/
Disallow /auth/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
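
To see how these groups apply to a given crawler, the rules can be rebuilt as robots.txt text and evaluated with Python's standard `urllib.robotparser`. This is a minimal sketch: the file text is reconstructed from the groups listed above, so its exact layout and casing are assumptions, and `ExampleBot` and the `allowed` helper are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in the scan report; the original file's
# exact layout, casing, and comments are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /busca/
Disallow: /beta/
Disallow: /protocol/
Disallow: /auth/

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /
"""

BASE = "https://globoads.globo.com"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Check whether `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [
        ("ExampleBot", "/"),          # hypothetical crawler, falls under "*"
        ("ExampleBot", "/busca/x"),
        ("GPTBot", "/"),
        ("CCBot", "/beta/"),
    ]:
        print(agent, path, allowed(agent, path))
```

With this parser, the three named crawlers are blocked from every path, while any other agent is blocked only under the four listed prefixes. The rules are prefix matches, so `/busca` without the trailing slash is not covered by `Disallow /busca/`.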

Other Records

Field Value
sitemap https://globoads.globo.com/sitemap/home/globo-negocios/sitemap.xml
sitemap https://globoads.globo.com/sitemap/globo-negocios/sitemap.xml
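
The sitemap records can be read back programmatically as well. The sketch below assumes the records appear in the file as standard `Sitemap:` lines (the report shows only the field and value) and uses `RobotFileParser.site_maps()`, which requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records as they would typically appear in robots.txt; the
# "Sitemap:" casing is an assumption, the URLs come from the report.
SITEMAP_RECORDS = """\
Sitemap: https://globoads.globo.com/sitemap/home/globo-negocios/sitemap.xml
Sitemap: https://globoads.globo.com/sitemap/globo-negocios/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(SITEMAP_RECORDS.splitlines())

# site_maps() returns None when no sitemap records are present.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```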

Comments

  • Sitemaps