nationalgeographicbrasil.com
robots.txt

Robots Exclusion Standard data for nationalgeographicbrasil.com

Resource Scan

Scan Details

Site Domain nationalgeographicbrasil.com
Base Domain nationalgeographicbrasil.com
Scan Status Ok
Last Scan2024-11-14T03:22:58+00:00
Next Scan 2024-11-21T03:22:58+00:00

Last Scan

Scanned2024-11-14T03:22:58+00:00
URL https://www.nationalgeographicbrasil.com/robots.txt
Domain IPs 23.215.7.16, 23.215.7.20
Response IP 23.52.40.33
Found Yes
Hash 133bad51b3a5662e2c87f1ea9f8fc99ee8bcd90df5e844c21cc140c4f0233f8b
SimHash 6d1c9a52d303

Groups

*

Rule Path
Disallow /*%7B*
Disallow /api-merlin*
Disallow /admin*
Disallow /node*
Disallow /files*
Disallow /*?*
Disallow /*.aspx*
Disallow /*.php*
Disallow /canary-test/*
Allow /*?page=*
Allow /*?cmp=*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nationalgeographicbrasil.com/sitemap/sitemap.xml
sitemap https://www.nationalgeographicbrasil.com/sitemap-video.xml
sitemap https://www.nationalgeographicbrasil.com/google-news.xml