nationalgeographicla.com
robots.txt

Robots Exclusion Standard data for nationalgeographicla.com

Resource Scan

Scan Details

Site Domain nationalgeographicla.com
Base Domain nationalgeographicla.com
Scan Status Ok
Last Scan2024-11-11T06:06:37+00:00
Next Scan 2024-11-18T06:06:37+00:00

Last Scan

Scanned2024-11-11T06:06:37+00:00
URL https://nationalgeographicla.com/robots.txt
Redirect https://www.nationalgeographicla.com/robots.txt
Redirect Domain www.nationalgeographicla.com
Redirect Base nationalgeographicla.com
Domain IPs 35.173.116.2, 54.166.186.169
Redirect IPs 23.215.7.19, 23.215.7.20
Response IP 23.215.7.19
Found Yes
Hash b72dd5ebb45369791479b6e630ff58fcda07a099a5d24eeda3ce90c8d505e0fd
SimHash 495c8252db13

Groups

*

Rule Path
Disallow /*%7B*
Disallow /api-merlin*
Disallow /admin*
Disallow /node*
Disallow /files*
Disallow /*?*
Disallow /*.aspx*
Disallow /*.php*
Disallow /canary-test/*
Allow /*?page=*
Allow /*?cmp=*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nationalgeographicla.com/sitemap/sitemap.xml
sitemap https://www.nationalgeographicla.com/sitemap-video.xml
sitemap https://www.nationalgeographicla.com/google-news.xml