nationalgeographicla.com
robots.txt

Robots Exclusion Standard data for nationalgeographicla.com

Resource Scan

Scan Details

Site Domain nationalgeographicla.com
Base Domain nationalgeographicla.com
Scan Status Ok
Last Scan 2024-06-02T22:43:41+00:00
Next Scan 2024-06-09T22:43:41+00:00

Last Scan

Scanned 2024-06-02T22:43:41+00:00
URL https://nationalgeographicla.com/robots.txt
Redirect https://www.nationalgeographicla.com/robots.txt
Redirect Domain www.nationalgeographicla.com
Redirect Base nationalgeographicla.com
Domain IPs 3.209.166.169, 34.232.191.251
Redirect IPs 104.69.174.172
Response IP 104.69.174.172
Found Yes
Hash a875278c59c249ace7bcf7b9ce4a2dbe23468b837fed28497af34d3d21ad8a84
SimHash 695c8252db33

Groups

*

Rule Path
Disallow /*%7B*
Disallow /api-merlin*
Disallow /admin*
Disallow /node*
Disallow /files*
Disallow /*?*
Disallow /*.aspx*
Disallow /*.php*
Allow /*?page=*
Allow /*?cmp=*
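
The rules in the "*" group overlap: a URL such as /fotografia?page=2 matches both Disallow /*?* and Allow /*?page=*. The sketch below shows one way such overlaps could be resolved, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex, and is_allowed are hypothetical helpers, pattern length is counted in characters rather than octets for simplicity, and individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/*%7B*"),
    ("disallow", "/api-merlin*"),
    ("disallow", "/admin*"),
    ("disallow", "/node*"),
    ("disallow", "/files*"),
    ("disallow", "/*?*"),
    ("disallow", "/*.aspx*"),
    ("disallow", "/*.php*"),
    ("allow", "/*?page=*"),
    ("allow", "/*?cmp=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like robots.txt rules.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ["/animales", "/admin/login", "/fotografia?page=2",
                   "/fotografia?sort=asc", "/index.php"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, query strings are blocked in general, while URLs carrying a page= or cmp= parameter directly after the "?" remain crawlable because the Allow patterns are longer than /*?*. The sample paths are illustrative and were not taken from the scanned site.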

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
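
The file defines three groups, so a crawler first has to decide which one applies to it. A minimal sketch of that selection follows, assuming a case-insensitive comparison of the group name against the crawler's user-agent string with "*" as the fallback. GROUP_NAMES and select_group are hypothetical names, and the substring test is a simplification of the product-token matching in RFC 9309.

```python
# Group names as listed in the scan above.
GROUP_NAMES = ["*", "gptbot", "google-extended"]


def select_group(user_agent, group_names=GROUP_NAMES):
    """Return the group name that applies to `user_agent`.

    Assumption: a named group applies when its name occurs in the
    lowercased user-agent string; otherwise the "*" group applies.
    """
    ua = user_agent.lower()
    for name in group_names:
        if name != "*" and name in ua:
            return name
    return "*"


if __name__ == "__main__":
    for ua in ["GPTBot/1.0", "Google-Extended", "Googlebot/2.1"]:
        print(ua, "->", select_group(ua))
```

With this selection, a crawler identifying as GPTBot or Google-Extended falls under a group whose only rule is Disallow /, which blocks the whole site for it, while any other crawler is governed by the "*" group.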

Other Records

Field Value
sitemap https://www.nationalgeographicla.com/sitemap/sitemap.xml
sitemap https://www.nationalgeographicla.com/sitemap-video.xml
sitemap https://www.nationalgeographicla.com/google-news.xml