www-cdn.natgeofe.com
robots.txt

Robots Exclusion Standard data for www-cdn.natgeofe.com

Resource Scan

Scan Details

Site Domain www-cdn.natgeofe.com
Base Domain natgeofe.com
Scan Status Ok
Last Scan2024-11-08T14:25:27+00:00
Next Scan 2024-11-15T14:25:27+00:00

Last Scan

Scanned2024-11-08T14:25:27+00:00
URL https://www-cdn.natgeofe.com/robots.txt
Redirect https://www.nationalgeographic.com/robots.txt
Redirect Domain www.nationalgeographic.com
Redirect Base nationalgeographic.com
Domain IPs 13.33.28.108, 13.33.28.21, 13.33.28.30, 13.33.28.4
Redirect IPs 13.33.28.108, 13.33.28.21, 13.33.28.30, 13.33.28.4
Response IP 13.33.28.30
Found Yes
Hash fd0127c2f2a8a002f63e597e43836e82e2682ca1bab3bdc30f279924d357d6d1
SimHash 30182945c872

Groups

*

Rule Path
Disallow /search*
Disallow /tv/watch-live/
Disallow /tv/browse
Disallow /tv/movies-and-specials/
Disallow /tv/shows/
Disallow /tv/my-profile
Disallow /api/federation/
Disallow /cgi-bin/
Disallow /*.swf$
Disallow /*eid%3D
Disallow /*email%3D
Disallow /*intcmp%3D
Disallow /*ngc%3D
Disallow /*referrer%3D
Disallow /*widgets%3D
Disallow /admin*
Disallow /ads/
Disallow /au/
Disallow /cp/
Disallow /ebooklets/
Disallow /event.ng/
Disallow /image/
Disallow /in/
Disallow /magazines/l/multisubs/*
Disallow /magazines/all-banners-2*
Disallow /magazines/all-banners*
Disallow /magazines/all-dl*
Disallow /magazines/all-flyout*
Disallow /magazines/all-Ip*
Disallow /magazines/all-lp*
Disallow /magazines/all-sem*
Disallow /magazines/all-subs*
Disallow /magazines/Ip*
Disallow /magazines/lp*
Disallow /magazines/natgeomagazines-all*
Disallow /magazines/natgeomagazines*
Disallow /magazines/print-sem*
Disallow /media/interactive/cdrom/onehundredandeightyears/
Disallow /media/tv/channel/credits/
Disallow /pc/
Disallow /resources/ngo/maps/atlas/
Disallow /topics/

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

googleother-image

Rule Path
Disallow /

googleother-video

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nationalgeographic.com/sitemaps/sitemap.xml
sitemap https://nationalgeographic.com/tv/sitemapindex-episodes.xml
sitemap https://nationalgeographic.com/tv/sitemapindex-showmap.xml
sitemap https://nationalgeographic.com/tv/sitemapindex-videomap.xml
sitemap https://nationalgeographic.com/expeditions/sitemap.xml

Comments

  • Directory Excludes
  • Legacy Excludes
  • Disallow Directives for AI Web Crawlers
  • Announce Sitemap Locations