natgeo.com
robots.txt

Robots Exclusion Standard data for natgeo.com

Resource Scan

Scan Details

Site Domain natgeo.com
Base Domain natgeo.com
Scan Status Ok
Last Scan 2024-04-30T20:29:14+00:00
Next Scan 2024-05-07T20:29:14+00:00

Last Scan

Scanned 2024-04-30T20:29:14+00:00
URL https://natgeo.com/robots.txt
Redirect https://www.nationalgeographic.com/robots.txt
Redirect Domain www.nationalgeographic.com
Redirect Base nationalgeographic.com
Domain IPs 75.2.26.191, 99.83.251.4
Redirect IPs 13.33.30.113, 13.33.30.120, 13.33.30.125, 13.33.30.45
Response IP 13.33.30.120
Found Yes
Hash 4a11169e86022462def11b145441cf398bf0d4b37f0dd3a240dcf527b7cec4f3
SimHash 70183944c83a

Groups

*

Rule Path
Disallow /search*
Disallow /tv/watch-live/
Disallow /tv/browse
Disallow /tv/movies-and-specials/
Disallow /tv/shows/
Disallow /tv/my-profile
Disallow /cgi-bin/
Disallow /*.swf$
Disallow /*eid%3D
Disallow /*email%3D
Disallow /*intcmp%3D
Disallow /*ngc%3D
Disallow /*referrer%3D
Disallow /*widgets%3D
Disallow /admin*
Disallow /ads/
Disallow /au/
Disallow /cp/
Disallow /ebooklets/
Disallow /event.ng/
Disallow /image/
Disallow /in/
Disallow /magazines/l/multisubs/*
Disallow /magazines/all-banners-2*
Disallow /magazines/all-banners*
Disallow /magazines/all-dl*
Disallow /magazines/all-flyout*
Disallow /magazines/all-Ip*
Disallow /magazines/all-lp*
Disallow /magazines/all-sem*
Disallow /magazines/all-subs*
Disallow /magazines/Ip*
Disallow /magazines/lp*
Disallow /magazines/natgeomagazines-all*
Disallow /magazines/natgeomagazines*
Disallow /magazines/print-sem*
Disallow /media/interactive/cdrom/onehundredandeightyears/
Disallow /media/tv/channel/credits/
Disallow /pc/
Disallow /resources/ngo/maps/atlas/
Disallow /topics/
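
Several of the paths above use the `*` wildcard and the `$` end anchor. As a rough illustration of how such patterns are commonly interpreted (`*` matches any run of characters, a trailing `$` pins the match to the end of the path, and everything else is a prefix match), the following minimal sketch converts a few of the listed rules to regular expressions. The helper names `pattern_to_regex` and `is_disallowed` and the `SAMPLE_RULES` subset are illustrative assumptions, not part of the scan output, and the sketch skips percent-encoding normalization and Allow/Disallow precedence (this group contains no Allow rules).

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex (hypothetical helper)."""
    # A trailing "$" means the pattern must match through the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any sequence of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

# A small subset of the rules listed for the "*" group (chosen for illustration).
SAMPLE_RULES = ["/search*", "/tv/shows/", "/*.swf$", "/*intcmp%3D", "/admin*"]

if __name__ == "__main__":
    for candidate in ["/search?q=lions", "/animals/intro.swf", "/animals/lion"]:
        print(candidate, is_disallowed(candidate, SAMPLE_RULES))
```

Under this reading, `/*.swf$` blocks only paths that end in `.swf`, while `/search*` behaves the same as the plain prefix `/search`.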

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /
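
The groups above mean that a named crawler such as gptbot is governed by its own group (a blanket `Disallow /`), while any crawler without a named group falls back to the `*` group. A minimal sketch of that selection, using Python's standard `urllib.robotparser`, is shown below. The `ROBOTS_EXCERPT` text is a hand-written excerpt reconstructed from the groups listed here, not the full file, and `examplebot` is a made-up agent name. The standard-library parser does plain prefix matching, so the excerpt deliberately includes only rules without wildcards.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt based on the groups in this report (assumption: not the full file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /tv/shows/
Disallow: /topics/

User-agent: gptbot
Disallow: /

User-agent: anthropic-ai
Disallow: /
"""

def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser

parser = build_parser(ROBOTS_EXCERPT)
base = "https://www.nationalgeographic.com"

if __name__ == "__main__":
    # A crawler with its own group is blocked everywhere.
    print("GPTBot /animals:", parser.can_fetch("GPTBot", base + "/animals"))
    # An unlisted crawler (hypothetical name) falls back to the "*" group.
    print("examplebot /animals:", parser.can_fetch("examplebot", base + "/animals"))
    print("examplebot /topics/x:", parser.can_fetch("examplebot", base + "/topics/x"))
```

User-agent matching in the parser is case-insensitive, which is why `GPTBot` is matched against the lowercase `gptbot` group.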

Other Records

Field Value
sitemap https://www.nationalgeographic.com/sitemaps/sitemap.xml
sitemap https://nationalgeographic.com/tv/sitemapindex-episodes.xml
sitemap https://nationalgeographic.com/tv/sitemapindex-showmap.xml
sitemap https://nationalgeographic.com/tv/sitemapindex-videomap.xml

Comments

  • Directory Excludes
  • Legacy Excludes
  • Disallow Directives for AI Web Crawlers
  • Announce Sitemap Locations