nationalgeographic.de
robots.txt

Robots Exclusion Standard data for nationalgeographic.de

Resource Scan

Scan Details

Site Domain nationalgeographic.de
Base Domain nationalgeographic.de
Scan Status Ok
Last Scan 2024-10-30T17:22:14+00:00
Next Scan 2024-11-06T17:22:14+00:00

Last Scan

Scanned 2024-10-30T17:22:14+00:00
URL https://www.nationalgeographic.de/robots.txt
Domain IPs 23.215.7.19, 23.215.7.20
Response IP 23.215.7.19
Found Yes
Hash 9c58811c0e2a29cb6aafe56745b0cb2caace43d8a4b8849ebd4ebe10b0410334
SimHash 690c9250df03

Groups

*

Rule Path
Disallow /*%7B*
Disallow /api-merlin*
Disallow /admin*
Disallow /node*
Disallow /files*
Disallow /*?*
Disallow /*.aspx*
Disallow /*.php*
Disallow /canary-test/*
Allow /*?page=*
Allow /*?cmp=*

upday

Rule Path
Allow /rss_latest_contents?ptt=*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
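
The groups above can be read as a small decision procedure. The following is a minimal sketch, assuming the matching rules described in RFC 9309: a crawler uses only the group whose token matches its name (falling back to `*`), `*` in a path matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The function names and the sample paths are hypothetical, and the sketch does not normalize percent-encoding, so it is not a full implementation of the standard.

```python
import re

# Groups copied from the scan listing above, as (rule, path pattern) pairs.
GROUPS = {
    "*": [
        ("disallow", "/*%7B*"),
        ("disallow", "/api-merlin*"),
        ("disallow", "/admin*"),
        ("disallow", "/node*"),
        ("disallow", "/files*"),
        ("disallow", "/*?*"),
        ("disallow", "/*.aspx*"),
        ("disallow", "/*.php*"),
        ("disallow", "/canary-test/*"),
        ("allow", "/*?page=*"),
        ("allow", "/*?cmp=*"),
    ],
    "upday": [("allow", "/rss_latest_contents?ptt=*")],
    "gptbot": [("disallow", "/")],
    "google-extended": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def select_group(user_agent):
    """Return the rules for an exact, case-insensitive token match, else '*'.

    Simplification (assumption): real crawlers match on a product token
    inside a longer User-Agent string; here the caller passes the token.
    """
    return GROUPS.get(user_agent.lower(), GROUPS["*"])


def is_allowed(user_agent, path):
    """Apply longest-match precedence; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in select_group(user_agent):
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the rules.
    for agent, path in [
        ("Googlebot", "/tiere/artikel"),
        ("Googlebot", "/tiere?page=2"),
        ("Googlebot", "/tiere?utm_source=x"),
        ("GPTBot", "/tiere/artikel"),
        ("upday", "/admin/login"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, the `upday` group does not inherit the `*` rules: it contains a single Allow line and no Disallow lines, so the sketch treats every path as allowed for that agent.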

Other Records

Field Value
sitemap https://www.nationalgeographic.de/sitemap/sitemap.xml
sitemap https://www.nationalgeographic.de/sitemap-video.xml
sitemap https://www.nationalgeographic.de/google-news.xml
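
To show how the records in this listing map back onto robots.txt syntax, here is a rough sketch that serializes the groups and sitemap records into that format. The function name `to_robots_txt` is hypothetical. The original file's comments, blank lines, capitalization, and record order cannot be recovered from the scan, so the output is an approximation and should not be expected to reproduce the Hash value listed above.

```python
# Groups and sitemap records copied from the scan listing above.
GROUPS = [
    ("*", [
        ("Disallow", "/*%7B*"),
        ("Disallow", "/api-merlin*"),
        ("Disallow", "/admin*"),
        ("Disallow", "/node*"),
        ("Disallow", "/files*"),
        ("Disallow", "/*?*"),
        ("Disallow", "/*.aspx*"),
        ("Disallow", "/*.php*"),
        ("Disallow", "/canary-test/*"),
        ("Allow", "/*?page=*"),
        ("Allow", "/*?cmp=*"),
    ]),
    ("upday", [("Allow", "/rss_latest_contents?ptt=*")]),
    ("gptbot", [("Disallow", "/")]),
    ("google-extended", [("Disallow", "/")]),
]

SITEMAPS = [
    "https://www.nationalgeographic.de/sitemap/sitemap.xml",
    "https://www.nationalgeographic.de/sitemap-video.xml",
    "https://www.nationalgeographic.de/google-news.xml",
]


def to_robots_txt(groups, sitemaps):
    """Serialize groups and sitemap URLs into robots.txt-style text."""
    lines = []
    for agent, rules in groups:
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # a blank line separates groups
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_robots_txt(GROUPS, SITEMAPS), end="")
```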