nationalgeographic.de
robots.txt

Robots Exclusion Standard data for nationalgeographic.de

Resource Scan

Scan Details

Site Domain nationalgeographic.de
Base Domain nationalgeographic.de
Scan Status Ok
Last Scan2024-05-28T19:13:50+00:00
Next Scan 2024-06-04T19:13:50+00:00

Last Scan

Scanned2024-05-28T19:13:50+00:00
URL https://www.nationalgeographic.de/robots.txt
Domain IPs 118.215.83.81
Response IP 104.69.174.172
Found Yes
Hash 35ee0dd6c749ff838df4c240a3f30232bf42ecd3d0c737e153f877d4630c8666
SimHash 690c9250df23

Groups

*

Rule Path
Disallow /*%7B*
Disallow /api-merlin*
Disallow /admin*
Disallow /node*
Disallow /files*
Disallow /*?*
Disallow /*.aspx*
Disallow /*.php*
Allow /*?page=*
Allow /*?cmp=*

upday

Rule Path
Allow /rss_latest_contents?ptt=*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nationalgeographic.de/sitemap/sitemap.xml
sitemap https://www.nationalgeographic.de/sitemap-video.xml
sitemap https://www.nationalgeographic.de/google-news.xml