nationalgeographic.de
robots.txt

Robots Exclusion Standard data for nationalgeographic.de

Resource Scan

Scan Details

Site Domain	nationalgeographic.de
Base Domain	nationalgeographic.de
Scan Status	Ok
Last Scan	2024-10-30T17:22:14+00:00
Next Scan	2024-11-06T17:22:14+00:00

Last Scan

Scanned	2024-10-30T17:22:14+00:00
URL	https://www.nationalgeographic.de/robots.txt
Domain IPs	23.215.7.19, 23.215.7.20
Response IP	23.215.7.19
Found	Yes
Hash	9c58811c0e2a29cb6aafe56745b0cb2caace43d8a4b8849ebd4ebe10b0410334
SimHash	690c9250df03

Groups

*

Rule	Path
Disallow	/*%7B*
Disallow	/api-merlin*
Disallow	/admin*
Disallow	/node*
Disallow	/files*
Disallow	/*?*
Disallow	/*.aspx*
Disallow	/*.php*
Disallow	/canary-test/*
Allow	/*?page=*
Allow	/*?cmp=*
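
The interaction between the Disallow and Allow rules in the * group can be sketched as follows. This is a minimal illustration, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the matching rule with the longest pattern wins, and Allow wins a tie), and it skips percent-encoding normalization. The names RULES, pattern_to_regex and is_allowed are hypothetical.

```python
import re

# Rules of the "*" group, copied from the table above.
RULES = [
    ("disallow", "/*%7B*"),
    ("disallow", "/api-merlin*"),
    ("disallow", "/admin*"),
    ("disallow", "/node*"),
    ("disallow", "/files*"),
    ("disallow", "/*?*"),
    ("disallow", "/*.aspx*"),
    ("disallow", "/*.php*"),
    ("disallow", "/canary-test/*"),
    ("allow", "/*?page=*"),
    ("allow", "/*?cmp=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/tiere"))          # True: no rule matches
print(is_allowed("/tiere?page=2"))   # True: /*?page=* is longer than /*?*
print(is_allowed("/tiere?utm=1"))    # False: only /*?* matches
print(is_allowed("/admin/login"))    # False: /admin* matches
```

Under these assumptions, the two Allow rules act as exceptions to the broad /*?* rule, so paginated and campaign-tagged URLs stay crawlable while other query strings are blocked.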

upday

Rule	Path
Allow	/rss_latest_contents?ptt=*

gptbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/
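
How a crawler picks one of the groups listed above can be sketched like this. It is a simplified illustration that assumes case-insensitive matching of the crawler's product token against the group names, with * as the fallback, as RFC 9309 describes; select_group and GROUP_NAMES are hypothetical names.

```python
# Group names as they appear in this report.
GROUP_NAMES = ("*", "upday", "gptbot", "google-extended")


def select_group(product_token, group_names=GROUP_NAMES):
    """Return the group whose name equals the crawler's product token.

    Assumption: comparison is case-insensitive, and "*" applies
    whenever no named group matches.
    """
    token = product_token.strip().lower()
    for name in group_names:
        if name != "*" and name == token:
            return name
    return "*"


print(select_group("GPTBot"))     # gptbot
print(select_group("Googlebot"))  # *
```

Under this reading, GPTBot and Google-Extended are governed only by their own Disallow / rule and are excluded from the whole site, while crawlers without a named group fall back to the * rules.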

Other Records

Field	Value
sitemap	https://www.nationalgeographic.de/sitemap/sitemap.xml
sitemap	https://www.nationalgeographic.de/sitemap-video.xml
sitemap	https://www.nationalgeographic.de/google-news.xml
