nationalgeographic.de
robots.txt
Robots Exclusion Standard data for nationalgeographic.de
Resource Scan
Scan Details
Site Domain | nationalgeographic.de |
Base Domain | nationalgeographic.de |
Scan Status | Ok |
Last Scan | 2024-05-28T19:13:50+00:00 |
Next Scan | 2024-06-04T19:13:50+00:00 |
Last Scan
Scanned | 2024-05-28T19:13:50+00:00 |
URL | https://www.nationalgeographic.de/robots.txt |
Domain IPs | 118.215.83.81 |
Response IP | 104.69.174.172 |
Found | Yes |
Hash | 35ee0dd6c749ff838df4c240a3f30232bf42ecd3d0c737e153f877d4630c8666 |
SimHash | 690c9250df23 |
Groups
*
Rule | Path |
---|---|
Disallow | /*%7B* |
Disallow | /api-merlin* |
Disallow | /admin* |
Disallow | /node* |
Disallow | /files* |
Disallow | /*?* |
Disallow | /*.aspx* |
Disallow | /*.php* |
Allow | /*?page=* |
Allow | /*?cmp=* |
Other Records
Field | Value |
---|---|
sitemap | https://www.nationalgeographic.de/sitemap/sitemap.xml |
sitemap | https://www.nationalgeographic.de/sitemap-video.xml |
sitemap | https://www.nationalgeographic.de/google-news.xml |