nationalgeographic.cz
robots.txt

Robots Exclusion Standard data for nationalgeographic.cz

Resource Scan

Scan Details

Site Domain nationalgeographic.cz
Base Domain nationalgeographic.cz
Scan Status Ok
Last Scan2024-11-08T14:37:31+00:00
Next Scan 2024-11-15T14:37:31+00:00

Last Scan

Scanned2024-11-08T14:37:31+00:00
URL https://nationalgeographic.cz/robots.txt
Redirect https://www.nationalgeographic.cz/robots.txt
Redirect Domain www.nationalgeographic.cz
Redirect Base nationalgeographic.cz
Domain IPs 104.21.71.38, 172.67.143.20, 2606:4700:3030::6815:4726, 2606:4700:3030::ac43:8f14
Redirect IPs 104.21.71.38, 172.67.143.20, 2606:4700:3030::6815:4726, 2606:4700:3030::ac43:8f14
Response IP 104.21.71.38
Found Yes
Hash 2c960ed8ae07aaf9119e8dc113dd160277d8977c9334030b7eeb51a27037aa5b
SimHash 80155460e303

Groups

ia_archiver

Rule Path
Disallow /

grapeshot

Rule Path
Disallow

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nationalgeographic.cz/sitemap.xml