Robots Exclusion Standard data for atmoskop.cz

Resource Scan

Scan Details

Site Domain atmoskop.cz
Base Domain atmoskop.cz
Scan Status Ok
Last Scan 2024-05-14T04:54:31+00:00
Next Scan 2024-05-28T04:54:31+00:00

Last Scan

Scanned 2024-05-14T04:54:31+00:00
URL https://atmoskop.cz/robots.txt
Redirect https://www.atmoskop.cz/robots.txt
Redirect Domain www.atmoskop.cz
Redirect Base atmoskop.cz
Domain IPs 13.35.93.17, 13.35.93.41, 13.35.93.49, 13.35.93.69
Redirect IPs 18.154.206.101, 18.154.206.41, 18.154.206.62, 18.154.206.66
Response IP 18.165.171.18
Found Yes
Hash b232ad5cc2daba40da39867b72f050a9233778f249bab65e33f8fd9ab603578a
SimHash 7a5cd844a933

Groups

*

Rule Path
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.atmoskop.cz/sitemap_index.xml
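The groups and the sitemap record above can be checked with Python's standard-library robots.txt parser. This is a minimal sketch, not the site's actual file: the ROBOTS_TXT text is an assumed reconstruction from the parsed groups (the scan does not show the original wording, comments, or casing), and is_allowed is a hypothetical helper name introduced only for illustration.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the parsed groups in this scan; the real
# robots.txt may differ in casing, ordering, and comment lines.
ROBOTS_TXT = """\
User-agent: *
Disallow:

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: google-extended
Disallow: /

Sitemap: https://www.atmoskop.cz/sitemap_index.xml
"""

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, url: str = "https://www.atmoskop.cz/") -> bool:
    """Hypothetical helper: True if the parsed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("Googlebot", "gptbot", "chatgpt-user", "ccbot", "google-extended"):
        print(agent, "allowed" if is_allowed(agent) else "blocked")
    # site_maps() requires Python 3.8 or later.
    print(parser.site_maps())
```

Under this reconstruction, the empty Disallow in the `*` group permits every path for agents that match no named group, while the four named groups are blocked from the whole site.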

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • https://platform.openai.com/docs/gptbot
  • https://platform.openai.com/docs/plugins/bot
  • https://commoncrawl.org/faq
  • https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers