seznamzpravy.cz
robots.txt

Robots Exclusion Standard data for seznamzpravy.cz

Resource Scan

Scan Details

Site Domain seznamzpravy.cz
Base Domain seznamzpravy.cz
Scan Status Ok
Last Scan2024-11-16T09:17:58+00:00
Next Scan 2024-11-23T09:17:58+00:00

Last Scan

Scanned2024-11-16T09:17:58+00:00
URL https://seznamzpravy.cz/robots.txt
Redirect https://www.seznamzpravy.cz/robots.txt
Redirect Domain www.seznamzpravy.cz
Redirect Base seznamzpravy.cz
Domain IPs 185.66.189.31, 2a02:598:a::78:215, 2a02:598:c:189::31, 77.75.78.215
Redirect IPs 185.66.189.31, 2a02:598:a::78:215, 2a02:598:c:189::31, 77.75.78.215
Response IP 77.75.78.215
Found Yes
Hash 5f194c32c4cd61fb5d209c27ce3e81412a9418d8583496a40f3530a2ebae2c09
SimHash cd054917400b

Groups

*

Rule Path
Disallow /clanek/*timeline--pageItem%3D
Disallow /*mol-gallery--expanded%3D
Disallow /*mol-gallery--selected%3D
Disallow /*fts--order%3Dalphabetical-desc
Disallow /*fts--order%3Daccept-time-desc
Disallow /*fts--order%3Daccept-time-asc
Disallow /*fts--order%3Dranking-asc
Disallow /*fts--order%3Dranking-desc
Disallow /*fts--order%3Drandom
Disallow /fts--search
Disallow /kampus/p/stredni-vzdelavani/hledani
Disallow /*ribbon--menu%3D
Disallow /*ribbon--search%3D
Disallow /*menu--open
Disallow /*previewToken%3D
Disallow /*bankid%3D

seznamsocialbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.seznamzpravy.cz/sitemaps/sitemap_articles.xml
sitemap https://www.seznamzpravy.cz/sitemaps/sitemap_news.xml
sitemap https://www.seznamzpravy.cz/sitemaps/sitemap_sections.xml
sitemap https://www.seznamzpravy.cz/sitemaps/sitemap_tags.xml

Comments

  • dont crawl pagination on article pages
  • dont crawl the same page with opened gallery
  • from gallery in newsfeed
  • category of photo contests
  • Campus
  • legacy parameters
  • historical parameters
  • article preview
  • social profile preview tab