super.cz
robots.txt

Robots Exclusion Standard data for super.cz

Resource Scan

Scan Details

Site Domain super.cz
Base Domain super.cz
Scan Status Ok
Last Scan2024-05-23T03:04:11+00:00
Next Scan 2024-05-30T03:04:11+00:00

Last Scan

Scanned2024-05-23T03:04:11+00:00
URL https://super.cz/robots.txt
Redirect https://www.super.cz/robots.txt
Redirect Domain www.super.cz
Redirect Base super.cz
Domain IPs 2a02:598:a::78:140, 2a02:598:a::78:144, 77.75.78.140, 77.75.78.144
Redirect IPs 2a02:598:a::78:140, 2a02:598:a::78:144, 77.75.78.140, 77.75.78.144
Response IP 77.75.78.144
Found Yes
Hash 8ea14a444c8149360cce5a9f7dcaefc2ac3da02543333960688f5d86f1695646
SimHash 49000d37000b

Groups

*

Rule Path
Disallow /clanek/*timeline--pageItem%3D
Disallow /*mol-gallery--expanded%3D
Disallow /*mol-gallery--selected%3D
Disallow /*fts--order%3Dalphabetical-desc
Disallow /*fts--order%3Daccept-time-desc
Disallow /*fts--order%3Daccept-time-asc
Disallow /*fts--order%3Dranking-asc
Disallow /*fts--order%3Dranking-desc
Disallow /*fts--order%3Drandom
Disallow /*ribbon--menu%3D
Disallow /*ribbon--search%3D
Disallow /diskuse

seznamsocialbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.super.cz/sitemaps/sitemap_articles.xml
sitemap https://www.super.cz/sitemaps/sitemap_news.xml
sitemap https://www.super.cz/sitemaps/sitemap_sections.xml
sitemap https://www.super.cz/sitemaps/sitemap_tags.xml

Comments

  • dont crawl pagination on article pages
  • dont crawl the same page with opened gallery
  • from gallery in newsfeed
  • category of photo contests
  • legacy parameters
  • historical parameters
  • social profile preview tab