parkiet.com
robots.txt

Robots Exclusion Standard data for parkiet.com

Resource Scan

Scan Details

Site Domain parkiet.com
Base Domain parkiet.com
Scan Status Ok
Last Scan2024-06-22T08:20:22+00:00
Next Scan 2024-06-29T08:20:22+00:00

Last Scan

Scanned2024-06-22T08:20:22+00:00
URL https://parkiet.com/robots.txt
Redirect https://www.parkiet.com/robots.txt
Redirect Domain www.parkiet.com
Redirect Base parkiet.com
Domain IPs 104.22.74.155, 104.22.75.155, 172.67.30.153, 2606:4700:10::6816:4a9b, 2606:4700:10::6816:4b9b, 2606:4700:10::ac43:1e99
Redirect IPs 104.22.74.155, 104.22.75.155, 172.67.30.153, 2606:4700:10::6816:4a9b, 2606:4700:10::6816:4b9b, 2606:4700:10::ac43:1e99
Response IP 104.22.75.155
Found Yes
Hash f3713bc7bce5b7d6a4da1aaca8278ff1dee2da838b75684eafb530e31c0b1a6a
SimHash 0f54c2019792

Groups

*

Rule Path
Allow /
Disallow /*template%3Dprintart*
Disallow /*template%3Dartzen*
Disallow /*template%3Dloadcomments*
Disallow /*template%3Dartstatus*
Disallow /*template%3Dtestontheart*
Disallow /*template%3Dslider*
Disallow /section/advanced-search*
Disallow /GBC*
Disallow /szukaj/*
Disallow /szukaj
Disallow /search/*
Disallow /content/preview/*
Disallow /layout/preview/*
Disallow /navigation-submenu
Disallow /healthz
Disallow /*template%3Dinfinityscroll*
Disallow /*template%3DgetParagraphToLiveMamut*
Disallow /*template%3DgetParagraph*
Disallow /*name%3D*.php5*
Disallow /*name%3D*.js*
Disallow /*/JavaScript%3A*
Disallow /brak_autora
Disallow /*lpurl%3D*
Disallow /*/apps/pbcsi.dll/bilde*
Disallow /*lopenr%3D*
Disallow /*/apps/pbcs.dll/article*
Disallow /*/apps/pbcs.dll/section*
Disallow /*/apps/pbcs.dll/exec*
Disallow /*/apps/pbcs.dll/error?404*
Disallow /*/apps/pbcsedit.dll*
Disallow /cdn-cgi/*
Disallow /temat/*
Disallow /ht_biznes/*
Disallow /404
Disallow /tagi/*
Disallow */null

Other Records

Field Value
sitemap https://www.parkiet.com/sitemaps/sitemap.xml
sitemap https://www.parkiet.com/sitemaps/news-sitemap.xml

Comments

  • Dead requests found in Google Search Console
  • We don't want to index our scripts
  • We don't want to index pages without meaningfull content
  • We don't want to index advert preview pages
  • We don't want to index images by they technical url - we want "pretty" one
  • We don't want to index technical url's
  • We don't want to index entities that don't exists anymore