rp.pl
robots.txt

Robots Exclusion Standard data for rp.pl

Resource Scan

Scan Details

Site Domain rp.pl
Base Domain rp.pl
Scan Status Ok
Last Scan2024-05-20T11:40:38+00:00
Next Scan 2024-05-27T11:40:38+00:00

Last Scan

Scanned2024-05-20T11:40:38+00:00
URL https://rp.pl/robots.txt
Redirect https://www.rp.pl/robots.txt
Redirect Domain www.rp.pl
Redirect Base rp.pl
Domain IPs 104.22.68.85, 104.22.69.85, 172.67.6.239, 2606:4700:10::6816:4455, 2606:4700:10::6816:4555, 2606:4700:10::ac43:6ef
Redirect IPs 104.22.68.85, 104.22.69.85, 172.67.6.239, 2606:4700:10::6816:4455, 2606:4700:10::6816:4555, 2606:4700:10::ac43:6ef
Response IP 104.22.68.85
Found Yes
Hash 8f98f05b1a49671ec9d1767fdce63b657fffcf03f9f16ca6aea2029ee5943826
SimHash 2f548258d782

Groups

*

Rule Path
Allow /
Disallow /*template%3Dprintart*
Disallow /*template%3Dartzen*
Disallow /*template%3Dloadcomments*
Disallow /*template%3Dartstatus*
Disallow /*template%3Dtestontheart*
Disallow /*template%3Dslider*
Disallow /szukaj/*
Disallow /section/advanced-search*
Disallow /GBC*
Disallow /*template%3Dinfinityscroll*
Disallow /*template%3DgetParagraphToLiveMamut*
Disallow /*template%3DgetParagraph*
Disallow /*name%3D*.php5*
Disallow /*name%3D*.js*
Disallow /*/JavaScript%3A*
Disallow /content/preview/*
Disallow /brak_autora
Disallow /*lpurl%3D*
Disallow /*lopenr%3D*
Disallow /cdn-cgi/*
Disallow /ht_biznes/*
Disallow /404
Disallow /przemysl-spozywczy/art39005721-*

Other Records

Field Value
sitemap https://www.rp.pl/sitemaps/sitemap.xml
sitemap https://www.rp.pl/sitemaps/news-sitemap.xml

Comments

  • Dead requests found in Google Search Console
  • We don't want to index our scripts
  • We don't want to index preview pages
  • We don't want to index pages without meaningfull content
  • We don't want to index advert preview pages
  • We don't want to index technical url's