gazeta.pl
robots.txt
Robots Exclusion Standard data for gazeta.pl
Resource Scan
Scan Details
Site Domain | gazeta.pl |
Base Domain | gazeta.pl |
Scan Status | Ok |
Last Scan | 2024-11-16T16:27:53+00:00 |
Next Scan | 2024-11-23T16:27:53+00:00 |
Last Scan
Scanned | 2024-11-16T16:27:53+00:00 |
URL | https://gazeta.pl/robots.txt |
Domain IPs | 80.252.0.145 |
Response IP | 80.252.0.145 |
Found | Yes |
Hash | a82e96d16bc252da2772a31c0590c611678f37e3dd2714def4d44c384388b4d3 |
SimHash | 731f536b45a3 |
Groups
*
Rule | Path |
---|---|
Disallow | /*amtp_pnHP_gallery* |
Disallow | /*mtpromo* |
Disallow | /*/wyszukaj/ |
Disallow | /*servlet |
Disallow | /reloadwww? |
Disallow | /dfptools/adview/ |
Disallow | /pub/ips/* |
Disallow | /ods? |
Disallow | /getFile.servlet* |
Disallow | /aliasy/blad.jsp |
Disallow | /znajdz.do |
Disallow | /portalSearch.do |
Disallow | /im/ab/b4/10/z17515435Q.jpg |
Disallow | /75224259/ |
Disallow | /0%2C0.html?mtpromo |
Disallow | /0%2C0.html?foryou |
Disallow | /0%2C0.html?mtpromo* |
Disallow | /0%2C0.html?foryou* |
Disallow | /*_gl |
Warnings
- 4 invalid lines.
Comments