wiadomosci.gazeta.pl
robots.txt
Robots Exclusion Standard data for wiadomosci.gazeta.pl
Resource Scan
Scan Details
Site Domain | wiadomosci.gazeta.pl |
Base Domain | gazeta.pl |
Scan Status | Ok |
Last Scan | 2024-11-04T00:45:29+00:00 |
Next Scan | 2024-11-11T00:45:29+00:00 |
Last Scan
Scanned | 2024-11-04T00:45:29+00:00 |
URL | https://wiadomosci.gazeta.pl/robots.txt |
Domain IPs | 80.252.0.132 |
Response IP | 80.252.0.132 |
Found | Yes |
Hash | c9c04b4899319e8d4c8682e55341ffa8a80ab2b21b9eaa4193d304d4b9ca826a |
SimHash | 709f736347b6 |
Groups
*
Rule | Path |
---|---|
Disallow | /ot-amp-consent |
Disallow | /*/wyszukaj/ |
Disallow | /*servlet |
Disallow | /reloadwww? |
Disallow | /dfptools/adview/ |
Disallow | /pub/ips/* |
Disallow | /ods? |
Disallow | /getFile.servlet* |
Disallow | /aliasy/blad.jsp |
Disallow | /znajdz.do |
Disallow | /portalSearch.do |
Disallow | /im/ab/b4/10/z17515435Q.jpg |
Disallow | /75224259/ |
Disallow | /wiadomosci/1%2C53600%2C2342480.html |
Disallow | /wiadomosci/1%2C114873%2C4688178.html |
Disallow | /wiadomosci/1%2C114871%2C14710459%2CNiewidomy___Restauracja_nie_wpuscila_mnie_z_psem_przewodnikiem__.html |
Disallow | /wiadomosci/0%2C114911%2C8062111.html? |
Warnings
- 4 invalid lines.
Comments