wiadomosci.onet.pl
robots.txt
Robots Exclusion Standard data for wiadomosci.onet.pl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wiadomosci.onet.pl |
Base Domain | onet.pl |
Scan Status | Ok |
Last Scan | 2024-05-06T12:22:17+00:00 |
Next Scan | 2024-05-20T12:22:17+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-06T12:22:17+00:00 |
URL | https://wiadomosci.onet.pl/robots.txt |
Domain IPs | 108.156.133.20, 108.156.133.83, 108.156.133.95, 108.156.133.99 |
Response IP | 108.156.133.83 |
Found | Yes |
Hash | af9333a4ffa46616df6c9ddaae0c1f58ec67ce7b8e27b60713802aabb45da864 |
SimHash | 4250450409d1 |
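
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not state the algorithm or whether the file is normalized before hashing. Under that assumption, a minimal sketch for fingerprinting a downloaded copy of the file could look like this (`content_fingerprint` is a hypothetical helper name, not part of the scan tool):

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of raw robots.txt bytes.

    Assumption: the report's Hash is a plain SHA-256 over the response
    body. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Example with a small in-memory body instead of a network fetch.
sample = b"User-agent: *\nDisallow: /szukaj/*\n"
digest = content_fingerprint(sample)
print(len(digest), digest)
```

Comparing the digest from one scan with the digest from the next is a cheap way to detect that the file has changed at all; it says nothing about how much it changed.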
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /szukaj/* |
Disallow | /*zglos-naruszenie.html |
Disallow | /*odpowiedz.html |
Disallow | /*komentuj.html |
Disallow | /*fb_comment_id%3D* |
Disallow | /*dodaj-watek.html |
Disallow | /*watek-odpowiedz.html |
Disallow | /*odpowiedz-cytuj.html |
Disallow | /*td-naruszenie-zasad.html |
Disallow | /paywall/* |
Disallow | *?ress=mobile&ajax=* |
Disallow | /widget-liveblog-story-results.html* |
Disallow | /user-session-proxy/* |
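
Most of the rules above rely on the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way to check a URL path against these Disallow patterns, treating `*` as "any run of characters" and a trailing `$` as an end anchor. The names `rule_to_regex` and `is_disallowed` are hypothetical helpers, the paths in the usage lines are made up, and the sketch does no percent-encoding normalization and ignores Allow rules (the group lists none).

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/szukaj/*",
    "/*zglos-naruszenie.html",
    "/*odpowiedz.html",
    "/*komentuj.html",
    "/*fb_comment_id%3D*",
    "/*dodaj-watek.html",
    "/*watek-odpowiedz.html",
    "/*odpowiedz-cytuj.html",
    "/*td-naruszenie-zasad.html",
    "/paywall/*",
    "*?ress=mobile&ajax=*",
    "/widget-liveblog-story-results.html*",
    "/user-session-proxy/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal segments and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


COMPILED = [rule_to_regex(rule) for rule in DISALLOW]


def is_disallowed(path: str) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(pattern.match(path) for pattern in COMPILED)


# Usage with made-up paths (path plus query string, no scheme or host).
for candidate in ["/szukaj/?q=test", "/kraj/artykul.html", "/kraj/a/komentuj.html"]:
    print(candidate, is_disallowed(candidate))
```

Patterns are matched as prefixes, so `/paywall/*` and `/paywall/` behave the same here; the trailing `*` is redundant but harmless.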