wiadomosci.onet.pl
robots.txt

Robots Exclusion Standard data for wiadomosci.onet.pl

Resource Scan

Scan Details

Site Domain wiadomosci.onet.pl
Base Domain onet.pl
Scan Status Ok
Last Scan2024-05-06T12:22:17+00:00
Next Scan 2024-05-20T12:22:17+00:00

Last Scan

Scanned2024-05-06T12:22:17+00:00
URL https://wiadomosci.onet.pl/robots.txt
Domain IPs 108.156.133.20, 108.156.133.83, 108.156.133.95, 108.156.133.99
Response IP 108.156.133.83
Found Yes
Hash af9333a4ffa46616df6c9ddaae0c1f58ec67ce7b8e27b60713802aabb45da864
SimHash 4250450409d1

Groups

*

Rule Path
Disallow /szukaj/*
Disallow /*zglos-naruszenie.html
Disallow /*odpowiedz.html
Disallow /*komentuj.html
Disallow /*fb_comment_id%3D*
Disallow /*dodaj-watek.html
Disallow /*watek-odpowiedz.html
Disallow /*odpowiedz-cytuj.html
Disallow /*td-naruszenie-zasad.html
Disallow /paywall/*
Disallow *?ress=mobile&ajax=*
Disallow /widget-liveblog-story-results.html*
Disallow /user-session-proxy/*