sports.pl
robots.txt
Robots Exclusion Standard data for sports.pl
Resource Scan
Scan Details
Site Domain | sports.pl |
Base Domain | sports.pl |
Scan Status | Ok |
Last Scan | 2024-11-08T12:02:09+00:00 |
Next Scan | 2024-11-15T12:02:09+00:00 |
Last Scan
Scanned | 2024-11-08T12:02:09+00:00 |
URL | http://sports.pl/robots.txt |
Redirect | https://przegladsportowy.onet.pl/robots.txt |
Redirect Domain | przegladsportowy.onet.pl |
Redirect Base | onet.pl |
Domain IPs | 13.248.130.170, 76.223.2.215 |
Redirect IPs | 108.157.254.119, 108.157.254.49, 108.157.254.5, 108.157.254.58 |
Response IP | 108.157.254.49 |
Found | Yes |
Hash | f98472a5cefce51e02ded32886b374c02bc7967e6c6f8075a48e9a689104058e |
SimHash | 6141ea5a48d5 |
Groups
*
Rule | Path |
---|---|
Disallow | /*.pdf$ |
Disallow | /*.wmv$ |
Disallow | /*.flv$ |
Disallow | /*.mpg$ |
Disallow | /*.avi$ |
Disallow | /*.inl$ |
Disallow | /*.doc$ |
Disallow | /*.xls$ |
Disallow | /Drukuj/ |
Disallow | /fb_comment_id%3D |
Disallow | /paywall/* |
Disallow | /user-session-proxy/* |
Disallow | /njYjD8BNiL/* |
Disallow | /__acc/ |
Disallow | /_cdf/ |
Disallow | /_variant/ |
Disallow | /a8f4d8cd95e164917035b64b867a45dd |