Robots Exclusion Standard data for newsweek.pl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newsweek.pl |
Base Domain | newsweek.pl |
Scan Status | Ok |
Last Scan | 2024-06-07T03:25:11+00:00 |
Next Scan | 2024-06-14T03:25:11+00:00 |
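
The two timestamps above are ISO 8601 values in UTC, so the scan interval can be read off directly. A minimal sketch, assuming only the standard library; the variable names are illustrative and not part of the scan record:

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-06-07T03:25:11+00:00")
next_scan = datetime.fromisoformat("2024-06-14T03:25:11+00:00")

# Difference between the scheduled next scan and the last completed scan.
scan_interval = next_scan - last_scan
print(scan_interval.days)
```

For this record the interval is seven days. Whether the scanner always rescans on that schedule is not stated in the record.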
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-07T03:25:11+00:00 |
URL | https://newsweek.pl/robots.txt |
Redirect | https://www.newsweek.pl/robots.txt |
Redirect Domain | www.newsweek.pl |
Redirect Base | newsweek.pl |
Domain IPs | 178.239.128.26, 195.93.178.26 |
Redirect IPs | 13.33.30.51, 13.33.30.78, 13.33.30.79, 13.33.30.85 |
Response IP | 13.33.30.85 |
Found | Yes |
Hash | 9a6f81dc7be188b635b2dcc51fc9cfc8bee3c068e49a42734243f1e76429486c |
SimHash | 2e60c80087f0 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /rss_google_play.xml$ |
Disallow | /kupony-rabatowe/przejdz-do-kuponow/* |
Disallow | /kupony-rabatowe/search? |
Disallow | /szukaj?q=* |
Disallow | *?src=* |
Disallow | /subskrypcja?* |
Disallow | *?fbclid=* |
Disallow | *?fb_comment=* |
Disallow | */sync/getUserData |
Disallow | */utils/config/getConfiguration |
Disallow | /paywall/* |
Disallow | /user-files* |
Disallow | /getNewestEditions |
Disallow | /kupony-rabatowe/tracking/set |
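
To check which URL paths these rules block, the patterns can be translated into regular expressions. The sketch below assumes the wildcard conventions of RFC 9309, where `*` matches any sequence of characters, a trailing `$` anchors the end of the path, and a pattern matches from the start of the path. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical. Because this group contains only `Disallow` rules, the sketch skips `Allow` handling and longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/rss_google_play.xml$",
    "/kupony-rabatowe/przejdz-do-kuponow/*",
    "/kupony-rabatowe/search?",
    "/szukaj?q=*",
    "*?src=*",
    "/subskrypcja?*",
    "*?fbclid=*",
    "*?fb_comment=*",
    "*/sync/getUserData",
    "*/utils/config/getConfiguration",
    "/paywall/*",
    "/user-files*",
    "/getNewestEditions",
    "/kupony-rabatowe/tracking/set",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces; each "*" becomes ".*" (any character sequence).
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    # A trailing "$" in the pattern pins the match to the end of the path.
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


print(is_disallowed("/szukaj?q=polska"))
print(is_disallowed("/polska/artykul"))
```

Under these assumptions, `/szukaj?q=polska` is blocked and `/polska/artykul` is not. Note that `*?fbclid=*` only matches when `fbclid` is the first query parameter, so a path such as `/a?x=1&fbclid=2` is not covered by these rules.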