newsweek.pl
robots.txt

Robots Exclusion Standard data for newsweek.pl

Resource Scan

Scan Details

Site Domain newsweek.pl
Base Domain newsweek.pl
Scan Status Ok
Last Scan 2024-06-07T03:25:11+00:00
Next Scan 2024-06-14T03:25:11+00:00

Last Scan

Scanned 2024-06-07T03:25:11+00:00
URL https://newsweek.pl/robots.txt
Redirect https://www.newsweek.pl/robots.txt
Redirect Domain www.newsweek.pl
Redirect Base newsweek.pl
Domain IPs 178.239.128.26, 195.93.178.26
Redirect IPs 13.33.30.51, 13.33.30.78, 13.33.30.79, 13.33.30.85
Response IP 13.33.30.85
Found Yes
Hash 9a6f81dc7be188b635b2dcc51fc9cfc8bee3c068e49a42734243f1e76429486c
SimHash 2e60c80087f0

Groups

*

Rule Path
Disallow /rss_google_play.xml$
Disallow /kupony-rabatowe/przejdz-do-kuponow/*
Disallow /kupony-rabatowe/search?
Disallow /szukaj?q=*
Disallow *?src=*
Disallow /subskrypcja?*
Disallow *?fbclid=*
Disallow *?fb_comment=*
Disallow */sync/getUserData
Disallow */utils/config/getConfiguration
Disallow /paywall/*
Disallow /user-files*
Disallow /getNewestEditions
Disallow /kupony-rabatowe/tracking/set
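
The rules above use the two wildcard characters common in robots.txt files: `*` (any sequence of characters) and a trailing `$` (end of URL). The following is a minimal sketch of how a crawler might evaluate a path against this group, assuming the wildcard semantics described in RFC 9309. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and are not part of the scan data; the group contains no Allow rules, so no precedence handling is shown.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/rss_google_play.xml$",
    "/kupony-rabatowe/przejdz-do-kuponow/*",
    "/kupony-rabatowe/search?",
    "/szukaj?q=*",
    "*?src=*",
    "/subskrypcja?*",
    "*?fbclid=*",
    "*?fb_comment=*",
    "*/sync/getUserData",
    "*/utils/config/getConfiguration",
    "/paywall/*",
    "/user-files*",
    "/getNewestEditions",
    "/kupony-rabatowe/tracking/set",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex.

    Assumed semantics: '*' matches any character sequence, and a
    trailing '$' pins the match to the end of the URL. Everything
    else is treated as a literal prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces (for example '?' and '.') and join them with '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path plus query."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ("/szukaj?q=polityka", "/polska/artykul"):
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `*?fbclid=*` only matches when `fbclid` is the first query parameter, because the pattern contains the literal text `?fbclid=`. A URL such as `/artykul?x=1&fbclid=2` would not be caught by it.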