topnewsy.pl
robots.txt
Robots Exclusion Standard data for topnewsy.pl
Resource Scan
Scan Details
Site Domain | topnewsy.pl |
Base Domain | topnewsy.pl |
Scan Status | Ok |
Last Scan | 2024-09-23T04:24:21+00:00 |
Next Scan | 2024-09-30T04:24:21+00:00 |
Last Scan
Scanned | 2024-09-23T04:24:21+00:00 |
URL | https://topnewsy.pl/robots.txt |
Domain IPs | 31.220.144.67, 31.220.144.69 |
Response IP | 31.220.144.67 |
Found | Yes |
Hash | 08886066532a4ab029ce2899a7b9f58b40061857db1e1f95018a6aa4385f70d6 |
SimHash | 8830d820a373 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /search/ |
Disallow | /more* |
Disallow | /*-json/$ |
Disallow | /*?ticket=* |
Disallow | /next-posts/ |
Disallow | /fb-get* |
Disallow | /post-stats-add/ |
Disallow | /ad-mdata/ |
Disallow | /ifslot/ |
Disallow | /search-stats-add/ |
Disallow | /share-* |