plejada.pl
robots.txt
Robots Exclusion Standard data for plejada.pl
Resource Scan
Scan Details
Site Domain | plejada.pl |
Base Domain | plejada.pl |
Scan Status | Ok |
Last Scan | 2024-11-13T17:22:16+00:00 |
Next Scan | 2024-11-20T17:22:16+00:00 |
Last Scan
Scanned | 2024-11-13T17:22:16+00:00 |
URL | https://plejada.pl/robots.txt |
Domain IPs | 13.33.88.10, 13.33.88.120, 13.33.88.129, 13.33.88.47 |
Response IP | 13.33.88.10 |
Found | Yes |
Hash | eef02ebb25a6feab1206e13541752dee092997115b38e98b4d97361ff5df9103 |
SimHash | 06d1c0240dd1 |
Groups
*
Rule | Path |
---|---|
Disallow | /szukaj/* |
Disallow | /*zglos-naruszenie.html |
Disallow | /*odpowiedz.html |
Disallow | /*komentuj.html |
Disallow | /*dodaj-watek.html |
Disallow | /*watek-odpowiedz.html |
Disallow | /*odpowiedz-cytuj.html |
Disallow | /*td-naruszenie-zasad.html |
Disallow | /paywall/* |
Disallow | /?ajax=1&page=* |
Disallow | /*?ajax=1&page=* |
Disallow | /*?ress=mobile&ajax=* |
Disallow | /widget-liveblog-story-results.html* |
Disallow | /user-session-proxy/* |
Disallow | /a8f4d8cd95e164917035b64b867a45* |