plejada.pl
robots.txt

Robots Exclusion Standard data for plejada.pl

Resource Scan

Scan Details

Site Domain plejada.pl
Base Domain plejada.pl
Scan Status Ok
Last Scan 2024-11-13T17:22:16+00:00
Next Scan 2024-11-20T17:22:16+00:00

Last Scan

Scanned 2024-11-13T17:22:16+00:00
URL https://plejada.pl/robots.txt
Domain IPs 13.33.88.10, 13.33.88.120, 13.33.88.129, 13.33.88.47
Response IP 13.33.88.10
Found Yes
Hash eef02ebb25a6feab1206e13541752dee092997115b38e98b4d97361ff5df9103
SimHash 06d1c0240dd1
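
The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body; that is an assumption, not something the scan states. The helper names `sha256_hex` and `matches_report` are hypothetical.

```python
import hashlib

# Value of the "Hash" field from the Last Scan table above.
REPORTED_HASH = "eef02ebb25a6feab1206e13541752dee092997115b38e98b4d97361ff5df9103"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumes the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == REPORTED_HASH
```

If the assumption holds, passing the bytes fetched from https://plejada.pl/robots.txt to `matches_report` would indicate whether the file has changed since the scan. A mismatch could equally mean that the scanner uses a different algorithm or normalizes the body first.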

Groups

*

Rule Path
Disallow /szukaj/*
Disallow /*zglos-naruszenie.html
Disallow /*odpowiedz.html
Disallow /*komentuj.html
Disallow /*dodaj-watek.html
Disallow /*watek-odpowiedz.html
Disallow /*odpowiedz-cytuj.html
Disallow /*td-naruszenie-zasad.html
Disallow /paywall/*
Disallow /?ajax=1&page=*
Disallow /*?ajax=1&page=*
Disallow /*?ress=mobile&ajax=*
Disallow /widget-liveblog-story-results.html*
Disallow /user-session-proxy/*
Disallow /a8f4d8cd95e164917035b64b867a45*
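
Several of the rules above rely on the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be checked against a URL path, assuming the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path). The function names `pattern_to_regex` and `is_disallowed` are hypothetical, and because this group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/szukaj/*",
    "/*zglos-naruszenie.html",
    "/*odpowiedz.html",
    "/*komentuj.html",
    "/*dodaj-watek.html",
    "/*watek-odpowiedz.html",
    "/*odpowiedz-cytuj.html",
    "/*td-naruszenie-zasad.html",
    "/paywall/*",
    "/?ajax=1&page=*",
    "/*?ajax=1&page=*",
    "/*?ress=mobile&ajax=*",
    "/widget-liveblog-story-results.html*",
    "/user-session-proxy/*",
    "/a8f4d8cd95e164917035b64b867a45*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule.

    re.match anchors at the start, so each rule acts as a prefix pattern.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

For example, `is_disallowed("/szukaj/foo")` and `is_disallowed("/news/komentuj.html")` both match a rule, while `is_disallowed("/news/article.html")` does not. Note that the query string must be passed along with the path for the `?ajax=1&page=*` rules to apply.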