Robots Exclusion Standard data for plurk.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | plurk.com |
Base Domain | plurk.com |
Scan Status | Ok |
Last Scan | 2025-03-10T13:25:50+00:00 |
Next Scan | 2025-03-17T13:25:50+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-03-10T13:25:50+00:00 |
URL | https://plurk.com/robots.txt |
Redirect | https://www.plurk.com/robots.txt |
Redirect Domain | www.plurk.com |
Redirect Base | plurk.com |
Domain IPs | 104.17.79.77, 104.18.65.15, 2606:4700::6811:4f4d, 2606:4700::6812:410f |
Redirect IPs | 104.17.79.77, 104.18.65.15, 2606:4700::6811:4f4d, 2606:4700::6812:410f |
Response IP | 104.17.79.77 |
Found | Yes |
Hash | b728d4c6dbc99fcf3d06d8a902c8706f48eadf94d4a8ca23f9ad8f70d075bf73 |
SimHash | ed15fa16cbd7 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /Friends/inviteFriends/ |
Disallow | /Notifications/ |
Disallow | /Settings/ |
Disallow | /Cliques/ |
Disallow | /Affiliate/ |
Disallow | /Admin/ |
Disallow | /redeemByURL |
Disallow | /redeemInvite |
Disallow | /Users/ |
Disallow | /API/ |
Disallow | /IM/ |
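
The rules above can be evaluated with a small matcher. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names `RULES` and `is_allowed` are illustrative and not part of the scan data. Because this group contains no `*` or `$` patterns, a plain case-sensitive prefix comparison is enough here.

```python
# Rules copied from the "*" group above, as (directive, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/Friends/inviteFriends/"),
    ("disallow", "/Notifications/"),
    ("disallow", "/Settings/"),
    ("disallow", "/Cliques/"),
    ("disallow", "/Affiliate/"),
    ("disallow", "/Admin/"),
    ("disallow", "/redeemByURL"),
    ("disallow", "/redeemInvite"),
    ("disallow", "/Users/"),
    ("disallow", "/API/"),
    ("disallow", "/IM/"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: rules are plain prefixes (no wildcards), compared
    case-sensitively, as in the table above.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = directive == "allow"
        # Prefer the longer prefix; on equal length, prefer Allow.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


for sample in ["/", "/Settings/account", "/settings/", "/Users", "/redeemByURLabc"]:
    print(sample, is_allowed(sample))
```

Note that Python's standard `urllib.robotparser` applies rules in file order and stops at the first match, so if the file lists `Allow: /` first, as the table does, it would report every path as allowed. That difference is why this sketch implements the precedence by hand.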
Other Records
Field | Value |
---|---|
sitemap | https://www.plurk.com/sitemap.xml |
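
A crawler can read the sitemap record with Python's standard library. The sketch below assumes Python 3.8 or later (for `RobotFileParser.site_maps()`) and uses a hypothetical, abbreviated reconstruction of the file built from the tables above. The real robots.txt may differ in ordering, comments, and whitespace.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction for illustration only; it lists a subset of
# the rules and is not the verbatim content served by www.plurk.com.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /Settings/
Disallow: /API/

Sitemap: https://www.plurk.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap record is present.
sitemaps = parser.site_maps() or []
print(sitemaps)
```

To work against the live file instead, `parser.set_url("https://www.plurk.com/robots.txt")` followed by `parser.read()` would fetch it over the network.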