selfh.st
robots.txt
Robots Exclusion Standard data for selfh.st
Resource Scan
Scan Details
Site Domain | selfh.st |
Base Domain | selfh.st |
Scan Status | Ok |
Last Scan | 2025-08-26T18:51:05+00:00 |
Next Scan | 2025-09-25T18:51:05+00:00 |
Last Scan
Scanned | 2025-08-26T18:51:05+00:00 |
URL | https://selfh.st/robots.txt |
Domain IPs | 104.21.92.88, 172.67.190.220, 2606:4700:3033::6815:5c58, 2606:4700:3033::ac43:bedc |
Response IP | 172.67.190.220 |
Found | Yes |
Hash | 701bc2fae375b5a2b56b1591b456b3d20931866b7d536be6221308d915d50ed1 |
SimHash | e0144515ff13 |
Groups
*
Rule | Path |
---|---|
Disallow | /ghost/ |
Disallow | /email/ |
Disallow | /members/api/comments/counts/ |
Disallow | /r/ |
Disallow | /webmentions/receive/ |
Other Records
Field | Value |
---|---|
sitemap | https://selfh.st/sitemap.xml |