patrickwthomas.net
robots.txt
Robots Exclusion Standard data for patrickwthomas.net
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | patrickwthomas.net |
Base Domain | patrickwthomas.net |
Scan Status | Ok |
Last Scan | 2025-10-03T14:29:29+00:00 |
Next Scan | 2025-11-02T14:29:29+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-10-03T14:29:29+00:00 |
URL | https://patrickwthomas.net/robots.txt |
Domain IPs | 104.21.34.43, 172.67.197.156, 2606:4700:3037::6815:222b, 2606:4700:3037::ac43:c59c |
Response IP | 104.21.34.43 |
Found | Yes |
Hash | 5cfadca84e0843a70a126c6425c2df588d5dc75a4c600b1d72d76b696a256db2 |
SimHash | e0044594ff13 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /ghost/ |
Disallow | /email/ |
Disallow | /members/api/comments/counts/ |
Disallow | /r/ |
Disallow | /webmentions/receive/ |
Other Records
Field | Value |
---|---|
sitemap | https://patrickwthomas.net/sitemap.xml |
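
The rules and sitemap record listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the robots.txt text can be reconstructed from the report's tables (the exact bytes of the file are not shown here), and the crawler name `ExampleBot` and the test paths `/about/` and `/r/abc123` are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report's tables; the real file's exact
# whitespace, ordering, and comments are an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /email/
Disallow: /members/api/comments/counts/
Disallow: /r/
Disallow: /webmentions/receive/

Sitemap: https://patrickwthomas.net/sitemap.xml
"""

BASE = "https://patrickwthomas.net"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical user agent; it falls under the "*" group.
for path in ("/", "/ghost/", "/r/abc123", "/about/"):
    verdict = "allowed" if parser.can_fetch("ExampleBot", BASE + path) else "blocked"
    print(f"{path}: {verdict}")

# site_maps() (Python 3.8+) returns the Sitemap records, or None if absent.
print(parser.site_maps())
```

Matching in `urllib.robotparser` is by simple path prefix, so `/r/abc123` falls under `Disallow: /r/` while a path such as `/rss/` would not. Other crawlers may implement additional matching rules, so this sketch only approximates how a given bot reads the file.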