johnherdman.net
robots.txt
Robots Exclusion Standard data for johnherdman.net
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | johnherdman.net |
| Base Domain | johnherdman.net |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Couldn't connect to server. |
| Last Scan | 2025-09-30T19:21:36+00:00 |
| Next Scan | 2025-12-29T19:21:36+00:00 |
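
The Last Scan and Next Scan timestamps above are exactly 90 days apart. The report does not say whether that interval is a fixed rescan policy, so the sketch below only checks the arithmetic on the two listed values; the variable names are illustrative.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2025-09-30T19:21:36+00:00")
next_scan = datetime.fromisoformat("2025-12-29T19:21:36+00:00")

# Difference between the two listed scan times.
interval = next_scan - last_scan
print(interval.days)
```

Both values are timezone-aware, so the subtraction is well defined and would remain correct if the offsets differed.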
Last Successful Scan
| Field | Value |
|---|---|
| Scanned | 2022-04-05T18:05:50+00:00 |
| URL | https://johnherdman.net/robots.txt |
| Redirect | https://blog.johnherdman.net/robots.txt |
| Redirect Domain | blog.johnherdman.net |
| Redirect Base | johnherdman.net |
| Response IP | 142.251.12.121 |
| Found | Yes |
| Hash | 40a6a7b57b9b8c56cfbb9f7b37dacfdbaa26106af43168185e3664a533924863 |
| SimHash | 090492704613 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://blog.johnherdman.net/sitemap.xml |
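
A record like the sitemap entry above can be read out of a robots.txt body with Python's standard `urllib.robotparser` module. This is a minimal sketch and not the scanner's own method: the helper name `sitemap_records` is hypothetical, and the sample body is an assumed stand-in, since the report lists only the sitemap record and not the full file.

```python
from urllib.robotparser import RobotFileParser


def sitemap_records(robots_txt: str) -> list[str]:
    """Return the Sitemap URLs declared in a robots.txt body (hypothetical helper)."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap lines are present (Python 3.8+).
    return parser.site_maps() or []


# Assumed sample body; only the Sitemap line comes from the report.
sample = """\
User-agent: *
Disallow:
Sitemap: https://blog.johnherdman.net/sitemap.xml
"""

print(sitemap_records(sample))
```

Parsing from a string rather than calling `read()` keeps the example usable when the host cannot be reached, as in the failed scan recorded above.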