johnherdman.net
robots.txt

Robots Exclusion Standard data for johnherdman.net

Resource Scan

Scan Details

Site Domain johnherdman.net
Base Domain johnherdman.net
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-30T19:21:36+00:00
Next Scan 2025-12-29T19:21:36+00:00

Last Successful Scan

Scanned2022-04-05T18:05:50+00:00
URL https://johnherdman.net/robots.txt
Redirect https://blog.johnherdman.net/robots.txt
Redirect Domain blog.johnherdman.net
Redirect Base johnherdman.net
Response IP 142.251.12.121
Found Yes
Hash 40a6a7b57b9b8c56cfbb9f7b37dacfdbaa26106af43168185e3664a533924863
SimHash 090492704613

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap https://blog.johnherdman.net/sitemap.xml