gregherman.net
robots.txt

Robots Exclusion Standard data for gregherman.net

Resource Scan

Scan Details

Site Domain gregherman.net
Base Domain gregherman.net
Scan Status Ok
Last Scan 2026-03-21T15:03:59+00:00
Next Scan 2026-03-28T15:03:59+00:00

Last Scan

Scanned 2026-03-21T15:03:59+00:00
URL http://www.gregherman.net/robots.txt
Domain IPs 172.253.118.121, 2404:6800:4003:c03::79
Response IP 172.217.194.121
Found Yes
Hash 9433297c50d043aa89653997046f09ed00088e263a4c0ea1c1a8357b6f4bfe48
SimHash 0d049050cf53

Groups

mediapartners-google

Rule Path
Disallow (empty)

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap http://www.gregherman.net/sitemap.xml
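As a rough illustration of how the group and sitemap records above behave, the sketch below rebuilds a robots.txt from them and queries it with Python's standard `urllib.robotparser`. The exact file text (casing, ordering, blank lines) is an assumption reconstructed from the scan records, and the agent name `ExampleBot` and the post path are hypothetical. Note that Python's parser applies rules in file order, whereas some crawlers use longest-match; for these particular rules the outcomes should coincide.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's group and sitemap records. The exact text
# of the live file is an assumption; only the rules themselves are from
# the scan.
ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /search
Disallow: /share-widget
Allow: /

Sitemap: http://www.gregherman.net/sitemap.xml
"""

BASE = "http://www.gregherman.net"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler that falls under the "*" group.
# The post path is likewise a made-up example.
for agent, path in [
    ("ExampleBot", "/search"),
    ("ExampleBot", "/share-widget"),
    ("ExampleBot", "/2026/03/example-post.html"),
    ("Mediapartners-Google", "/search"),
]:
    verdict = "allowed" if parser.can_fetch(agent, BASE + path) else "blocked"
    print(f"{agent:22} {path:28} {verdict}")

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

An empty `Disallow` value places no restriction on the named agent, so `Mediapartners-Google` may fetch `/search` even though the `*` group blocks it for everyone else.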