greghillassociates.com
robots.txt

Robots Exclusion Standard data for greghillassociates.com

Resource Scan

Scan Details

Site Domain greghillassociates.com
Base Domain greghillassociates.com
Scan Status Ok
Last Scan 2024-05-14T08:39:08+00:00
Next Scan 2024-06-13T08:39:08+00:00

Last Scan

Scanned 2024-05-14T08:39:08+00:00
URL https://greghillassociates.com/robots.txt
Redirect https://www.greghillassociates.com/robots.txt
Redirect Domain www.greghillassociates.com
Redirect Base greghillassociates.com
Domain IPs 50.17.254.73
Redirect IPs 13.225.4.47, 13.225.4.55, 13.225.4.75, 13.225.4.91
Response IP 13.225.4.55
Found Yes
Hash c14a199f4897771c0907da3510972a6eb8bb5be8b554729f7134a269ea934bd2
SimHash 291c92738f93

Groups

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/*
Disallow /captcha/*
Allow /
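
To illustrate how the groups above might be evaluated, here is a minimal Python sketch. It assumes the matching rules of RFC 9309: the longest matching path pattern wins, a tie goes to Allow, and `*` in a path matches any run of characters. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical and exist only for this example. User-agent selection is simplified to an exact, case-insensitive token lookup with a fallback to the `*` group, which is an assumption and not how every crawler behaves.

```python
import re

# Groups as listed in the scan above, keyed by lowercase user-agent token.
GROUPS = {
    "ia_archiver": [("disallow", "/")],
    "*": [
        ("disallow", "/cgi-bin/*"),
        ("disallow", "/captcha/*"),
        ("allow", "/"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(user_agent, path, groups=GROUPS):
    """Return True if `path` may be fetched by `user_agent` (simplified)."""
    # Assumption: exact token lookup, falling back to the '*' group.
    rules = groups.get(user_agent.lower(), groups.get("*", []))
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule, access is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for agent, path in [
        ("ia_archiver", "/"),
        ("Googlebot", "/cgi-bin/test.cgi"),
        ("Googlebot", "/captcha/image.png"),
        ("Googlebot", "/contact.html"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, `ia_archiver` is blocked from the whole site, while any other agent is blocked only under `/cgi-bin/` and `/captcha/`, because those patterns are longer than the catch-all `Allow /`.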

Other Records

Field Value
sitemap https://www.greghillassociates.com/geositemap.xml
sitemap https://www.greghillassociates.com/sitemap.xml