Robots Exclusion Standard data for guidancehub.org

Resource Scan

Scan Details

Site Domain guidancehub.org
Base Domain guidancehub.org
Scan Status Ok
Last Scan 2025-10-02T00:44:22+00:00
Next Scan 2025-11-01T00:44:22+00:00

Last Scan

Scanned 2025-10-02T00:44:22+00:00
URL https://guidancehub.org/robots.txt
Redirect https://www.guidancehub.org/robots.txt
Redirect Domain www.guidancehub.org
Redirect Base guidancehub.org
Domain IPs 199.34.228.70
Redirect IPs 199.34.228.70
Response IP 199.34.228.70
Found Yes
Hash 86d0215b15473dcea3784199b79b463676f3958ef1ba658779c211f6021c4264
SimHash 6254dc84cfd3
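
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan does not name the algorithm or say which bytes were hashed. As a minimal sketch, assuming SHA-256 over the raw response body, a later fetch could be compared against the recorded value as follows. The helper names sha256_hex and matches_recorded are hypothetical.

```python
import hashlib

# Hash value recorded by the scan. Treating it as SHA-256 is an assumption.
RECORDED_HASH = "86d0215b15473dcea3784199b79b463676f3958ef1ba658779c211f6021c4264"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value (under the SHA-256 assumption)."""
    return sha256_hex(body) == recorded.lower()
```

A mismatch would mean either that the file changed since the scan or that the assumption about the algorithm or the hashed bytes is wrong.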

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /islamic-studies-for-children-5-10-years.html
Disallow /http%3A//guidancehub.eventbrite.co.uk/
Disallow /islamic-education.html
Disallow /sports.html

Other Records

Field Value
sitemap https://www.guidancehub.org/sitemap.xml
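
The groups listed above can be checked with Python's standard urllib.robotparser. The sketch below rebuilds a robots.txt from the scan's groups; the original file's ordering, letter case, and comments are not shown in the scan, so the text is a hypothetical reconstruction. The agent name examplebot, the test paths, and the helper allowed are also assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the file from the groups in the scan.
# The real file's layout and ordering may differ.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /islamic-studies-for-children-5-10-years.html
Disallow: /http%3A//guidancehub.eventbrite.co.uk/
Disallow: /islamic-education.html
Disallow: /sports.html

Sitemap: https://www.guidancehub.org/sitemap.xml
"""

BASE = "https://www.guidancehub.org"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Ask the parsed rules whether `agent` may fetch `path` on the site."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    checks = [
        ("nerdybot", "/"),             # blocked from the whole site
        ("dotbot", "/sports.html"),    # own group has no Disallow rules
        ("examplebot", "/ajax/data"),  # falls under the * group
        ("examplebot", "/about.html"),
    ]
    for agent, path in checks:
        print(agent, path, allowed(agent, path))
    print("dotbot crawl delay:", parser.crawl_delay("dotbot"))
    print("sitemaps:", parser.site_maps())
```

Under this parser, dotbot matches its own group and is therefore not subject to the Disallow rules in the * group, which agrees with the scan's "No rules defined. All paths allowed." Other crawlers may interpret the file differently.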