gloucesterfhc.com
robots.txt

Robots Exclusion Standard data for gloucesterfhc.com

Resource Scan

Scan Details

Site Domain gloucesterfhc.com
Base Domain gloucesterfhc.com
Scan Status Ok
Last Scan 2025-10-28T10:14:56+00:00
Next Scan 2025-11-04T10:14:56+00:00

Last Scan

Scanned 2025-10-28T10:14:56+00:00
URL https://gloucesterfhc.com/robots.txt
Redirect https://www.gloucesterfhc.com/robots.txt
Redirect Domain www.gloucesterfhc.com
Redirect Base gloucesterfhc.com
Domain IPs 199.34.228.46
Redirect IPs 199.34.228.46
Response IP 199.34.228.46
Found Yes
Hash e4db14c52c370f4aaa6950982504b55edcbed3e0336d1b382ebf820a4d8a8508
SimHash 6954dc746793

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/

Other Records

Field Value
sitemap https://www.gloucesterfhc.com/sitemap.xml
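Example

The groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own method: the ROBOTS_TXT text is an assumed reconstruction from the listed groups (the real file's ordering, casing, and comments are not shown in this report), and the helper name allowed and the agent name examplebot are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the groups in this report; the actual
# robots.txt served by the site may differ in order, casing, or comments.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/

Sitemap: https://www.gloucesterfhc.com/sitemap.xml
"""

BASE = "https://www.gloucesterfhc.com"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` (hypothetical helper)."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "examplebot" is a made-up agent that falls through to the * group.
    checks = [
        ("nerdybot", "/"),
        ("dotbot", "/ajax/data"),
        ("examplebot", "/ajax/data"),
        ("examplebot", "/about"),
    ]
    for agent, path in checks:
        print(agent, path, allowed(agent, path))
    print("dotbot crawl-delay:", parser.crawl_delay("dotbot"))
    print("sitemaps:", parser.site_maps())
```

Because dotbot has its own group, it does not inherit the Disallow rules from the * group; only its crawl-delay applies. Note that site_maps() requires Python 3.8 or later, and that urllib.robotparser matches agent names by substring, which may differ from how a given crawler interprets the file.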