hedgecraftherbals.ca
robots.txt

Robots Exclusion Standard data for hedgecraftherbals.ca

Resource Scan

Scan Details

Site Domain hedgecraftherbals.ca
Base Domain hedgecraftherbals.ca
Scan Status Ok
Last Scan2024-05-14T15:29:45+00:00
Next Scan 2024-05-28T15:29:45+00:00

Last Scan

Scanned2024-05-14T15:29:45+00:00
URL https://hedgecraftherbals.ca/robots.txt
Redirect https://www.hedgecraftherbals.ca/robots.txt
Redirect Domain www.hedgecraftherbals.ca
Redirect Base hedgecraftherbals.ca
Domain IPs 199.34.228.59
Redirect IPs 199.34.228.59
Response IP 199.34.228.59
Found Yes
Hash 4bf4e0f3b4e4a37b5e9cfabcd68ae4ddba8fa0ac8b5f635cc840a53978705024
SimHash 2354dc762793

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/

Other Records

Field Value
sitemap https://www.hedgecraftherbals.ca/sitemap.xml