gettheguidelight.io
robots.txt

Robots Exclusion Standard data for gettheguidelight.io

Resource Scan

Scan Details

Site Domain gettheguidelight.io
Base Domain gettheguidelight.io
Scan Status Ok
Last Scan2025-12-12T20:55:12+00:00
Next Scan 2026-01-11T20:55:12+00:00

Last Scan

Scanned2025-12-12T20:55:12+00:00
URL https://gettheguidelight.io/robots.txt
Domain IPs 151.101.131.220, 151.101.195.220, 151.101.3.220, 151.101.67.220, 2a04:4e42:200::988, 2a04:4e42:400::988, 2a04:4e42:600::988, 2a04:4e42::988
Response IP 151.101.3.220
Found Yes
Hash c7f112f6d95711d256bc1487960920a241758e8ac5c8a60524f1cbc975d228f2
SimHash 6940d8f2cab1

Groups

*

Rule Path
Allow /
Disallow /checkout
Disallow /thank-you

Other Records

Field Value
crawl-delay 20