weargustin.com
robots.txt

Robots Exclusion Standard data for weargustin.com

Resource Scan

Scan Details

Site Domain weargustin.com
Base Domain weargustin.com
Scan Status Ok
Last Scan2024-05-24T20:31:21+00:00
Next Scan 2024-06-23T20:31:21+00:00

Last Scan

Scanned2024-05-24T20:31:21+00:00
URL https://weargustin.com/robots.txt
Domain IPs 23.22.5.68, 3.226.182.14, 52.21.227.162, 54.237.159.171
Response IP 3.226.182.14
Found Yes
Hash d629b1a00df9ff010be279e5c66a8035ddcf28fd9910bd54ec4d6bed0a59c376
SimHash 08591052cdd3

Groups

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /community/
Disallow /orders
Disallow /account
Disallow /welcome-back

Other Records

Field Value
sitemap https://www.weargustin.com/sitemap.xml