Robots Exclusion Standard data for thewoodlandsmall.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thewoodlandsmall.com |
Base Domain | thewoodlandsmall.com |
Scan Status | Ok |
Last Scan | 2024-05-24T16:53:14+00:00 |
Next Scan | 2024-06-23T16:53:14+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-24T16:53:14+00:00 |
URL | https://thewoodlandsmall.com/robots.txt |
Redirect | https://www.thewoodlandsmall.com:443/robots.txt |
Redirect Domain | www.thewoodlandsmall.com |
Redirect Base | thewoodlandsmall.com |
Domain IPs | 15.197.183.245, 76.223.22.14 |
Redirect IPs | 23.45.207.169, 2600:1413:1::48f7:7fe1, 2600:1413:1::48f7:7ff2 |
Response IP | 42.99.140.146 |
Found | Yes |
Hash | ff41a7d9b46d65903880c60dcc03169891f9bf434548a50dc60adb4ce3409a1a |
SimHash | 717836419a90 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | */wifi |
Disallow | */wifi/connected |
Disallow | */the-club/thank-you |
Disallow | */directory/map/* |
Disallow | */directory/wayfinding/* |
Disallow | /*date%3D |
Disallow | */sweepstakes |
Allow | */directory/map/$ |
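How these rules interact is easiest to see by evaluating a few paths against them. The following is a minimal sketch, assuming the wildcard conventions described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, the sample paths are hypothetical, and percent-encoded sequences such as `%3D` are compared literally rather than normalized.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "*/wifi"),
    ("disallow", "*/wifi/connected"),
    ("disallow", "*/the-club/thank-you"),
    ("disallow", "*/directory/map/*"),
    ("disallow", "*/directory/wayfinding/*"),
    ("disallow", "/*date%3D"),
    ("disallow", "*/sweepstakes"),
    ("allow", "*/directory/map/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Without the '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = "".join(".*" if ch == "*" else re.escape(ch) for ch in pattern)
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    Assumption: the longest matching pattern wins, and an Allow rule
    wins when an Allow and a Disallow pattern have equal length.
    Paths that match no rule are allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


# Hypothetical paths, chosen only to exercise the rules.
for sample in ["/directory/map/", "/directory/map/level-1", "/wifi", "/shopping"]:
    print(sample, "allowed" if is_allowed(sample) else "disallowed")
```

Under these assumptions, `*/directory/map/*` and `*/directory/map/$` are the same length, so the tie-break is what leaves the map landing page crawlable while everything beneath it stays blocked. A crawler that resolves ties differently could reach the opposite result for that one path.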
Other Records
Field | Value |
---|---|
sitemap | https://www.thewoodlandsmall.com/arc/outboundfeeds/sitemap-index/ |
sitemap | https://www.thewoodlandsmall.com/arc/outboundfeeds/sitemap-section-index/ |
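Records such as these sit outside any user-agent group, so they can be collected with a simple line scan. The sketch below is one way to do that, assuming the usual `field: value` line syntax with `#` comments. The function name `extract_sitemaps` is hypothetical, and `SAMPLE` is reconstructed from the values reported above rather than copied from the live file.

```python
def extract_sitemaps(robots_txt):
    """Collect the values of all 'sitemap' records in robots.txt text.

    Field names are matched case-insensitively. Anything after '#'
    is treated as a comment and ignored.
    """
    urls = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own colons survive.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Reconstructed sample, not the actual file contents.
SAMPLE = """\
User-agent: *
Disallow: */sweepstakes

Sitemap: https://www.thewoodlandsmall.com/arc/outboundfeeds/sitemap-index/
sitemap: https://www.thewoodlandsmall.com/arc/outboundfeeds/sitemap-section-index/
"""

for url in extract_sitemaps(SAMPLE):
    print(url)
```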