wholefoodsmagazine.com
robots.txt
Robots Exclusion Standard data for wholefoodsmagazine.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wholefoodsmagazine.com |
Base Domain | wholefoodsmagazine.com |
Scan Status | Ok |
Last Scan | 2024-06-07T20:14:57+00:00 |
Next Scan | 2024-06-14T20:14:57+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-07T20:14:57+00:00 |
URL | https://wholefoodsmagazine.com/robots.txt |
Redirect | https://www.wholefoodsmagazine.com/robots.txt |
Redirect Domain | www.wholefoodsmagazine.com |
Redirect Base | wholefoodsmagazine.com |
Domain IPs | 208.91.62.10, 208.91.62.11, 208.91.62.12, 208.91.62.13 |
Redirect IPs | 208.91.62.10, 208.91.62.11, 208.91.62.12, 208.91.62.13 |
Response IP | 208.91.62.11 |
Found | Yes |
Hash | 9854f1b5dadcea1e3f23f17f901a2afb28bcd8d56cd5cd835456ec662302f0c2 |
SimHash | af9d0e2d0670 |
Groups
*
Rule | Path |
---|---|
Disallow | /comments/flag/ |
Disallow | /search |
Disallow | /articles/comment/abuse |
Disallow | /articles/email |
Disallow | /articles/preview |
Disallow | /articles/print |
Disallow | /products/email |
Disallow | /products/print |
Disallow | /cart |
Disallow | /user/* |
Disallow | /*/log_view |
Disallow | /query/* |
Disallow | /media/video/ |
Allow | /media/videos/play/ |
Disallow | /polls |
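The rules listed for the `*` group can be checked against a URL path with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and treats `*` as a wildcard. The names `RULES`, `_pattern_to_regex`, and `is_allowed` are hypothetical and chosen only for illustration.

```python
import re

# Rules copied from the "*" group table above, in the order listed.
RULES = [
    ("disallow", "/comments/flag/"),
    ("disallow", "/search"),
    ("disallow", "/articles/comment/abuse"),
    ("disallow", "/articles/email"),
    ("disallow", "/articles/preview"),
    ("disallow", "/articles/print"),
    ("disallow", "/products/email"),
    ("disallow", "/products/print"),
    ("disallow", "/cart"),
    ("disallow", "/user/*"),
    ("disallow", "/*/log_view"),
    ("disallow", "/query/*"),
    ("disallow", "/media/video/"),
    ("allow", "/media/videos/play/"),
    ("disallow", "/polls"),
]


def _pattern_to_regex(path):
    """Turn a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under the given rules.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path matched by no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, path in rules:
        if _pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            if len(path) > best_len or (len(path) == best_len and allow):
                best_len = len(path)
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/search?q=tea", "/media/videos/play/123",
              "/articles/123/log_view", "/articles/some-story"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/media/videos/play/` is not covered by the `/media/video/` Disallow in the first place, because the prefix `/media/video/` does not match a path beginning `/media/videos`; the explicit Allow simply makes that intent visible.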
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
sitemap | https://www.wholefoodsmagazine.com/sitemap.xml |
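The `crawl-delay` and `sitemap` records can be read with Python's standard `urllib.robotparser`. The sketch below is illustrative only: the `ROBOTS_TXT` text is a partial reconstruction from the records shown here (the original file layout is not part of this scan, and only one Disallow rule is included), and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction of the file from the scan records,
# not the verbatim robots.txt served by the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Crawl-delay: 10

Sitemap: https://www.wholefoodsmagazine.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Seconds a crawler matching the "*" group is asked to wait between requests.
delay = parser.crawl_delay("*")

# Sitemap URLs declared in the file (a list, or None if there are none).
sitemaps = parser.site_maps()

# Prefix check against the single Disallow rule included above.
search_ok = parser.can_fetch("*", "https://www.wholefoodsmagazine.com/search")

if __name__ == "__main__":
    print(delay, sitemaps, search_ok)
```

Note that `urllib.robotparser` matches paths by plain prefix and does not expand `*` wildcards, so patterns such as `/*/log_view` would need a separate matcher.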