thrivemarket.com
robots.txt
Robots Exclusion Standard data for thrivemarket.com
Resource Scan
Scan Details
Site Domain | thrivemarket.com |
Base Domain | thrivemarket.com |
Scan Status | Ok |
Last Scan | 2024-09-11T13:11:13+00:00 |
Next Scan | 2024-09-25T13:11:13+00:00 |
Last Scan
Scanned | 2024-09-11T13:11:13+00:00 |
URL | https://thrivemarket.com/robots.txt |
Domain IPs | 3.213.223.212, 34.194.235.141, 34.233.67.34 |
Response IP | 3.213.223.212 |
Found | Yes |
Hash | 4abd6422857f661d61e12f3e64a6989132527f3124e09b62bfd977adcb4f8392 |
SimHash | 79155d408792 |
Groups
*
Rule | Path |
---|---|
Disallow | /checkout/* |
Disallow | /customer/* |
Disallow | /catalogsearch/* |
Disallow | /review/ |
Disallow | /enable-cookies/ |
Disallow | /*%28?%7C&%29sort |
Disallow | /*%28?%7C&%29filter |
Other Records
Field | Value |
---|---|
sitemap | https://thrivemarket.com/sitemap.xml |