hookah.com
robots.txt
Robots Exclusion Standard data for hookah.com
Resource Scan
Scan Details
Site Domain | hookah.com |
Base Domain | hookah.com |
Scan Status | Ok |
Last Scan | 2025-09-24T02:07:32+00:00 |
Next Scan | 2025-10-24T02:07:32+00:00 |
Last Scan
Scanned | 2025-09-24T02:07:32+00:00 |
URL | https://hookah.com/robots.txt |
Domain IPs | 108.157.254.107, 108.157.254.108, 108.157.254.117, 108.157.254.5 |
Response IP | 108.157.254.107 |
Found | Yes |
Hash | 6173c0a1cc37b2a6ed592a8edc5e0cfa44014f3ea30eed5b5686f35403731c93 |
SimHash | 61397a304fda |
Groups
*
Rule | Path |
---|---|
Allow | /*graphql |
Allow | /*static |
Disallow | /search |
Disallow | /*filter%3D |
Disallow | /*enable-cookies |
Disallow | /account |
Disallow | /checkout |
Disallow | /*?utm_source |
Disallow | /*?utm_ |
Disallow | *?page= |
Disallow | /blog/wp-admin/ |
Other Records
Field | Value |
---|---|
sitemap | https://hookah.com/sitemap.xml |
sitemap | https://hookah.com/blog/sitemap_index.xml |
Comments