hookah.com
robots.txt

Robots Exclusion Standard data for hookah.com

Resource Scan

Scan Details

Site Domain hookah.com
Base Domain hookah.com
Scan Status Ok
Last Scan2025-09-24T02:07:32+00:00
Next Scan 2025-10-24T02:07:32+00:00

Last Scan

Scanned2025-09-24T02:07:32+00:00
URL https://hookah.com/robots.txt
Domain IPs 108.157.254.107, 108.157.254.108, 108.157.254.117, 108.157.254.5
Response IP 108.157.254.107
Found Yes
Hash 6173c0a1cc37b2a6ed592a8edc5e0cfa44014f3ea30eed5b5686f35403731c93
SimHash 61397a304fda

Groups

*

Rule Path
Allow /*graphql
Allow /*static
Disallow /search
Disallow /*filter%3D
Disallow /*enable-cookies
Disallow /account
Disallow /checkout
Disallow /*?utm_source
Disallow /*?utm_
Disallow *?page=
Disallow /blog/wp-admin/

Other Records

Field Value
sitemap https://hookah.com/sitemap.xml
sitemap https://hookah.com/blog/sitemap_index.xml

Comments

  • Allowed Resources
  • Internal Search
  • My Account