hookah.bio
robots.txt

Robots Exclusion Standard data for hookah.bio

Resource Scan

Scan Details

Site Domain hookah.bio
Base Domain hookah.bio
Scan Status Ok
Last Scan 2025-10-06T22:38:20+00:00
Next Scan 2025-11-05T22:38:20+00:00

Last Scan

Scanned 2025-10-06T22:38:20+00:00
URL https://hookah.bio/robots.txt
Domain IPs 104.21.21.35, 172.67.196.91, 2606:4700:3030::6815:1523, 2606:4700:3031::ac43:c45b
Response IP 172.67.196.91
Found Yes
Hash 4e93aaf39724a75b84a0707e4ee9d846f00bd19662f5f915c7e1776c3825bd7e
SimHash 4b2248462353
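
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The helper names `sha256_hex` and `is_unchanged` are hypothetical.

```python
import hashlib

# Hash value copied from the scan details above.
REPORTED_HASH = "4e93aaf39724a75b84a0707e4ee9d846f00bd19662f5f915c7e1776c3825bd7e"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly fetched robots.txt body against the reported hash.

    Assumption: the scanner hashed the raw body with SHA-256. If it
    normalised the text first, this comparison will not match.
    """
    return sha256_hex(body) == reported.lower()
```

Passing the bytes of a freshly downloaded https://hookah.bio/robots.txt to `is_unchanged` would indicate whether the file still matches the scanned copy, provided the assumption about the algorithm holds.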

Groups

*

Rule Path
Allow /
Allow /am/
Allow /ar/
Allow /az/
Allow /bn/
Allow /cs/
Allow /da/
Allow /de/
Allow /el/
Allow /en/
Allow /es/
Allow /fa/
Allow /fr/
Allow /hi/
Allow /ht/
Allow /hu/
Allow /id/
Allow /ig/
Allow /it/
Allow /ja/
Allow /jv/
Allow /ko/
Allow /ms/
Allow /mr/
Allow /ne/
Allow /nl/
Allow /no/
Allow /om/
Allow /pa/
Allow /pl/
Allow /pt/
Allow /ro/
Allow /ru/
Allow /sd/
Allow /sv/
Allow /ta/
Allow /te/
Allow /th/
Allow /tl/
Allow /tr/
Allow /uk/
Allow /ur/
Allow /uz/
Allow /vi/
Allow /yo/
Allow /zh/
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /*.php$
Disallow /*.inc$
Disallow /*.sql
Disallow /*.gz
Disallow /*.log
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
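
The group above mixes a blanket `Allow /` with more specific Disallow rules, some using the `*` wildcard and the `$` end anchor. A minimal sketch of how a crawler following the longest-match convention (as described in RFC 9309) might evaluate them is shown below. It uses only a subset of the listed rules, and the function names `pattern_to_regex` and `is_allowed` are hypothetical. Individual crawlers may resolve rules differently.

```python
import re

# A subset of the rules listed above, as (directive, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("allow", "/en/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/*.php$"),
    ("disallow", "/*.sql"),
    ("disallow", "/xmlrpc.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the path. Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule allows the path.

    Ties between equally long patterns go to 'allow'. A path that
    matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and directive == "allow"):
                best_len = length
                allowed = directive == "allow"
    return allowed
```

Under this sketch, `is_allowed("/en/page")` is True because only Allow rules match, while `is_allowed("/index.php")` is False because `/*.php$` is longer than `/`. A path such as `/index.php?x=1` is allowed, since the `$` anchor keeps `/*.php$` from matching when a query string follows.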

Other Records

Field Value
sitemap https://hookah.bio/sitemap.xml

Comments

  • Disallow admin and system files/directories
  • Sitemap location