htobaco.com
robots.txt
Robots Exclusion Standard data for htobaco.com
Resource Scan
Scan Details
| Site Domain | htobaco.com |
| Base Domain | htobaco.com |
| Scan Status | Ok |
| Last Scan | 2025-10-25T07:05:50+00:00 |
| Next Scan | 2025-11-08T07:05:50+00:00 |
Last Scan
| Scanned | 2025-10-25T07:05:50+00:00 |
| URL | https://htobaco.com/robots.txt |
| Domain IPs | 104.21.39.230, 172.67.149.175, 2606:4700:3032::ac43:95af, 2606:4700:3035::6815:27e6 |
| Response IP | 104.21.39.230 |
| Found | Yes |
| Hash | 8a5c38a122dcaba92967720c3f53d186995e94b81e3805dacc6a813c26e7f2c2 |
| SimHash | 634767710ba7 |
Groups
*
No rules defined. All paths allowed.
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
*
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /*%26amp%3Blt%3Biframe |
| Disallow | /*?currency= |
| Disallow | /*/p*?page=* |
| Disallow | /*/page-*?page=* |
| Disallow | /cart |
| Disallow | */redirect |
Other Records
| Field | Value |
|---|---|
| sitemap | https://htobaco.com/en/sitemap.xml |
| sitemap | https://htobaco.com/ar/sitemap.xml |
Warnings
- 10 invalid lines.