guerrillatees.com
robots.txt
Robots Exclusion Standard data for guerrillatees.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | guerrillatees.com |
| Base Domain | guerrillatees.com |
| Scan Status | Ok |
| Last Scan | 2025-11-24T07:34:52+00:00 |
| Next Scan | 2025-12-24T07:34:52+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-24T07:34:52+00:00 |
| URL | https://guerrillatees.com/robots.txt |
| Domain IPs | 103.133.1.1 |
| Response IP | 103.133.1.1 |
| Found | Yes |
| Hash | 9acc3d37e5587bf56516ff05984bec5db5d42db6ec489182bec45b5380409b44 |
| SimHash | e1541e174053 |
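The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although this report does not name the algorithm or say whether the scanner hashes the raw response bytes or a normalized form. As a rough sketch under those assumptions (SHA-256 over the raw bytes), a fetched copy of the file could be compared against the recorded value as follows. The helper names `sha256_hex` and `matches_reported_hash` are hypothetical.

```python
import hashlib

# Value copied from the Hash row of the Last Scan table.
REPORTED_HASH = "9acc3d37e5587bf56516ff05984bec5db5d42db6ec489182bec45b5380409b44"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body with the hash recorded in the report.

    Assumption: the scanner used SHA-256 over the unmodified response body.
    If it used another algorithm or normalized the text first, this will
    return False even for an unchanged file.
    """
    return sha256_hex(body) == reported.strip().lower()
```

A mismatch therefore does not prove the file changed; it may only mean that the assumptions above do not hold for this scanner.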
Groups
googlebot
| Rule | Path |
|---|---|
| Disallow | /uploads/ |
| Disallow | /search/products |
| Disallow | /locations/search |
| Disallow | /webadmin |
googlebot-image
| Rule | Path |
|---|---|
| Disallow | /uploads/ |
| Disallow | /search/products |
| Disallow | /locations/search |
| Disallow | /webadmin |
*
| Rule | Path |
|---|---|
| Disallow | /uploads/ |
| Disallow | /search/products |
| Disallow | /locations/search |
| Disallow | /webadmin |
Other Records
| Field | Value |
|---|---|
| sitemap | https://guerrillatees.com/sitemap.xml |
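To see how the groups and the sitemap record above behave in practice, the sketch below rebuilds a robots.txt from the tables in this report and queries it with Python's standard `urllib.robotparser`. The rebuilt text is an assumption: the report lists the parsed rules, not the file's exact layout, casing, or ordering, so it will not reproduce the recorded hash. The sample paths and the helper names `build_robots_txt`, `load_parser`, and `is_allowed` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the Groups and Other Records tables;
# the real file's exact text is not shown in this report.
AGENTS = ["googlebot", "googlebot-image", "*"]
DISALLOWED = ["/uploads/", "/search/products", "/locations/search", "/webadmin"]
SITEMAP = "https://guerrillatees.com/sitemap.xml"
BASE_URL = "https://guerrillatees.com"


def build_robots_txt() -> str:
    """Render one User-agent group per agent, then the sitemap record."""
    blocks = []
    for agent in AGENTS:
        lines = [f"User-agent: {agent}"]
        lines += [f"Disallow: {path}" for path in DISALLOWED]
        blocks.append("\n".join(lines))
    blocks.append(f"Sitemap: {SITEMAP}")
    return "\n\n".join(blocks) + "\n"


def load_parser() -> RobotFileParser:
    """Parse the reconstructed text without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Check whether the given agent may fetch a path on the scanned site."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    parser = load_parser()
    for path in ["/", "/uploads/logo.png", "/webadmin/login", "/uploads"]:
        print(path, is_allowed(parser, "googlebot", path))
    print(parser.site_maps())  # site_maps() requires Python 3.8 or later
```

Because all three groups carry the same four rules, every crawler gets the same answer here. Matching in `urllib.robotparser` is by path prefix, so `/webadmin/login` is blocked by the `/webadmin` rule, while `/uploads` without a trailing slash is not covered by `/uploads/`. Other crawlers may implement matching differently.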