guerrillatees.com
robots.txt

Robots Exclusion Standard data for guerrillatees.com

Resource Scan

Scan Details

Site Domain guerrillatees.com
Base Domain guerrillatees.com
Scan Status Ok
Last Scan 2025-11-24T07:34:52+00:00
Next Scan 2025-12-24T07:34:52+00:00

Last Scan

Scanned 2025-11-24T07:34:52+00:00
URL https://guerrillatees.com/robots.txt
Domain IPs 103.133.1.1
Response IP 103.133.1.1
Found Yes
Hash 9acc3d37e5587bf56516ff05984bec5db5d42db6ec489182bec45b5380409b44
SimHash e1541e174053

Groups

googlebot

Rule Path
Disallow /uploads/
Disallow /search/products
Disallow /locations/search
Disallow /webadmin

googlebot-image

Rule Path
Disallow /uploads/
Disallow /search/products
Disallow /locations/search
Disallow /webadmin

facebookexternalhit

Rule Path
Allow /
Disallow /webadmin

*

Rule Path
Disallow /uploads/
Disallow /search/products
Disallow /locations/search
Disallow /webadmin

Other Records

Field Value
sitemap https://guerrillatees.com/sitemap.xml
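
To illustrate how the groups listed above would be evaluated, here is a minimal Python sketch. It is an illustration under stated assumptions, not the scanner's or any crawler's actual logic. The rules are transcribed from the Groups tables. The names GROUPS and is_allowed are hypothetical. Matching uses the longest-prefix rule described in RFC 9309, with Allow winning ties. User-agent selection is simplified to an exact, case-insensitive match on the group name, falling back to the * group. Wildcards (* and $) are not handled because none appear in the scanned rules.

```python
# Rules transcribed from the scan's Groups tables (assumed complete).
COMMON_RULES = [
    ("disallow", "/uploads/"),
    ("disallow", "/search/products"),
    ("disallow", "/locations/search"),
    ("disallow", "/webadmin"),
]

GROUPS = {
    "googlebot": COMMON_RULES,
    "googlebot-image": COMMON_RULES,
    "facebookexternalhit": [("allow", "/"), ("disallow", "/webadmin")],
    "*": COMMON_RULES,
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be fetched by `user_agent`.

    Simplification: the group is picked by exact (case-insensitive)
    name, otherwise the "*" group is used.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_kind, best_prefix = "allow", ""  # no matching rule means allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > len(best_prefix)
        tie_allow = len(prefix) == len(best_prefix) and kind == "allow"
        if longer or tie_allow:
            best_kind, best_prefix = kind, prefix
    return best_kind == "allow"


if __name__ == "__main__":
    for agent, path in [
        ("googlebot", "/uploads/logo.png"),
        ("facebookexternalhit", "/uploads/logo.png"),
        ("facebookexternalhit", "/webadmin"),
        ("SomeOtherBot", "/search/products"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, facebookexternalhit may fetch /uploads/ paths that the other groups block, while /webadmin stays disallowed for every group because the longer Disallow prefix outranks Allow /.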