guyisa.com
robots.txt
Robots Exclusion Standard data for guyisa.com
Resource Scan
Scan Details
Site Domain | guyisa.com |
Base Domain | guyisa.com |
Scan Status | Ok |
Last Scan | 2025-10-12T18:02:31+00:00 |
Next Scan | 2025-10-19T18:02:31+00:00 |
Last Scan
Scanned | 2025-10-12T18:02:31+00:00 |
URL | https://guyisa.com/robots.txt |
Domain IPs | 104.18.8.146 |
Response IP | 104.18.8.146 |
Found | Yes |
Hash | c38ca4f7cec091921e9ce0aa1e5e8daafc7af19d4a8db6eae5bf446b9ceb7ea8 |
SimHash | 287d02f26e81 |
Groups
*
Rule | Path |
---|---|
Disallow | /inc/ |
Allow | /static/js/ |
Allow | /static/css/ |
Allow | /static/themes/ |
Allow | /static/themes-v2/ |
Disallow | /static/ |
Disallow | /account/ |
Disallow | /tmp/ |
Disallow | /ajax/ |
Disallow | /cdn-cgi/ |
Disallow | /v-code/ |
Other Records
Field | Value |
---|---|
sitemap | https://guyisa.com/guyisa-com-sitemap.xml |