laguzman.com
robots.txt
Robots Exclusion Standard data for laguzman.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | laguzman.com |
| Base Domain | laguzman.com |
| Scan Status | Ok |
| Last Scan | 2025-11-28T08:37:53+00:00 |
| Next Scan | 2025-12-05T08:37:53+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-28T08:37:53+00:00 |
| URL | https://laguzman.com/robots.txt |
| Domain IPs | 104.21.5.154, 172.67.133.134, 2606:4700:3030::6815:59a, 2606:4700:3036::ac43:8586 |
| Response IP | 172.67.133.134 |
| Found | Yes |
| Hash | 392e3a0b178701a8f7626fc11d27684870bcfb4744ae5db7d7e7b68ddba9dca6 |
| SimHash | ab2cc4646818 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /bin/ |
| Disallow | /cache/ |
| Disallow | /cli/ |
| Disallow | /components/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /layouts/ |
| Disallow | /libraries/ |
| Disallow | /logs/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /tmp/ |
| Disallow | /blog/ |
| Disallow | /galeria/ |
| Disallow | /noticias/blogs/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.laguzman.com/xmlsitemap.xml |
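
The records above can be checked programmatically. The following is a minimal sketch, assuming the `*` group and the sitemap record are re-assembled into robots.txt lines (the report does not include the file's raw text) and evaluated with Python's standard `urllib.robotparser`. The helper names `build_parser` and `is_allowed` and the user agent `ExampleBot` are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the "*" group in this report.
DISALLOWED = [
    "/administrator/", "/bin/", "/cache/", "/cli/", "/components/",
    "/includes/", "/installation/", "/language/", "/layouts/",
    "/libraries/", "/logs/", "/modules/", "/plugins/", "/tmp/",
    "/blog/", "/galeria/", "/noticias/blogs/",
]
# Sitemap URL copied from the "Other Records" table.
SITEMAP = "https://www.laguzman.com/xmlsitemap.xml"


def build_parser() -> RobotFileParser:
    """Rebuild robots.txt lines from the report (an assumption, not the
    original file text) and load them into a parser without any network I/O."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in DISALLOWED]
    lines.append(f"Sitemap: {SITEMAP}")
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, "https://laguzman.com" + path)


if __name__ == "__main__":
    parser = build_parser()
    for path in ["/", "/blog/post-1", "/noticias/", "/noticias/blogs/x"]:
        print(path, is_allowed(parser, path))
    print(parser.site_maps())
```

Note that `urllib.robotparser` matches rules by simple path prefix, so `/noticias/` stays fetchable while anything under `/noticias/blogs/` does not. `site_maps()` requires Python 3.8 or later.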