trefamiglia.com
robots.txt
Robots Exclusion Standard data for trefamiglia.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | trefamiglia.com |
| Base Domain | trefamiglia.com |
| Scan Status | Ok |
| Last Scan | 2026-01-05T02:44:09+00:00 |
| Next Scan | 2026-02-04T02:44:09+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-01-05T02:44:09+00:00 |
| URL | https://www.trefamiglia.com/robots.txt |
| Domain IPs | 35.212.40.238 |
| Response IP | 35.212.40.238 |
| Found | Yes |
| Hash | 348563d69eb16e8eaa45faf554abc9c246ed4bf3e97f4d171a2ed2909417603e |
| SimHash | a95d557025d9 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /cache/ |
| Disallow | /components/ |
| Disallow | /images/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /libraries/ |
| Disallow | /media/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /templates/ |
| Disallow | /tmp/ |
| Disallow | /xmlrpc/ |
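To see how a crawler would interpret the wildcard group above, the rules can be fed to Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt text is rebuilt from the table rather than fetched from the site, and the helper `is_allowed`, the user agent `ExampleBot`, and the sample paths are hypothetical names chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the Disallow rows of the "*" group in this report.
DISALLOWED = [
    "/administrator/", "/cache/", "/components/", "/images/",
    "/includes/", "/installation/", "/language/", "/libraries/",
    "/media/", "/modules/", "/plugins/", "/templates/",
    "/tmp/", "/xmlrpc/",
]

# Rebuild an equivalent robots.txt body (assumption: the live file may
# differ in ordering, comments, or whitespace).
robots_lines = ["User-agent: *"] + [f"Disallow: {path}" for path in DISALLOWED]

parser = RobotFileParser()
parser.parse(robots_lines)


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://www.trefamiglia.com" + path)


if __name__ == "__main__":
    # Sample paths are illustrative only; they are not taken from the site.
    for sample in ("/", "/index.php", "/images/logo.png", "/administrator/index.php"):
        print(sample, is_allowed(sample))
```

Because each rule ends in a slash and matching is by path prefix, everything beneath a listed directory is blocked, while the site root and top-level pages remain fetchable.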
Other Records
| Field | Value |
|---|---|
| sitemap | http://cdn.attracta.com/sitemap/341218.xml.gz |
| sitemap | http://cdn.attracta.com/sitemap/1302198.xml.gz |
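The sitemap records can be read with the same standard-library parser, which exposes them through `RobotFileParser.site_maps()` (available since Python 3.8). The sketch below assumes a shortened robots.txt body containing a single Disallow rule plus the two sitemap lines from the table; the variable names are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Shortened stand-in for the scanned file (assumption: only the sitemap
# lines matter here, so most Disallow rules are omitted).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /tmp/",
    "Sitemap: http://cdn.attracta.com/sitemap/341218.xml.gz",
    "Sitemap: http://cdn.attracta.com/sitemap/1302198.xml.gz",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns None when no Sitemap records exist, so fall back
# to an empty list to keep the loop below safe.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```

Sitemap records are independent of any user-agent group, which is why the report lists them under "Other Records" rather than under the `*` group.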