marcomweb.it
robots.txt
Robots Exclusion Standard data for marcomweb.it
Resource Scan
Scan Details
| Site Domain | marcomweb.it |
| Base Domain | marcomweb.it |
| Scan Status | Ok |
| Last Scan | 2025-10-31T05:27:20+00:00 |
| Next Scan | 2025-11-14T05:27:20+00:00 |
Last Scan
| Scanned | 2025-10-31T05:27:20+00:00 |
| URL | https://marcomweb.it/robots.txt |
| Domain IPs | 104.21.43.15, 172.67.215.215, 2606:4700:3036::ac43:d7d7, 2606:4700:3037::6815:2b0f |
| Response IP | 172.67.215.215 |
| Found | Yes |
| Hash | 8ed3d786acf560b9592785581172212a3c66cc2feef8b860196d2b9e81f2a9f2 |
| SimHash | 42350953cdd4 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /api/ |
| Disallow | /bin/ |
| Disallow | /cache/ |
| Disallow | /cli/ |
| Disallow | /components/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /layouts/ |
| Disallow | /libraries/ |
| Disallow | /logs/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /tmp/ |
Warnings
- `content-signal` is not a known field.
Comments