web.tiscali.it
robots.txt
Robots Exclusion Standard data for web.tiscali.it
Resource Scan
Scan Details
| Site Domain | web.tiscali.it |
| Base Domain | tiscali.it |
| Scan Status | Ok |
| Last Scan | 2026-04-03T23:57:55+00:00 |
| Next Scan | 2026-04-10T23:57:55+00:00 |
Last Scan
| Scanned | 2026-04-03T23:57:55+00:00 |
| URL | https://web.tiscali.it/robots.txt |
| Redirect | https://www.tiscali.it/export/sites/default/robots.txt |
| Redirect Domain | www.tiscali.it |
| Redirect Base | tiscali.it |
| Domain IPs | 213.205.32.58 |
| Redirect IPs | 213.205.32.10 |
| Response IP | 213.205.32.10 |
| Found | Yes |
| Hash | 5a1d6a7c271231a29e331a2e43f6c0e51b03b00d69b15d782998e02a930c7245 |
| SimHash | e8187d42e413 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /services/ |
| Disallow | /system/ |
| Disallow | /export/system/ |
| Disallow | /.content/ |
| Disallow | /search/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.tiscali.it/sitemap.xml |