tiscali.de
robots.txt
Robots Exclusion Standard data for tiscali.de
Resource Scan
Scan Details
| Site Domain | tiscali.de |
| Base Domain | tiscali.de |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Couldn't connect to server. |
| Last Scan | 2026-04-06T02:33:38+00:00 |
| Next Scan | 2026-04-20T02:33:38+00:00 |
Last Successful Scan
| Scanned | 2026-02-27T00:00:49+00:00 |
| URL | http://www.tiscali.de/robots.txt |
| Redirect | https://www.tiscali.it/export/sites/default/robots.txt |
| Redirect Domain | www.tiscali.it |
| Redirect Base | tiscali.it |
| Domain IPs | 213.205.32.58 |
| Redirect IPs | 213.205.32.10 |
| Response IP | 213.205.32.10 |
| Found | Yes |
| Hash | 5a1d6a7c271231a29e331a2e43f6c0e51b03b00d69b15d782998e02a930c7245 |
| SimHash | e8187d42e413 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /services/ |
| Disallow | /system/ |
| Disallow | /export/system/ |
| Disallow | /.content/ |
| Disallow | /search/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.tiscali.it/sitemap.xml |