tiscali.it
robots.txt
Robots Exclusion Standard data for tiscali.it
Resource Scan
Scan Details
Site Domain | tiscali.it |
Base Domain | tiscali.it |
Scan Status | Ok |
Last Scan | 2024-11-15T04:34:23+00:00 |
Next Scan | 2024-11-16T04:34:23+00:00 |
Last Scan
Scanned | 2024-11-15T04:34:23+00:00 |
URL | https://tiscali.it/robots.txt |
Redirect | https://www.tiscali.it/export/sites/default/robots.txt |
Redirect Domain | www.tiscali.it |
Redirect Base | tiscali.it |
Domain IPs | 213.205.32.10 |
Redirect IPs | 213.205.32.10 |
Response IP | 213.205.32.10 |
Found | Yes |
Hash | 5a1d6a7c271231a29e331a2e43f6c0e51b03b00d69b15d782998e02a930c7245 |
SimHash | e8187d42e413 |
Groups
*
Rule | Path |
---|---|
Disallow | /services/ |
Disallow | /system/ |
Disallow | /export/system/ |
Disallow | /.content/ |
Disallow | /search/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.tiscali.it/sitemap.xml |