webspace.tiscali.it
robots.txt

Robots Exclusion Standard data for webspace.tiscali.it

Resource Scan

Scan Details

Site Domain webspace.tiscali.it
Base Domain tiscali.it
Scan Status Ok
Last Scan2024-09-16T15:04:50+00:00
Next Scan 2024-09-23T15:04:50+00:00

Last Scan

Scanned2024-09-16T15:04:50+00:00
URL https://webspace.tiscali.it/robots.txt
Redirect https://www.tiscali.it/export/sites/default/robots.txt
Redirect Domain www.tiscali.it
Redirect Base tiscali.it
Domain IPs 213.205.32.10
Redirect IPs 213.205.32.10
Response IP 213.205.32.10
Found Yes
Hash 5a1d6a7c271231a29e331a2e43f6c0e51b03b00d69b15d782998e02a930c7245
SimHash e8187d42e413

Groups

*

Rule Path
Disallow /services/
Disallow /system/
Disallow /export/system/
Disallow /.content/
Disallow /search/

Other Records

Field Value
sitemap https://www.tiscali.it/sitemap.xml