sardegna.com
robots.txt

Robots Exclusion Standard data for sardegna.com

Resource Scan

Scan Details

Site Domain sardegna.com
Base Domain sardegna.com
Scan Status Ok
Last Scan2024-06-20T17:05:41+00:00
Next Scan 2024-07-20T17:05:41+00:00

Last Scan

Scanned2024-06-20T17:05:41+00:00
URL https://www.sardegna.com/robots.txt
Domain IPs 104.88.70.106, 23.50.232.232, 2600:1413:a000::1734:282a, 2600:1413:a000::1734:284a
Response IP 23.44.4.160
Found Yes
Hash 982c04558c5d3726dc59f54cade9085e06aaf13c6029b3d061b68ba7047a6fa9
SimHash 8c53c7868d72

Groups

sistrix

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

*

Rule Path
Disallow /code/admin/
Disallow /code/no/
Disallow /code/booking_ecommerce/
Disallow /code/commenti/
Disallow /code/monumenti_int/subregion/2/path_ospitalita_indice/
Disallow /code/booking/preventivo/
Disallow /*/path_ospitalita_indice/
Disallow /*/0/$
Disallow /jp/
Disallow /*/LINGUA/JP
Disallow /code/popup/
Disallow /*/click_ricerca_strutture/
Disallow /*/kors/
Disallow /code/trasporti/escursioni/
Disallow */archiv*

Warnings

  • 2 invalid lines.