treccani.it
robots.txt

Robots Exclusion Standard data for treccani.it

Resource Scan

Scan Details

Site Domain treccani.it
Base Domain treccani.it
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-11-16T21:58:08+00:00
Next Scan 2025-02-14T21:58:08+00:00

Last Successful Scan

Scanned 2023-10-24T18:43:05+00:00
URL https://treccani.it/robots.txt
Redirect https://www.treccani.it/robots.txt
Redirect Domain www.treccani.it
Redirect Base treccani.it
Domain IPs 156.54.191.160
Redirect IPs 156.54.191.160
Response IP 156.54.191.160
Found Yes
Hash c44d9e51c7c59025ea45b408b587db4af97b0ca65b2c3494d41d0722c788b3d6
SimHash addd3bc0c771

Groups

mediapartners-google

Rule Path
Disallow

icc-crawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

yandex

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

rogerbot

Rule Path
Allow /Portale/
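
A crawler that matches a named group follows that group's rules rather than those of the `*` group. This can be checked with Python's standard `urllib.robotparser`. The sketch below is only an illustration: the `SAMPLE` text is reconstructed by hand from four of the groups listed on this page (the raw file is not reproduced here), and `ExampleBot` is a hypothetical crawler name.

```python
from urllib.robotparser import RobotFileParser

# Hand-reconstructed excerpt (assumption: not the raw file, only four groups).
SAMPLE = """\
User-agent: mediapartners-google
Disallow:

User-agent: baiduspider
Disallow: /

User-agent: rogerbot
Allow: /Portale/

User-agent: *
Disallow: /Portale/
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())

URL = "https://www.treccani.it/Portale/"

# ExampleBot is hypothetical; it matches no named group and falls back to "*".
AGENTS = ["mediapartners-google", "baiduspider", "rogerbot", "ExampleBot"]
results = {agent: parser.can_fetch(agent, URL) for agent in AGENTS}

for agent, ok in results.items():
    print(f"{agent}: {'allowed' if ok else 'blocked'}")
```

An empty `Disallow` (as for mediapartners-google) blocks nothing, so that crawler is allowed. `urllib.robotparser` matches paths by prefix only, so it is not suitable for evaluating the `*` and `$` wildcard rules in the `*` group.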

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

*

Rule Path
Disallow /Portale/
Disallow /vocabolario/tag/
Disallow *?gclid=*$
Disallow /enciclopedia/ricerca/
Disallow /enciclopedia/dettaglio-immagini/
Disallow /ext-tool/
Disallow *?pubblica=1$
Disallow *?stampa=1$
Disallow *%%28link
Disallow *?ovo_video=
Disallow *?nt=1$
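
Several rules in the `*` group use wildcards: `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the URL. A minimal sketch of that matching, assuming the RFC 9309 interpretation and ignoring `Allow` precedence (the `*` group has no `Allow` rules), is shown below. The helper names `rule_to_regex` and `is_disallowed` are hypothetical and are not part of any library.

```python
import re

# Disallow paths of the "*" group, copied from the listing above.
STAR_RULES = [
    "/Portale/",
    "/vocabolario/tag/",
    "*?gclid=*$",
    "/enciclopedia/ricerca/",
    "/enciclopedia/dettaglio-immagini/",
    "/ext-tool/",
    "*?pubblica=1$",
    "*?stampa=1$",
    "*%%28link",
    "*?ovo_video=",
    "*?nt=1$",
]

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts; each "*" becomes "match anything".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_disallowed(path: str, rules=STAR_RULES) -> bool:
    """True if any rule matches, starting from the beginning of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

for path in [
    "/enciclopedia/roma/",
    "/enciclopedia/ricerca/roma/",
    "/vocabolario/casa/?stampa=1",
    "/vocabolario/casa/?stampa=1&nt=1",
    "/enciclopedia/roma/?gclid=abc",
]:
    print(path, "->", "blocked" if is_disallowed(path) else "allowed")
```

Because of the `$` anchor, `?stampa=1` is blocked only when it ends the URL. The fourth path carries an extra parameter and therefore matches none of the rules in this sketch.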