hoepli.it
robots.txt

Robots Exclusion Standard data for hoepli.it

Resource Scan

Scan Details

Site Domain hoepli.it
Base Domain hoepli.it
Scan Status Ok
Last Scan2025-12-14T04:26:41+00:00
Next Scan 2026-01-13T04:26:41+00:00

Last Scan

Scanned2025-12-14T04:26:41+00:00
URL https://hoepli.it/robots.txt
Redirect https://www.hoepli.it/robots.txt
Redirect Domain www.hoepli.it
Redirect Base hoepli.it
Domain IPs 194.113.89.83
Redirect IPs 194.113.89.83
Response IP 194.113.89.83
Found Yes
Hash a30f6ca0fc3b0ea05ba8ba2142c9c75d0b02b7c72f1a40946b370cc5ca74ed63
SimHash 684edc6ab31e

Groups

*

Rule Path
Disallow /addcart.asp
Disallow /addcart.asp?
Disallow /LasciaUnCommento.aspx?
Disallow /addtocartConsigli.aspx
Disallow /VerifyImage.asp?
Disallow /compratiInsieme.asp
Disallow /Libro_Preview.asp
Disallow /carrelloRicerca.asp
Disallow /carrelloRicerca.asp?
Disallow /WebService/xt
Disallow /webservice/xt
Disallow /add
Disallow /carrello.
Disallow /Carrello.
Disallow /cerca/
Disallow /Cerca/
Disallow *.html*so%3D*
Disallow *.html*vz%3D*
Disallow *.html*va%3D*
Disallow *.html*od%3D*
Disallow /giacenze/
Disallow /Giacenze/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.hoepli.it/siteMap/sitemapindex.xml