conti.segugio.it
robots.txt

Robots Exclusion Standard data for conti.segugio.it

Resource Scan

Scan Details

Site Domain conti.segugio.it
Base Domain segugio.it
Scan Status Ok
Last Scan 2025-09-22T09:34:29+00:00
Next Scan 2025-10-22T09:34:29+00:00

Last Scan

Scanned 2025-09-22T09:34:29+00:00
URL https://conti.segugio.it/robots.txt
Domain IPs 213.92.12.182
Response IP 213.92.12.182
Found Yes
Hash 614ae40759cafead66fe846a88dc87caa1efc4f7b0885e4a541b2181dedf3068
SimHash 686d94945548
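
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming it is SHA-256 over the raw response body, a fetched copy of the file could be compared against the reported value roughly as follows. The function names `sha256_hex` and `matches_reported` are illustrative, not part of any scanner API.

```python
import hashlib

# Hash value as listed in the report. That it is SHA-256 of the raw
# response body is an assumption based only on its length (64 hex digits).
REPORTED = "614ae40759cafead66fe846a88dc87caa1efc4f7b0885e4a541b2181dedf3068"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of `body` as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED) -> bool:
    """True if the digest of `body` equals the reported hash."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would not by itself show that the file changed: the scanner may normalise line endings or encoding before hashing.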

Groups

*

Rule Path
Disallow /index-light.aspx
Disallow /trasferimenti/*
Disallow /news-conti/news.aspx
Disallow /news/news.aspx
Disallow /banche-conti/banca.aspx
Disallow /*risultati-ricerca*.aspx
Disallow /banche-conti/*?in
Disallow /news-conti/Votazione/
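
Several of the paths in the `*` group use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way these patterns could be evaluated, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every pattern is otherwise a case-sensitive prefix match. The helper names are hypothetical, and the test paths in the usage note are made up for illustration.

```python
import re

# Disallow patterns copied from the "*" group of the report.
DISALLOW = [
    "/index-light.aspx",
    "/trasferimenti/*",
    "/news-conti/news.aspx",
    "/news/news.aspx",
    "/banche-conti/banca.aspx",
    "/*risultati-ricerca*.aspx",
    "/banche-conti/*?in",
    "/news-conti/Votazione/",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex for re.match."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """True if any Disallow pattern matches `path` from its start.

    The group contains no Allow rules, so a single match is enough;
    no longest-match tie-breaking is needed here.
    """
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, `is_disallowed("/trasferimenti/abc")` and `is_disallowed("/conti/risultati-ricerca-conto.aspx")` are true, while `is_disallowed("/news-conti/votazione/")` is false because matching is case-sensitive.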

semrushbot

Rule Path
Disallow /

mozilla/4.0 (compatible; synapse)

Rule Path
Disallow /

synapse

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /
Disallow /tm/*
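
A crawler applies only one of the groups listed above: the one whose name matches its own user-agent token, or the `*` group when none matches. A minimal sketch of that selection, assuming an exact case-insensitive comparison on the token (real crawlers may match more loosely), could look like this. `GROUPS` and `select_group` are names chosen for the example.

```python
# Group name -> Disallow paths, copied from the report.
GROUPS = {
    "*": [
        "/index-light.aspx",
        "/trasferimenti/*",
        "/news-conti/news.aspx",
        "/news/news.aspx",
        "/banche-conti/banca.aspx",
        "/*risultati-ricerca*.aspx",
        "/banche-conti/*?in",
        "/news-conti/Votazione/",
    ],
    "semrushbot": ["/"],
    "mozilla/4.0 (compatible; synapse)": ["/"],
    "synapse": ["/"],
    "linguee bot": ["/", "/tm/*"],
}


def select_group(product_token: str) -> str:
    """Return the group name that applies to a crawler token.

    Assumption: exact, case-insensitive match; fall back to "*".
    """
    token = product_token.strip().lower()
    return token if token in GROUPS else "*"
```

For example, `select_group("SemrushBot")` returns `"semrushbot"`, whose only rule is `Disallow /`, while an unlisted token such as `"Googlebot"` falls back to `"*"`. In the `linguee bot` group the `/tm/*` rule is redundant, since `/` already covers every path.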

Other Records

Field Value
sitemap https://img.gruppomol.it/sitemaps/sitemap-segugioconti.xml

Comments

  • Location of bd6ff4239bdf4dabb5c78ee6fbc9eca4.txt

Warnings

  • `bd6ff4239bdf4dabb5c78ee6fbc9eca4` is not a known field.