sostenibilita.ilgiornaledivicenza.it
robots.txt

Robots Exclusion Standard data for sostenibilita.ilgiornaledivicenza.it

Resource Scan

Scan Details

Site Domain sostenibilita.ilgiornaledivicenza.it
Base Domain ilgiornaledivicenza.it
Scan Status Ok
Last Scan 2024-04-25T23:18:24+00:00
Next Scan 2024-05-25T23:18:24+00:00

Last Scan

Scanned 2024-04-25T23:18:24+00:00
URL https://sostenibilita.ilgiornaledivicenza.it/robots.txt
Domain IPs 156.54.131.85
Response IP 156.54.131.85
Found Yes
Hash 6b4fdf4d15e8830e0e68f1c55ba3e000222c9a0aae8220ad964934a3012b8340
SimHash 00048960c331

Groups

*

Rule Path
Allow /
Disallow /topics
Disallow /altro/ricerca
Disallow /t/
Disallow /q/

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
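
How these groups apply to a given crawler and path can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's or any crawler's actual implementation. It assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), exact case-insensitive matching of the user-agent token, and plain prefix matching with no wildcard support. The names GROUPS and is_allowed, the crawler name Googlebot, and the example paths are hypothetical.

```python
# Groups transcribed from the scan above. User-agent tokens are stored in
# lowercase and compared case-insensitively.
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/topics"),
        ("disallow", "/altro/ricerca"),
        ("disallow", "/t/"),
        ("disallow", "/q/"),
    ],
    "gptbot": [("disallow", "/")],
    "anthropic-ai": [("disallow", "/")],
    "ccbot": [("disallow", "/")],
    "google-extended": [("disallow", "/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be fetched by `user_agent`.

    Simplifications (assumptions of this sketch): the group is chosen by an
    exact lowercase token match, falling back to "*", and rules are matched
    as plain path prefixes.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        # Longest matching prefix wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot", "/"),
        ("Googlebot", "/topics/energia"),
        ("Googlebot", "/articolo"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, the four named crawlers are blocked from the whole site, while every other crawler may fetch anything except paths under /topics, /altro/ricerca, /t/ and /q/. Parsers that apply rules in file order rather than by longest match may treat the leading "Allow /" in the * group differently.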