gazzettadimodena.gelocal.it
robots.txt

Robots Exclusion Standard data for gazzettadimodena.gelocal.it

Resource Scan

Scan Details

Site Domain gazzettadimodena.gelocal.it
Base Domain gelocal.it
Scan Status Ok
Last Scan2024-05-10T23:04:11+00:00
Next Scan 2024-05-17T23:04:11+00:00

Last Scan

Scanned2024-05-10T23:04:11+00:00
URL https://gazzettadimodena.gelocal.it/robots.txt
Domain IPs 13.226.210.117, 13.226.210.12, 13.226.210.40, 13.226.210.72
Response IP 18.165.171.5
Found Yes
Hash 0572767a39a9bf7592a7fdd4603a1d85672b8111b694a25ecc440d5466065294
SimHash 7a384860a133

Groups

*

Rule Path
Disallow /stampa-articolo/
Disallow /dettaglio/*?edizione=
Disallow /dettaglio-news/*?edizione=

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /