static.repubblica.it
robots.txt

Robots Exclusion Standard data for static.repubblica.it

Resource Scan

Scan Details

Site Domain static.repubblica.it
Base Domain repubblica.it
Scan Status Ok
Last Scan2024-09-21T17:38:32+00:00
Next Scan 2024-09-28T17:38:32+00:00

Last Scan

Scanned2024-09-21T17:38:32+00:00
URL https://static.repubblica.it/robots.txt
Domain IPs 13.226.2.120, 13.226.2.28, 13.226.2.49, 13.226.2.68
Response IP 18.165.140.116
Found Yes
Hash b4eeef4e99429cd34336cf955886c13ae532793c209321f768f87d1f45f2f50a
SimHash 00044b602133

Groups

*

Rule Path
Allow /
Disallow /parma/report/dossierLibera2013.pdf

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /