la.repubblica.it
robots.txt

Robots Exclusion Standard data for la.repubblica.it

Resource Scan

Scan Details

Site Domain la.repubblica.it
Base Domain repubblica.it
Scan Status Ok
Last Scan2024-05-17T15:51:19+00:00
Next Scan 2024-05-24T15:51:19+00:00

Last Scan

Scanned2024-05-17T15:51:19+00:00
URL https://la.repubblica.it/robots.txt
Domain IPs 18.239.199.118, 18.239.199.50, 18.239.199.82, 18.239.199.83
Response IP 18.165.171.36
Found Yes
Hash 4e69a41362c1d545e1abfce885661825fc1b3f3bd46a5a0f6636c879d79e489f
SimHash 40204a602133

Groups

*

Rule Path
Allow /
Disallow /ssincludes/
Disallow /cucina?s=
Disallow /cucina/wp-content/
Disallow /cucina/wp-admin/

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /