Robots Exclusion Standard data for corrieredirieti.it

Resource Scan

Scan Details

Site Domain corrieredirieti.it
Base Domain corrieredirieti.it
Scan Status Ok
Last Scan 2024-11-14T15:43:00+00:00
Next Scan 2024-11-21T15:43:00+00:00

Last Scan

Scanned 2024-11-14T15:43:00+00:00
URL https://corrieredirieti.it/robots.txt
Redirect https://www.corrieredirieti.it/robots.txt
Redirect Domain www.corrieredirieti.it
Redirect Base corrieredirieti.it
Domain IPs 104.21.47.244, 172.67.174.143, 2606:4700:3032::6815:2ff4, 2606:4700:3033::ac43:ae8f
Redirect IPs 104.21.47.244, 172.67.174.143, 2606:4700:3032::6815:2ff4, 2606:4700:3033::ac43:ae8f
Response IP 104.21.47.244
Found Yes
Hash efa072281d551b6e4e7abd36b0bf0895ef02280412ff43b7109605905071559c
SimHash 689c4940a093

Groups

*

Rule Path
Disallow /?s=
Disallow /wp-admin/
Disallow /galleria/

claudebot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.corrieredirieti.it/sitemap_index.xml
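The groups and the sitemap record listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a hypothetical reconstruction from the scan data, not the original file: the exact directive text, casing, and group order are assumptions, and only two of the blocked user agents are included for brevity. The agent name `examplebot` and the path `/news/` are likewise made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the scanned file (assumption:
# the real robots.txt may differ in wording, order, and casing).
ROBOTS_TXT = """\
User-agent: *
Disallow: /?s=
Disallow: /wp-admin/
Disallow: /galleria/

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

Sitemap: https://www.corrieredirieti.it/sitemap_index.xml
"""

BASE = "https://www.corrieredirieti.it"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the parsed rules."""
    return parser.can_fetch(agent, BASE + path)


# A named group with "Disallow: /" blocks that agent everywhere.
print(allowed("gptbot", "/"))
# An agent without its own group falls back to the "*" group.
print(allowed("examplebot", "/wp-admin/"))
print(allowed("examplebot", "/news/"))  # "/news/" is a made-up path
# Sitemap records are exposed separately from the rule groups.
print(parser.site_maps())
```

Note that `urllib.robotparser` matches rules by simple path prefix, so results for wildcard patterns or query-string rules such as `/?s=` may differ from how a given crawler interprets them.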