ilgiornaledivicenza.it
robots.txt

Robots Exclusion Standard data for ilgiornaledivicenza.it

Resource Scan

Scan Details

Site Domain ilgiornaledivicenza.it
Base Domain ilgiornaledivicenza.it
Scan Status Ok
Last Scan 2024-11-09T17:26:44+00:00
Next Scan 2024-11-16T17:26:44+00:00

Last Scan

Scanned 2024-11-09T17:26:44+00:00
URL https://ilgiornaledivicenza.it/robots.txt
Redirect https://www.ilgiornaledivicenza.it/robots.txt
Redirect Domain www.ilgiornaledivicenza.it
Redirect Base ilgiornaledivicenza.it
Domain IPs 156.54.131.85, 156.54.187.100
Redirect IPs 3.165.82.15, 3.165.82.39, 3.165.82.42, 3.165.82.50
Response IP 3.165.82.39
Found Yes
Hash 1b7d1d5624c10251ac39c24998105d1105ab976c1f4e5a8a022d3c4f63a39f95
SimHash 620c4a608331
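
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The following is a minimal sketch, assuming SHA-256 computed over the raw response body (both are assumptions), of how a separately fetched copy of the file could be compared with the recorded value. The helper names `body_digest` and `matches_recorded` are hypothetical.

```python
import hashlib

# Value copied from the Hash field of this report.
RECORDED_HASH = "1b7d1d5624c10251ac39c24998105d1105ab976c1f4e5a8a022d3c4f63a39f95"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes) -> bool:
    """True if the body hashes to the value recorded in the report."""
    return body_digest(body) == RECORDED_HASH
```

A mismatch would not necessarily mean the file changed: it could equally mean the scanner uses a different algorithm or normalizes the body before hashing.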

Groups

*

Rule Path
Allow /
Disallow /topics
Disallow /altro/ricerca
Disallow /t/
Disallow /q/

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
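
The groups above can be read as follows: the eight named crawlers are blocked from the whole site, while every other crawler falls back to the `*` group, where the longest matching path decides. The sketch below illustrates that reading using the longest-match precedence described in RFC 9309. It is not the scanner's or the site's code: the rules are transcribed from this listing, the function name `is_allowed` is hypothetical, user agents are compared by exact lowercase name for simplicity, and the sample paths in the usage note are invented for illustration.

```python
# Rules transcribed from the Groups listing in this report.
BLOCK_ALL = [("disallow", "/")]
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/topics"),
        ("disallow", "/altro/ricerca"),
        ("disallow", "/t/"),
        ("disallow", "/q/"),
    ],
    "gptbot": BLOCK_ALL,
    "anthropic-ai": BLOCK_ALL,
    "ccbot": BLOCK_ALL,
    "google-extended": BLOCK_ALL,
    "facebookbot": BLOCK_ALL,
    "omgilibot": BLOCK_ALL,
    "cohere-ai": BLOCK_ALL,
    "chatgpt-user": BLOCK_ALL,
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Apply longest-match precedence; on a tie, prefer the allow rule."""
    # Simplification: exact lowercase name lookup, falling back to "*".
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("Googlebot", "/topics/sport")` is false because `/topics` is a longer match than `/`, while `is_allowed("GPTBot", "/")` is false because the `gptbot` group disallows everything. Note that Python's standard `urllib.robotparser` applies rules in file order rather than by longest match, so it may give a different answer for the `*` group, where `Allow /` is listed first.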

Other Records

Field Value
sitemap https://www.ilgiornaledivicenza.it/sitemap.xml