claudioguarini.it
robots.txt

Robots Exclusion Standard data for claudioguarini.it

Resource Scan

Scan Details

Site Domain claudioguarini.it
Base Domain claudioguarini.it
Scan Status Ok
Last Scan2024-10-08T22:09:36+00:00
Next Scan 2024-10-15T22:09:36+00:00

Last Scan

Scanned2024-10-08T22:09:36+00:00
URL https://www.claudioguarini.it/robots.txt
Domain IPs 2404:6800:4003:c02::79, 74.125.68.121
Response IP 74.125.130.121
Found Yes
Hash b025777d1ab62c2abcbd94e1545648ee0db10f042390e23481469bec1c85e0ad
SimHash 6b0492704613

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap https://www.claudioguarini.it/sitemap.xml