giuntiscuola.it
robots.txt

Robots Exclusion Standard data for giuntiscuola.it

Resource Scan

Scan Details

Site Domain giuntiscuola.it
Base Domain giuntiscuola.it
Scan Status Ok
Last Scan2024-09-22T15:50:59+00:00
Next Scan 2024-10-22T15:50:59+00:00

Last Scan

Scanned2024-09-22T15:50:59+00:00
URL https://www.giuntiscuola.it/robots.txt
Domain IPs 13.35.18.14, 13.35.18.24, 13.35.18.5, 13.35.18.98, 2600:9000:20c7:2400:e:2b58:6c00:93a1, 2600:9000:20c7:2800:e:2b58:6c00:93a1, 2600:9000:20c7:3a00:e:2b58:6c00:93a1, 2600:9000:20c7:400:e:2b58:6c00:93a1, 2600:9000:20c7:4a00:e:2b58:6c00:93a1, 2600:9000:20c7:ba00:e:2b58:6c00:93a1, 2600:9000:20c7:ee00:e:2b58:6c00:93a1, 2600:9000:20c7:f000:e:2b58:6c00:93a1
Response IP 13.35.18.24
Found Yes
Hash cb7b117be68522dc764f5b5d4fc94dcffd1ae56691047ed6badb6ee9237accca
SimHash 6950dc208d13

Groups

*

Rule Path
Disallow /area-personale*
Disallow /ricerca*
Disallow /d/api/*
Disallow /d/graphql/*

Other Records

Field Value
sitemap https://www.giuntiscuola.it/d/sitemap.xml