giuntiscuola.it
robots.txt
Robots Exclusion Standard data for giuntiscuola.it
Resource Scan
Scan Details
Site Domain | giuntiscuola.it |
Base Domain | giuntiscuola.it |
Scan Status | Ok |
Last Scan | 2024-09-22T15:50:59+00:00 |
Next Scan | 2024-10-22T15:50:59+00:00 |
Last Scan
Scanned | 2024-09-22T15:50:59+00:00 |
URL | https://www.giuntiscuola.it/robots.txt |
Domain IPs | 13.35.18.14, 13.35.18.24, 13.35.18.5, 13.35.18.98, 2600:9000:20c7:2400:e:2b58:6c00:93a1, 2600:9000:20c7:2800:e:2b58:6c00:93a1, 2600:9000:20c7:3a00:e:2b58:6c00:93a1, 2600:9000:20c7:400:e:2b58:6c00:93a1, 2600:9000:20c7:4a00:e:2b58:6c00:93a1, 2600:9000:20c7:ba00:e:2b58:6c00:93a1, 2600:9000:20c7:ee00:e:2b58:6c00:93a1, 2600:9000:20c7:f000:e:2b58:6c00:93a1 |
Response IP | 13.35.18.24 |
Found | Yes |
Hash | cb7b117be68522dc764f5b5d4fc94dcffd1ae56691047ed6badb6ee9237accca |
SimHash | 6950dc208d13 |
Groups
*
Rule | Path |
---|---|
Disallow | /area-personale* |
Disallow | /ricerca* |
Disallow | /d/api/* |
Disallow | /d/graphql/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.giuntiscuola.it/d/sitemap.xml |