andreapacchiarotti.it
robots.txt
Robots Exclusion Standard data for andreapacchiarotti.it
Resource Scan
Scan Details
| Site Domain | andreapacchiarotti.it |
| Base Domain | andreapacchiarotti.it |
| Scan Status | Ok |
| Last Scan | 2025-08-14T19:30:36+00:00 |
| Next Scan | 2025-08-21T19:30:36+00:00 |
Last Scan
| Scanned | 2025-08-14T19:30:36+00:00 |
| URL | https://andreapacchiarotti.it/robots.txt |
| Redirect | https://www.andreapacchiarotti.it/robots.txt |
| Redirect Domain | www.andreapacchiarotti.it |
| Redirect Base | andreapacchiarotti.it |
| Domain IPs | 89.46.105.33 |
| Redirect IPs | 89.46.105.33 |
| Response IP | 89.46.105.33 |
| Found | Yes |
| Hash | 709461e7a4dcff120f6610c36b2119d8768577a26e81497674f0038fe937b399 |
| SimHash | a95947858793 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
| Allow | /archivio/roma |
| Allow | /archivio/religione |
| Allow | /archivio/genealogia |
| Disallow | /httpdocs/ |
| Disallow | /fonts/ |
| Disallow | /cgi-bin/ |
| Disallow | /privacy-cookie/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.andreapacchiarotti.it/sitemap.xml |