andreapacchiarotti.it
robots.txt

Robots Exclusion Standard data for andreapacchiarotti.it

Resource Scan

Scan Details

Site Domain andreapacchiarotti.it
Base Domain andreapacchiarotti.it
Scan Status Ok
Last Scan2025-08-14T19:30:36+00:00
Next Scan 2025-08-21T19:30:36+00:00

Last Scan

Scanned2025-08-14T19:30:36+00:00
URL https://andreapacchiarotti.it/robots.txt
Redirect https://www.andreapacchiarotti.it/robots.txt
Redirect Domain www.andreapacchiarotti.it
Redirect Base andreapacchiarotti.it
Domain IPs 89.46.105.33
Redirect IPs 89.46.105.33
Response IP 89.46.105.33
Found Yes
Hash 709461e7a4dcff120f6610c36b2119d8768577a26e81497674f0038fe937b399
SimHash a95947858793

Groups

*

Rule Path
Allow /
Allow /archivio/roma
Allow /archivio/religione
Allow /archivio/genealogia
Disallow /httpdocs/
Disallow /fonts/
Disallow /cgi-bin/
Disallow /privacy-cookie/

Other Records

Field Value
sitemap https://www.andreapacchiarotti.it/sitemap.xml