istitutocapirola.edu.it
robots.txt

Robots Exclusion Standard data for istitutocapirola.edu.it

Resource Scan

Scan Details

Site Domain istitutocapirola.edu.it
Base Domain istitutocapirola.edu.it
Scan Status Ok
Last Scan2025-12-06T11:53:26+00:00
Next Scan 2026-01-05T11:53:26+00:00

Last Scan

Scanned2025-12-06T11:53:26+00:00
URL https://istitutocapirola.edu.it/robots.txt
Domain IPs 104.21.16.214, 172.67.215.243, 2606:4700:3034::ac43:d7f3, 2606:4700:3035::6815:10d6
Response IP 104.21.16.214
Found Yes
Hash e2ce9b70decb44dd9f9961a73ac6e05fd8741921fcf5e05cf39a44f492de9fc8
SimHash eba3c8424013

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /readme.html
Disallow /license.txt
Disallow /cgi-bin/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /*/feed/
Disallow /*/embed/
Allow /wp-admin/admin-ajax.php
Disallow /*?*utm_source=
Disallow /*?*utm_medium=
Disallow /*?*utm_campaign=

Other Records

Field Value
sitemap https://istitutocapirola.edu.it/sitemap.xml

Comments

  • Blocca i parametri comuni usati per il tracciamento
  • Sitemap