csiti.cl
robots.txt

Robots Exclusion Standard data for csiti.cl

Resource Scan

Scan Details

Site Domain csiti.cl
Base Domain csiti.cl
Scan Status Ok
Last Scan2025-12-13T22:47:57+00:00
Next Scan 2025-12-20T22:47:57+00:00

Last Scan

Scanned2025-12-13T22:47:57+00:00
URL https://csiti.cl/robots.txt
Domain IPs 104.21.22.167, 172.67.205.238, 2606:4700:3032::ac43:cdee, 2606:4700:3036::6815:16a7
Response IP 172.67.205.238
Found Yes
Hash b456b0b74c55b5a19bc7c730093dec6ebc5a86f6b51a039ed2e5a1e6da666b23
SimHash d318480a2a3b

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /wp-content/debug.log
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /wp-json/
Disallow /comments/
Disallow /?s=
Disallow /search/
Disallow /feed/
Disallow /?attachment_id=
Disallow /*.php$
Disallow /*.cgi$
Disallow /*.log$
Disallow /*.json$
Disallow /*.git$
Disallow /*.env$

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

httrack

Rule Path
Disallow /

wget

Rule Path
Disallow /
Allow /wp-content/uploads/

Other Records

Field Value
sitemap https://csiti.cl/sitemap_index.xml

Comments

  • Bloquea el acceso al backend de WordPress
  • Evita que los motores de búsqueda indexen archivos internos
  • Evita que los bots accedan a rutas sensibles
  • Bloquea feeds RSS y archivos innecesarios
  • Bloqueo adicional para prevenir bots agresivos (Google seguirá accediendo)
  • Permitir acceso a imágenes y contenido útil
  • Sitemap para ayudar a los motores de búsqueda a indexar correctamente