csisaludintegral.com
robots.txt

Robots Exclusion Standard data for csisaludintegral.com

Resource Scan

Scan Details

Site Domain csisaludintegral.com
Base Domain csisaludintegral.com
Scan Status Ok
Last Scan 2026-02-04T18:58:11+00:00
Next Scan 2026-03-06T18:58:11+00:00

Last Scan

Scanned 2026-02-04T18:58:11+00:00
URL https://csisaludintegral.com/robots.txt
Domain IPs 64.31.53.210
Response IP 64.31.53.210
Found Yes
Hash 3ab2b7b504532f75063a446097ccd175626c87d4cad86eee5256d85b8dd19951
SimHash eb6288420033
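
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not say which algorithm or input was used, so the following is only a sketch under the assumption that the hash is SHA-256 over the raw robots.txt response body. The function name `content_fingerprint` is hypothetical and is not part of the scan tool.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes as served, with no
    normalization of whitespace or line endings.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored fingerprint with a freshly fetched body."""
    return content_fingerprint(body) != previous_hash.lower()
```

A scanner could store the digest from one scan and call `has_changed` on the next one to detect edits. An exact hash flags any byte-level difference, which is presumably why the report also lists a separate SimHash for near-duplicate comparison.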

Groups

*

Rule Path
Disallow
Allow /wp-admin/admin-ajax.php
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/

gptbot

Rule Path
Allow /
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/
Disallow /*.php$
Disallow /wp-login.php
Disallow /xmlrpc.php

google-extended

Rule Path
Allow /
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/
Disallow /*.php$
Disallow /wp-login.php
Disallow /xmlrpc.php
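
The gptbot and google-extended groups mix a blanket `Allow /` with narrower Disallow rules, so the outcome for a given URL depends on how a crawler resolves overlapping matches. The sketch below applies the longest-match rule from RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) to the gptbot rules listed above. The names `pattern_to_regex` and `is_allowed` are hypothetical, percent-encoding is not handled, and real crawlers may differ in detail.

```python
import re

# Rules copied from the gptbot group above, as (directive, path) pairs.
GPTBOT_RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/*.php$"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/xmlrpc.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays fetchable because its Allow rule is longer than both `/wp-admin/` and `/*.php$`, while `/index.php` is blocked by `/*.php$` despite the blanket `Allow /`.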

Other Records

Field Value
sitemap https://www.csisaludintegral.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK
  • Allow public access for GPTBot
  • Allow access for Google Gemini (Google-Extended)