clau.com
robots.txt

Robots Exclusion Standard data for clau.com

Resource Scan

Scan Details

Site Domain clau.com
Base Domain clau.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-07-29T11:16:34+00:00
Next Scan 2025-10-27T11:16:34+00:00

Last Successful Scan

Scanned 2025-03-09T08:24:03+00:00
URL https://clau.com/robots.txt
Redirect https://www.clau.com/robots.txt
Redirect Domain www.clau.com
Redirect Base clau.com
Domain IPs 66.33.60.34, 66.33.60.67
Redirect IPs 66.33.60.67, 76.76.21.22
Response IP 76.76.21.61
Found Yes
Hash 792a3f936fc85ee2ab53e5fc4a94579757c9b5c149532773c3e71c7adbf4e1f5
SimHash aa10e80547a5

Groups

*

Rule Path
Disallow /favoritos
Disallow /historial
Disallow /pdf/
Disallow /_next/
Disallow /files/
Disallow /manifest.json
Disallow /*.json$
Disallow /*.css$
Disallow /*.js$
Disallow /*?dpl=*
Disallow /*%3A//*
Disallow /typesense-service
Disallow /*.clau.com/
Disallow /api/*
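
Several of the rules above rely on the `*` wildcard and the `$` end anchor, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be matched against a URL path. It assumes the common convention where `*` matches any run of characters, a trailing `$` anchors the end of the URL, and every other rule is a prefix match. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and since the group contains no Allow rules, no longest-match precedence is implemented.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/favoritos", "/historial", "/pdf/", "/_next/", "/files/",
    "/manifest.json", "/*.json$", "/*.css$", "/*.js$", "/*?dpl=*",
    "/*%3A//*", "/typesense-service", "/*.clau.com/", "/api/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and rejoin them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rule.match(path) for rule in RULES)
```

Under these assumptions, `is_disallowed("/static/app.js")` is true because of `/*.js$`, while `is_disallowed("/static/app.js?v=2")` is false because the `$` anchor requires the URL to end in `.js`.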

Other Records

Field Value
sitemap https://www.clau.com/elliving/sitemap.xml
sitemap https://www.clau.com/catalogo.xml
sitemap https://www.clau.com/catalogo-desarrollos.xml
sitemap https://www.clau.com/catalogo-recamaras.xml
sitemap https://www.clau.com/catalogo-estacionamientos.xml
sitemap https://www.clau.com/catalogo-amenidades.xml

Comments

  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.
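
A warning like the one above typically arises when a line uses a field name outside the set a parser recognizes. The sketch below shows one way a scanner might separate records, comments, and warnings. The `KNOWN_FIELDS` set, the `classify` function, and the `SAMPLE` text are all assumptions made for illustration; in particular, `SAMPLE` is a hypothetical fragment and not the actual content of the scanned file.

```python
# Field names this sketch treats as recognized (an assumption, not the
# scanner's actual list).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

# Hypothetical robots.txt fragment; the Host value is made up for illustration.
SAMPLE = """\
# Host
Host: https://www.clau.com

User-agent: *
Disallow: /favoritos

# Sitemaps
Sitemap: https://www.clau.com/catalogo.xml
"""


def classify(text: str):
    """Split robots.txt text into (records, comments, warnings)."""
    records, comments, warnings = [], [], []
    for raw in text.splitlines():
        # Everything after '#' is a comment; the rest may hold a record.
        line, _, comment = raw.partition("#")
        if comment.strip():
            comments.append(comment.strip())
        line = line.strip()
        if not line or ":" not in line:
            continue
        # Split on the first ':' only, so URLs in the value stay intact.
        field, _, value = line.partition(":")
        field = field.strip().lower()
        records.append((field, value.strip()))
        if field not in KNOWN_FIELDS:
            warnings.append(f"`{field}` is not a known field.")
    return records, comments, warnings
```

Calling `classify(SAMPLE)` yields two comments (`Host` and `Sitemaps`) and a single warning for the `host` field, mirroring the shape of the report above.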