webalia.com
robots.txt

Robots Exclusion Standard data for webalia.com

Resource Scan

Scan Details

Site Domain webalia.com
Base Domain webalia.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-08-28T07:41:56+00:00
Next Scan 2024-09-27T07:41:56+00:00

Last Successful Scan

Scanned2024-07-30T07:40:46+00:00
URL https://webalia.com/robots.txt
Domain IPs 82.98.174.142
Response IP 82.98.174.142
Found Yes
Hash abf7828ebd0c52560ceb287b9338dc7f996d9c4968b0719f49b4062d18a88cff
SimHash c9495c38bfb2

Groups

*

Rule Path
Disallow /loguearse-con-facebook/
Disallow /temp/
Disallow /informacion-general/
Disallow /perfil/
Disallow /gestionmax/
Allow /gestionmax/cookies/
Allow /gestionmax/js/
Allow /gestionmax/css/
Disallow /cgi-bin/
Disallow /click.php
Disallow /error.php
Disallow *print%3DS*
Disallow *accion%3D*
Disallow *mensaje%3D*
Disallow *-ordfecha*
Disallow *-ordtitulo*
Disallow *-ordautor*
Disallow *-ordprecio*
Disallow *-ordprioridad*

Comments

  • Pseudo-parĂ¡metros para imprimir, acciones, mensajes
  • Pseudo-parĂ¡metros para ordenaciones