webalia.com
robots.txt
Robots Exclusion Standard data for webalia.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | webalia.com |
Base Domain | webalia.com |
Scan Status | Ok |
Last Scan | 2024-11-16T08:43:56+00:00 |
Next Scan | 2024-11-23T08:43:56+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-16T08:43:56+00:00 |
URL | https://webalia.com/robots.txt |
Domain IPs | 82.98.174.142 |
Response IP | 82.98.174.142 |
Found | Yes |
Hash | abf7828ebd0c52560ceb287b9338dc7f996d9c4968b0719f49b4062d18a88cff |
SimHash | c9495c38bfb2 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /loguearse-con-facebook/ |
Disallow | /temp/ |
Disallow | /informacion-general/ |
Disallow | /perfil/ |
Disallow | /gestionmax/ |
Allow | /gestionmax/cookies/ |
Allow | /gestionmax/js/ |
Allow | /gestionmax/css/ |
Disallow | /cgi-bin/ |
Disallow | /click.php |
Disallow | /error.php |
Disallow | *print%3DS* |
Disallow | *accion%3D* |
Disallow | *mensaje%3D* |
Disallow | *-ordfecha* |
Disallow | *-ordtitulo* |
Disallow | *-ordautor* |
Disallow | *-ordprecio* |
Disallow | *-ordprioridad* |
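
The group above mixes Allow and Disallow rules with `*` wildcards, so it may help to see how a crawler could evaluate them. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The function names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical, and patterns are compared literally with no percent-decoding, which is a simplification.

```python
import re

# Rules copied from the table above, in file order.
RULES = [
    ("Disallow", "/loguearse-con-facebook/"),
    ("Disallow", "/temp/"),
    ("Disallow", "/informacion-general/"),
    ("Disallow", "/perfil/"),
    ("Disallow", "/gestionmax/"),
    ("Allow", "/gestionmax/cookies/"),
    ("Allow", "/gestionmax/js/"),
    ("Allow", "/gestionmax/css/"),
    ("Disallow", "/cgi-bin/"),
    ("Disallow", "/click.php"),
    ("Disallow", "/error.php"),
    ("Disallow", "*print%3DS*"),
    ("Disallow", "*accion%3D*"),
    ("Disallow", "*mensaje%3D*"),
    ("Disallow", "*-ordfecha*"),
    ("Disallow", "*-ordtitulo*"),
    ("Disallow", "*-ordautor*"),
    ("Disallow", "*-ordprecio*"),
    ("Disallow", "*-ordprioridad*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally (no percent-decoding: an assumption).
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern decides; Allow wins a tie;
    a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Hypothetical paths, chosen only to exercise the rules above.
for sample in ["/gestionmax/", "/gestionmax/css/site.css", "/noticias-ordfecha", "/"]:
    print(sample, is_allowed(sample))
```

Under these assumptions, `/gestionmax/css/site.css` is crawlable because the Allow pattern `/gestionmax/css/` is longer than the Disallow pattern `/gestionmax/`. Because the comparison is literal, the `%3D` patterns in this sketch match only paths that contain the encoded form, not a plain `=`.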