as.com
robots.txt

Robots Exclusion Standard data for as.com

Resource Scan

Scan Details

Site Domain as.com
Base Domain as.com
Scan Status Ok
Last Scan2024-04-28T07:03:11+00:00
Next Scan 2024-05-05T07:03:11+00:00

Last Scan

Scanned2024-04-28T07:03:11+00:00
URL https://as.com/robots.txt
Domain IPs 23.209.46.76, 23.209.46.92, 2600:1413:b000:14::b857:c145, 2600:1413:b000:14::b857:c14c
Response IP 42.99.140.208
Found Yes
Hash ed4da8f26ea91351575414d41a411667ae8bdd8e46a5dbf29d79216f61bd573e
SimHash 610c14d73368

Groups

*

Rule Path
Disallow /buscador/
Disallow /publicidad/
Disallow /pruebas/
Disallow /kioskoymas/promociones/
Disallow /notificarelacionadas
Disallow /pxlctl.gif
Disallow /pxlctl2.gif
Disallow /*.swf$
Disallow /eskupTSUpdate
Disallow /ThreadeskupSimple
Disallow /Comentarios/
Disallow /OuteskupSimple
Disallow /vdpep/1/
Disallow /estaticos/aviso_legal.html
Disallow /formularios/
Disallow /diarioas/politica_privacidad.html
Disallow /pdf/Condiciones_Generales_Suscripciones_DIARIO_AS.pdf
Disallow *environment%3Dint*
Disallow *environment%3Dprod*
Disallow /encuestas/resultados/
Disallow /*.js2$
Allow /Comentarios/comentarios_js_v3.html