los40.cl
robots.txt
Robots Exclusion Standard data for los40.cl
Resource Scan
Scan Details
Site Domain | los40.cl |
Base Domain | los40.cl |
Scan Status | Ok |
Last Scan | 2024-06-04T07:34:49+00:00 |
Next Scan | 2024-06-11T07:34:49+00:00 |
Last Scan
Scanned | 2024-06-04T07:34:49+00:00 |
URL | https://los40.cl/robots.txt |
Domain IPs | 23.52.171.128, 23.52.171.160, 2600:1413:3800:3::172d:cfc8, 2600:1413:3800:3::172d:cfcf |
Response IP | 42.99.140.138 |
Found | Yes |
Hash | 8995a26b39e24bf4aadfae0e40b23aca67d8424cac446133c4c0b25f92cd9f21 |
SimHash | b40c77354795 |
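The report does not say how the Hash value is computed. Its 64 hexadecimal characters are consistent with a SHA-256 digest, so the sketch below assumes it is SHA-256 over the raw response body. That is an assumption, not something this page confirms. The helper names `body_digest` and `matches_report` are hypothetical and used only for illustration.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "8995a26b39e24bf4aadfae0e40b23aca67d8424cac446133c4c0b25f92cd9f21"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the reported value under the SHA-256 assumption."""
    return body_digest(body) == reported.lower()
```

To compare against the live file, fetch https://los40.cl/robots.txt as bytes and pass the body to `matches_report`. A mismatch may mean the file changed since the scan, or that the scanner hashes something other than the raw body.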
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /pxlctl.gif |
Disallow | /pxlctl2.gif |
Disallow | /*.swf$ |
Disallow | /pruebas/ |
Disallow | /newsletter/ |
Disallow | /especiales/reunionprog/ |
Disallow | /includes/ |
Disallow | /mnt/ |
Disallow | /pf/api/ |
Disallow | /tag//* |
Disallow | /embed/ |
Disallow | /preview/ |
Disallow | */Aes/$ |
Disallow | /*.Aes/$ |
Disallow | */Ves/$ |
Disallow | */Tes/$ |
Disallow | /buscar/ |
Disallow | */buscador/ |
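Several of the paths above use the `*` wildcard and the `$` end anchor. As a minimal sketch of how such rules are commonly evaluated, the code below translates each path into a regular expression matched from the start of the URL path. It assumes the widely used wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path). Individual crawlers may differ, particularly for rules that begin with `*` rather than `/`. The names `rule_to_regex` and `is_disallowed` are illustrative only. Because this group contains only Disallow rules, no Allow precedence logic is needed here.

```python
import re

# Disallow paths copied from the table above (group "*").
DISALLOW = [
    "/pxlctl.gif", "/pxlctl2.gif", "/*.swf$", "/pruebas/", "/newsletter/",
    "/especiales/reunionprog/", "/includes/", "/mnt/", "/pf/api/", "/tag//*",
    "/embed/", "/preview/", "*/Aes/$", "/*.Aes/$", "*/Ves/$", "*/Tes/$",
    "/buscar/", "*/buscador/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")          # trailing "$" pins the end of the path
    body = rule[:-1] if anchored else rule
    # Escape literal pieces and let each "*" match any run of characters.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """True if any Disallow rule matches the path from its first character."""
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("/flash/intro.swf")` is true, while `is_disallowed("/flash/intro.swf?x=1")` is false because `/*.swf$` requires the path to end in `.swf`. Likewise `/tag//*` matches `/tag//foo` but not `/tag/foo`.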