ccaa.elpais.com
robots.txt
Robots Exclusion Standard data for ccaa.elpais.com
Resource Scan
Scan Details
Site Domain | ccaa.elpais.com |
Base Domain | elpais.com |
Scan Status | Ok |
Last Scan | 2024-04-24T01:23:36+00:00 |
Next Scan | 2024-05-08T01:23:36+00:00 |
Last Scan
Scanned | 2024-04-24T01:23:36+00:00 |
URL | https://ccaa.elpais.com/robots.txt |
Domain IPs | 199.232.194.133, 199.232.198.133 |
Response IP | 151.101.42.133 |
Found | Yes |
Hash | e58b5f15876826714c4267cdf67c215e04a703cc79d48d96e85aa33704a94c0d |
SimHash | b0164c242bb3 |
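The Hash field above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan record does not name the algorithm. Under that assumption, a minimal sketch for checking whether a freshly fetched copy of the file is byte-identical to the scanned one might look like this. The function name `matches_recorded_hash` is hypothetical, and whether the scanner hashes the raw response bytes is also an assumption.

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "e58b5f15876826714c4267cdf67c215e04a703cc79d48d96e85aa33704a94c0d"


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare the SHA-256 digest of `body` with a recorded hex digest.

    Assumption: the scanner hashed the raw robots.txt response bytes
    with SHA-256; the scan record itself does not say so.
    """
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

A mismatch would only indicate that the file differs from the scanned copy (or that the assumption about the algorithm is wrong); it says nothing about which rules changed.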
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /buscador/ |
Disallow | /m/buscador/ |
Disallow | /pruebas/ |
Disallow | /publicidad/ |
Disallow | /notificarelacionadas |
Disallow | /*.swf$ |
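To illustrate how a crawler might evaluate the rules in this group, the sketch below translates each Disallow pattern into a regular expression, treating `*` as "any run of characters" and a trailing `$` as an end-of-path anchor, as described in the Robots Exclusion Protocol. The helper names `rule_to_regex` and `is_allowed` are hypothetical, and the example paths in the usage note are invented for illustration. Because this group contains no Allow rules, the sketch does not implement longest-match precedence between Allow and Disallow.

```python
import re

# Disallow rules copied from the table above for the "*" group.
DISALLOW = [
    "/buscador/",
    "/m/buscador/",
    "/pruebas/",
    "/publicidad/",
    "/notificarelacionadas",
    "/*.swf$",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the path. Everything else is matched literally as a prefix.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


RULES = [rule_to_regex(rule) for rule in DISALLOW]


def is_allowed(path: str) -> bool:
    """Return False if any Disallow rule matches the given URL path."""
    return not any(rule.match(path) for rule in RULES)
```

Under these assumptions, `is_allowed("/buscador/?q=madrid")` is `False` because the path starts with `/buscador/`, while `is_allowed("/buscadores")` is `True` because the rule requires the trailing slash. The `$` anchor means `/static/banner.swf` is blocked but `/static/banner.swf?v=1` is not.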