Robots Exclusion Standard data for soydecaravaca.laverdad.es
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | soydecaravaca.laverdad.es |
Base Domain | laverdad.es |
Scan Status | Ok |
Last Scan | 2024-05-21T07:10:16+00:00 |
Next Scan | 2024-06-20T07:10:16+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-21T07:10:16+00:00 |
URL | https://soydecaravaca.laverdad.es/robots.txt |
Domain IPs | 23.215.7.14, 23.215.7.21 |
Response IP | 23.59.168.184 |
Found | Yes |
Hash | bb01ffd6afe8417afe47cfa4929275234363fbba447baac40ef0363234859b35 |
SimHash | 2f0098c5a537 |
Groups
*
Rule | Path |
---|---|
Disallow | /modulos/ |
Disallow | /includes/ |
Disallow | /noticias/*/sincomentario |
Disallow | /NFS/ |
Disallow | /*?ns_ |
Disallow | /4900/webm.LAVERDAD/ |
Disallow | /4900/vocento.laverdad/ |
Disallow | /guia-tv/ |
Disallow | /temas/ |
Disallow | /directos/ |
Disallow | /la_verdad/noticias/ |
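The rules above can be checked against sample paths with a short sketch. It assumes the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, matching is case-sensitive and starts at the beginning of the path). The names `pattern_to_regex` and `is_disallowed` are hypothetical helpers, not part of any library, and the sample paths are made up for illustration. Because this group has no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow patterns for the "*" group, copied from the table above.
DISALLOW = [
    "/modulos/",
    "/includes/",
    "/noticias/*/sincomentario",
    "/NFS/",
    "/*?ns_",
    "/4900/webm.LAVERDAD/",
    "/4900/vocento.laverdad/",
    "/guia-tv/",
    "/temas/",
    "/directos/",
    "/la_verdad/noticias/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (assumed RFC 9309 rules).

    '*' becomes '.*', a trailing '$' anchors the end of the path,
    and every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise each kind of rule.
    for sample in [
        "/modulos/x.js",
        "/noticias/2024/05/sincomentario",
        "/portada?ns_campaign=x",
        "/nfs/",
        "/deportes/",
    ]:
        print(sample, is_disallowed(sample))
```

Note that `/NFS/` does not block `/nfs/` under these assumptions, since path matching is case-sensitive.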
Other Records
Field | Value |
---|---|
sitemap | https://soydecaravaca.laverdad.es/sitemap.xml |
sitemap | https://soydecaravaca.laverdad.es/sitemap.incremental.xml |
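As a rough illustration, the sitemap records can be read back with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text below is a reconstruction from the tables in this report and is an assumption: the real file may differ in ordering, casing, or whitespace, so it is not expected to reproduce the hash listed above. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file, built from the tables in this report.
ROBOTS_TXT = """\
User-agent: *
Disallow: /modulos/
Disallow: /includes/
Disallow: /noticias/*/sincomentario
Disallow: /NFS/
Disallow: /*?ns_
Disallow: /4900/webm.LAVERDAD/
Disallow: /4900/vocento.laverdad/
Disallow: /guia-tv/
Disallow: /temas/
Disallow: /directos/
Disallow: /la_verdad/noticias/
Sitemap: https://soydecaravaca.laverdad.es/sitemap.xml
Sitemap: https://soydecaravaca.laverdad.es/sitemap.incremental.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns the Sitemap URLs in file order, or None if there are none.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    for url in sitemaps or []:
        print(url)
```

The standard-library parser matches rule paths as plain prefixes, so it is used here only to extract the sitemap records, not to evaluate the wildcard rules.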