media.lavozdegalicia.es
robots.txt
Robots Exclusion Standard data for media.lavozdegalicia.es
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | media.lavozdegalicia.es |
Base Domain | lavozdegalicia.es |
Scan Status | Ok |
Last Scan | 2024-05-11T07:12:26+00:00 |
Next Scan | 2024-05-18T07:12:26+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-11T07:12:26+00:00 |
URL | https://media.lavozdegalicia.es/robots.txt |
Domain IPs | 52.31.103.128, 52.48.120.227, 54.171.138.58 |
Response IP | 52.48.120.227 |
Found | Yes |
Hash | 8d5a0b105132188b81aac31fbcaf7b92d52570172366553018f71894aaa05430 |
SimHash | ebdb78c7cff5 |
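The Hash value can be used to detect whether the file has changed since this scan. The report does not name the hash algorithm; the 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function names `sha256_hex` and `is_unchanged` are illustrative and not part of the scanner.

```python
import hashlib

# Digest reported by the scan for https://media.lavozdegalicia.es/robots.txt.
# Assumption: SHA-256 over the raw response bytes (the report does not say).
REPORTED_HASH = "8d5a0b105132188b81aac31fbcaf7b92d52570172366553018f71894aaa05430"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if a freshly fetched body hashes to the reported digest."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would mean either that the file changed after 2024-05-11 or that the scanner hashes something other than the raw bytes, for example normalized text.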
Groups
*
Rule | Path |
---|---|
Disallow | /SSEE/ |
Disallow | /VentaPDF/ |
Disallow | /votacion/ |
Disallow | /comentarios/ |
Disallow | /test/ |
Disallow | /embeds/post_scriptum.php |
*
No rules defined. All paths allowed.
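To see how the Disallow rules of the first `*` group apply to concrete paths, the sketch below rebuilds that group as robots.txt lines and checks them with Python's standard `urllib.robotparser`. The exact layout of the original file is an assumption reconstructed from the table, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the first "*" group, reconstructed from the rule table above.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /SSEE/",
    "Disallow: /VentaPDF/",
    "Disallow: /votacion/",
    "Disallow: /comentarios/",
    "Disallow: /test/",
    "Disallow: /embeds/post_scriptum.php",
]

BASE = "https://media.lavozdegalicia.es"

# Parse the in-memory lines instead of fetching the live file.
parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def is_allowed(path: str, agent: str = "*") -> bool:
    """True if the given user agent may fetch BASE + path under these rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the prefix matching.
    for path in ["/SSEE/index.html", "/embeds/post_scriptum.php",
                 "/embeds/other.php", "/testing/"]:
        print(path, is_allowed(path))
```

Matching is by path prefix and is case-sensitive, so `/test/` blocks `/test/page` but not `/testing/`.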
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
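A crawler that chooses to honor this value could space its requests as sketched below. Crawl-delay is not part of RFC 9309, and reading the value as seconds is a convention, not something the report states. The report also lists the record outside any group, so applying it to all user agents is an assumption. `wait_needed`, `polite_fetch_all`, and the `fetch` callback are hypothetical names.

```python
import time

# Assumption: the reported crawl-delay is in seconds and applies to every agent.
CRAWL_DELAY_SECONDS = 30


def wait_needed(last_fetch, now, delay=CRAWL_DELAY_SECONDS):
    """Seconds still to wait before the next request may be sent."""
    if last_fetch is None:  # nothing fetched yet, no wait required
        return 0.0
    return max(0.0, delay - (now - last_fetch))


def polite_fetch_all(urls, fetch, delay=CRAWL_DELAY_SECONDS,
                     clock=time.monotonic, sleep=time.sleep):
    """Call fetch(url) for each URL, keeping at least `delay` seconds between calls.

    clock and sleep are injectable so the pacing can be tested without waiting.
    """
    last = None
    results = []
    for url in urls:
        pause = wait_needed(last, clock(), delay)
        if pause > 0:
            sleep(pause)
        results.append(fetch(url))
        last = clock()
    return results
```

At 30 seconds per request, a crawler following this pacing fetches at most 120 URLs per hour from the host.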
Warnings
- 2 invalid lines.