cgkaarlanderveen.nl
robots.txt
Robots Exclusion Standard data for cgkaarlanderveen.nl
Resource Scan
Scan Details
Site Domain | cgkaarlanderveen.nl |
Base Domain | cgkaarlanderveen.nl |
Scan Status | Ok |
Last Scan | 2024-11-02T12:50:42+00:00 |
Next Scan | 2024-11-16T12:50:42+00:00 |
Last Scan
Scanned | 2024-11-02T12:50:42+00:00 |
URL | https://cgkaarlanderveen.nl/robots.txt |
Domain IPs | 141.138.168.125, 2a03:3c00:a002:180::1000 |
Response IP | 141.138.168.125 |
Found | Yes |
Hash | 1895183657f0ef26883a373c2c0583c62f26c8fa5b13c4296a1f5bfca228d5ee |
SimHash | 5ad10912ee30 |
Groups
*
Rule | Path |
---|---|
Disallow | /dlm_uploads/ |
*
Rule | Path |
---|---|
Disallow | /site/wp-content/uploads/dlm_uploads/ |
*
Rule | Path |
---|---|
Allow | / |
Warnings
- 12 invalid lines.