lidl.de
robots.txt
Robots Exclusion Standard data for lidl.de
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | lidl.de |
Base Domain | lidl.de |
Scan Status | Ok |
Last Scan | 2024-09-24T15:03:26+00:00 |
Next Scan | 2024-10-08T15:03:26+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-24T15:03:26+00:00 |
URL | https://lidl.de/robots.txt |
Redirect | https://www.lidl.de/robots.txt |
Redirect Domain | www.lidl.de |
Redirect Base | lidl.de |
Domain IPs | 185.85.1.129, 2a02:cb40:200::13 |
Redirect IPs | 185.85.1.129, 2a02:cb40:200::13 |
Response IP | 185.85.1.129 |
Found | Yes |
Hash | 221324c16f525a4dff9554f2d3d3401f225aa00046efd9fdaa96f203bb52315a |
SimHash | 811adc018e90 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cc.js* |
Disallow | /cdn/assets/cwv/ |
Disallow | /user-api/* |
Disallow | /cqe/* |
Disallow | *search?q=* |
Disallow | *?offset=* |
Disallow | *idsOnly%3D* |
Disallow | *productsOnly%3D* |
Disallow | *id%3D* |
Disallow | *pageId%3D* |
Disallow | *advisor%3D* |
Disallow | *sort%3D* |
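
The paths in this group rely on `*` wildcards, which plain prefix matching does not handle. The following is a minimal sketch of how a crawler might test a URL path against these rules, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a literal prefix match. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are illustrative and not part of the scan; percent-encoding normalisation is deliberately left out.

```python
import re

# Disallow paths copied from the group table above.
DISALLOW = [
    "/cc.js*", "/cdn/assets/cwv/", "/user-api/*", "/cqe/*",
    "*search?q=*", "*?offset=*", "*idsOnly%3D*", "*productsOnly%3D*",
    "*id%3D*", "*pageId%3D*", "*advisor%3D*", "*sort%3D*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; all other characters are literal.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/", "/cc.js?v=1", "/q/search?q=milk", "/c/angebote?offset=12"]:
        print(candidate, is_disallowed(candidate))
```

Because the group contains no Allow rules, the sketch skips longest-match precedence; a fuller matcher would need it as soon as Allow and Disallow patterns overlap.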
Other Records
Field | Value |
---|---|
sitemap | https://www.lidl.de/static/sitemap.xml |
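
For readers who want to work with these records programmatically, here is a hedged sketch that rebuilds a robots.txt text from the group and sitemap records above and reads it with Python's standard `urllib.robotparser`. The report does not show the raw file, so the reconstructed layout (`ROBOTS_TXT`) is an assumption, and the names `RULES` and `SITEMAP` are illustrative. Note that `RobotFileParser` does plain prefix matching and does not interpret `*` wildcards, so only the non-wildcard rule is checked here.

```python
from urllib.robotparser import RobotFileParser

# Records copied from the scan; the file layout itself is an assumption.
RULES = [
    "/cc.js*", "/cdn/assets/cwv/", "/user-api/*", "/cqe/*",
    "*search?q=*", "*?offset=*", "*idsOnly%3D*", "*productsOnly%3D*",
    "*id%3D*", "*pageId%3D*", "*advisor%3D*", "*sort%3D*",
]
SITEMAP = "https://www.lidl.de/static/sitemap.xml"

ROBOTS_TXT = "\n".join(
    ["User-agent: *"]
    + [f"Disallow: {path}" for path in RULES]
    + [f"Sitemap: {SITEMAP}"]
)

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns the Sitemap URLs found, or None.
sitemaps = parser.site_maps()

# Prefix rule without wildcards, so the standard parser can evaluate it.
cwv_allowed = parser.can_fetch("*", "https://www.lidl.de/cdn/assets/cwv/x.js")
home_allowed = parser.can_fetch("*", "https://www.lidl.de/")

if __name__ == "__main__":
    print(sitemaps, cwv_allowed, home_allowed)
```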