simplywebshop.de
robots.txt
Robots Exclusion Standard data for simplywebshop.de
Resource Scan
Scan Details
Site Domain | simplywebshop.de |
Base Domain | simplywebshop.de |
Scan Status | Ok |
Last Scan | 2024-09-11T13:14:39+00:00 |
Next Scan | 2024-10-11T13:14:39+00:00 |
Last Scan
Scanned | 2024-09-11T13:14:39+00:00 |
URL | https://www.simplywebshop.de/robots.txt |
Domain IPs | 18.197.126.163, 3.127.120.187, 3.78.9.220 |
Response IP | 18.197.126.163 |
Found | Yes |
Hash | a5b6bbbb7a84e5ee49090da94fd7a143b3be54df62ab8f12767b1f64d0278004 |
SimHash | 285b4086a672 |
Groups
webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /dataPrivacy |
Disallow | /imprint |
Disallow | /apilogindenied |
Disallow | /autologout |
Disallow | /commonerror |
Disallow | /devdata |
Disallow | /permissiondenied |
Disallow | /nojs |
Disallow | /newerBrowser |
Disallow | /setIFrameSession |
Disallow | /health |
Disallow | /systemdata |
Disallow | /saveCookieChoise |
Disallow | /simulateWebsiteWithIframe |
Other Records
Field | Value |
---|---|
sitemap | https://www.simplywebshop.de/sitemap |