simplywebshop.de
robots.txt

Robots Exclusion Standard data for simplywebshop.de

Resource Scan

Scan Details

Site Domain simplywebshop.de
Base Domain simplywebshop.de
Scan Status Ok
Last Scan2024-09-11T13:14:39+00:00
Next Scan 2024-10-11T13:14:39+00:00

Last Scan

Scanned2024-09-11T13:14:39+00:00
URL https://www.simplywebshop.de/robots.txt
Domain IPs 18.197.126.163, 3.127.120.187, 3.78.9.220
Response IP 18.197.126.163
Found Yes
Hash a5b6bbbb7a84e5ee49090da94fd7a143b3be54df62ab8f12767b1f64d0278004
SimHash 285b4086a672

Groups

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

*

Rule Path
Disallow /dataPrivacy
Disallow /imprint
Disallow /apilogindenied
Disallow /autologout
Disallow /commonerror
Disallow /devdata
Disallow /permissiondenied
Disallow /nojs
Disallow /newerBrowser
Disallow /setIFrameSession
Disallow /health
Disallow /systemdata
Disallow /saveCookieChoise
Disallow /simulateWebsiteWithIframe

Other Records

Field Value
sitemap https://www.simplywebshop.de/sitemap