shop.gregspizza.de
robots.txt

Robots Exclusion Standard data for shop.gregspizza.de

Resource Scan

Scan Details

Site Domain shop.gregspizza.de
Base Domain gregspizza.de
Scan Status Ok
Last Scan2024-09-15T13:18:55+00:00
Next Scan 2024-10-15T13:18:55+00:00

Last Scan

Scanned2024-09-15T13:18:55+00:00
URL https://shop.gregspizza.de/robots.txt
Domain IPs 13.226.2.128, 13.226.2.6, 13.226.2.9, 13.226.2.93, 2600:9000:2022:2a00:14:59b2:be80:93a1, 2600:9000:2022:5a00:14:59b2:be80:93a1, 2600:9000:2022:5e00:14:59b2:be80:93a1, 2600:9000:2022:7400:14:59b2:be80:93a1, 2600:9000:2022:7800:14:59b2:be80:93a1, 2600:9000:2022:b400:14:59b2:be80:93a1, 2600:9000:2022:e200:14:59b2:be80:93a1, 2600:9000:2022:ea00:14:59b2:be80:93a1
Response IP 3.164.206.104
Found Yes
Hash 27d5066ba622272fb3d1e89a5678e923c8e488c181690628f5cca0016c1cb7cd
SimHash 28196086e672

Groups

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

*

Rule Path
Disallow /dataPrivacy
Disallow /imprint
Disallow /apilogindenied
Disallow /autologout
Disallow /commonerror
Disallow /devdata
Disallow /permissiondenied
Disallow /nojs
Disallow /newerBrowser
Disallow /setIFrameSession
Disallow /health
Disallow /systemdata
Disallow /saveCookieChoise
Disallow /simulateWebsiteWithIframe

Other Records

Field Value
sitemap https://shop.gregspizza.de/sitemap