pizzabulls.simplywebshop.de
robots.txt

Robots Exclusion Standard data for pizzabulls.simplywebshop.de

Resource Scan

Scan Details

Site Domain pizzabulls.simplywebshop.de
Base Domain simplywebshop.de
Scan Status Ok
Last Scan2024-05-26T17:09:58+00:00
Next Scan 2024-06-25T17:09:58+00:00

Last Scan

Scanned2024-05-26T17:09:58+00:00
URL https://pizzabulls.simplywebshop.de/robots.txt
Domain IPs 2600:9000:2366:1600:13:ff2a:a500:93a1, 2600:9000:2366:1c00:13:ff2a:a500:93a1, 2600:9000:2366:3800:13:ff2a:a500:93a1, 2600:9000:2366:7000:13:ff2a:a500:93a1, 2600:9000:2366:8e00:13:ff2a:a500:93a1, 2600:9000:2366:d000:13:ff2a:a500:93a1, 2600:9000:2366:f400:13:ff2a:a500:93a1, 2600:9000:2366:fe00:13:ff2a:a500:93a1, 65.9.112.125, 65.9.112.18, 65.9.112.46, 65.9.112.72
Response IP 108.157.52.26
Found Yes
Hash 5154e2311aa20ba51af105852024d9ff2d01a719786be403ffabf5fdbbba561c
SimHash 281a4086e672

Groups

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

*

Rule Path
Disallow /dataPrivacy
Disallow /imprint
Disallow /apilogindenied
Disallow /autologout
Disallow /commonerror
Disallow /devdata
Disallow /permissiondenied
Disallow /nojs
Disallow /newerBrowser
Disallow /setIFrameSession
Disallow /health
Disallow /systemdata
Disallow /saveCookieChoise
Disallow /simulateWebsiteWithIframe

Other Records

Field Value
sitemap https://pizzabulls.simplywebshop.de/sitemap