curioctopus.de
robots.txt

Robots Exclusion Standard data for curioctopus.de

Resource Scan

Scan Details

Site Domain curioctopus.de
Base Domain curioctopus.de
Scan Status Ok
Last Scan 2024-11-09T10:33:35+00:00
Next Scan 2024-11-16T10:33:35+00:00

Last Scan

Scanned 2024-11-09T10:33:35+00:00
URL https://curioctopus.de/robots.txt
Redirect https://www.curioctopus.de/robots.txt
Redirect Domain www.curioctopus.de
Redirect Base curioctopus.de
Domain IPs 13.33.88.116, 13.33.88.47, 13.33.88.69, 13.33.88.8
Redirect IPs 13.33.88.116, 13.33.88.47, 13.33.88.69, 13.33.88.8, 2600:9000:2514:1000:17:b92f:8180:93a1, 2600:9000:2514:2600:17:b92f:8180:93a1, 2600:9000:2514:5800:17:b92f:8180:93a1, 2600:9000:2514:600:17:b92f:8180:93a1, 2600:9000:2514:6c00:17:b92f:8180:93a1, 2600:9000:2514:7e00:17:b92f:8180:93a1, 2600:9000:2514:800:17:b92f:8180:93a1, 2600:9000:2514:9400:17:b92f:8180:93a1
Response IP 13.33.88.47
Found Yes
Hash 615a2b394fbf2d279b099cc176e5acab2c28e9c3a6248bdc132395e073e56670
SimHash 6f9d1c50e6d8

Groups

*

Rule Path
Disallow /search

mauibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

linguee

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.curioctopus.de/sitemap.xml
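
The groups above can be checked with a small script. The following is a minimal sketch, not the site's actual file: the report lists groups and rules but not the raw robots.txt, so the file text, its ordering, and its casing are reconstructed assumptions. The names FULLY_BLOCKED, build_robots_txt, BASE, and the user agent "examplebot" are hypothetical. It uses Python's standard urllib.robotparser (site_maps() needs Python 3.8 or later), whose matching rules may differ from those of individual crawlers.

```python
from urllib.robotparser import RobotFileParser

# User agents that the report lists with a single "Disallow /" rule.
FULLY_BLOCKED = [
    "mauibot", "mj12bot", "domaincrawler", "domaincrawler/3.0",
    "linguee", "blexbot", "megaindex.ru", "megaindex.ru/2.0",
    "megaindex.com", "seekportbot",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the groups in the report.

    Assumption: the real file may differ in order, casing, and comments.
    """
    lines = ["User-agent: *", "Disallow: /search", ""]
    for agent in FULLY_BLOCKED:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # bingbot has no path rules, only a crawl-delay record.
    lines += ["User-agent: bingbot", "Crawl-delay: 5", ""]
    lines.append("Sitemap: https://www.curioctopus.de/sitemap.xml")
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

BASE = "https://www.curioctopus.de"

if __name__ == "__main__":
    # An unlisted crawler falls back to the "*" group.
    print(parser.can_fetch("examplebot", BASE + "/search"))
    print(parser.can_fetch("examplebot", BASE + "/"))
    # A fully blocked crawler.
    print(parser.can_fetch("mj12bot", BASE + "/"))
    # bingbot matches its own group, so the "*" rule does not apply to it.
    print(parser.can_fetch("bingbot", BASE + "/search"))
    print(parser.crawl_delay("bingbot"))
    print(parser.site_maps())
```

Because a crawler that matches a named group ignores the "*" group, bingbot is not bound by the /search rule here, which agrees with the report's "No rules defined. All paths allowed." entry.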