dreamkala.com
robots.txt

Robots Exclusion Standard data for dreamkala.com

Resource Scan

Scan Details

Site Domain dreamkala.com
Base Domain dreamkala.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-09-29T13:48:43+00:00
Next Scan 2024-12-28T13:48:43+00:00

Last Successful Scan

Scanned2024-05-10T13:47:00+00:00
URL https://dreamkala.com/robots.txt
Domain IPs 93.115.150.51
Response IP 93.115.150.51
Found Yes
Hash fbe1b0aba4515255932741f19e16c70cd4dd7452d8134f3a7aac940bfe751247
SimHash ca0dc0e06e35

Groups

*

Rule Path
Allow /
Disallow /*%26limit
Disallow /*%26sort
Disallow /*?route=checkout%2F
Disallow /*?route=account%2F
Disallow /*?route=product%2Fsearch
Disallow /*?route=affiliate%2F
Disallow /admin/
Disallow /catalog/
Disallow /install/
Disallow /system/
Disallow /vqmod/
Allow /image/
Allow /image/data/

mj12bot
ahrefsbot
semrushbot
mauibot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

baidu

Rule Path
Disallow /