knowcat.cn
robots.txt

Robots Exclusion Standard data for knowcat.cn

Resource Scan

Scan Details

Site Domain knowcat.cn
Base Domain knowcat.cn
Scan Status Ok
Last Scan2025-12-17T01:19:43+00:00
Next Scan 2026-01-16T01:19:43+00:00

Last Scan

Scanned2025-12-17T01:19:43+00:00
URL http://knowcat.cn/robots.txt
Redirect http://www.knowcat.cn/robots.txt
Redirect Domain www.knowcat.cn
Redirect Base knowcat.cn
Domain IPs 47.101.205.49
Redirect IPs 47.101.205.49
Response IP 47.101.205.49
Found Yes
Hash 85329e9ced11e2241d5063fe8988e8042917e72029678952007965203fde25a1
SimHash 485ed7317913

Groups

*

Rule Path
Disallow /d/
Disallow /e/class/
Disallow /e/config/
Disallow /e/data/
Disallow /e/enews/
Disallow /e/update/

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

deusu

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

amazonaws

Rule Path
Disallow /

amazon

Rule Path
Disallow /

blexbot

Rule Path
Disallow /