wika.cn
robots.txt

Robots Exclusion Standard data for wika.cn

Archived Snapshots

Resource Scan

Scan Details

Site Domain	wika.cn
Base Domain	wika.cn
Scan Status	Ok
Last Scan	2024-07-01T07:25:21+00:00
Next Scan	2024-07-15T07:25:21+00:00

Last Scan

Scanned	2024-07-01T07:25:21+00:00
URL	https://wika.cn/robots.txt
Redirect	https://www.wika.cn/robots.txt
Redirect Domain	www.wika.cn
Redirect Base	wika.cn
Domain IPs	81.20.83.9
Redirect IPs	163.181.81.215
Response IP	163.181.201.222
Found	Yes
Hash	1351f5e0d0024c621e2df82a43b91bc1b251e4bd0ab83aafbb7862fe5d05dd18
SimHash	039d896ec670

Groups

*

Rule	Path
Allow	/
Disallow	/LivePersonTest*
Disallow	/defaultMultiSites.aspx
Disallow	/ScriptResource.axd
Disallow	/WebResource.axd
Disallow	/docs/
Disallow	/publish/
Disallow	/ClickTaleCache.ashx
Disallow	/apps/
Disallow	/ProductDetailHandler
Disallow	/webshop/
Disallow	/pro_1.asp
Disallow	/pro_detail.asp
Disallow	/newscontentgeneric.WIKA?AxID=462
Disallow	/newscontentgeneric.WIKA?AxID=467
Disallow	/newscontentgeneric.WIKA?AxID=469
Disallow	/newscontentgeneric.WIKA?AxID=473
Disallow	/newscontentgeneric.WIKA?AxID=474
Disallow	/newscontentgeneric.WIKA?AxID=475
Disallow	/newscontentgeneric.WIKA?AxID=470
Disallow	/newscontentgeneric.WIKA?AxID=471
Disallow	/newscontentgeneric.WIKA?AxID=472
Disallow	/newscontentgeneric.WIKA?AxID=468
Disallow	/newscontentgeneric.WIKA?AxID=452
Disallow	/newscontentgeneric.WIKA?AxID=457
Disallow	/newscontentgeneric.WIKA?AxID=458
Disallow	/newscontentgeneric.WIKA?AxID=459
Disallow	/newscontentgeneric.WIKA?AxID=460
Disallow	/newscontentgeneric.WIKA?AxID=461
Disallow	/newscontentgeneric.WIKA?AxID=462
Disallow	/newscontentgeneric.WIKA?AxID=463
Disallow	/newscontentgeneric.WIKA?AxID=466
Disallow	/newscontentgeneric.WIKA?AxID=453
Disallow	/newscontentgeneric.WIKA?AxID=454

Rule

Path

Allow

Disallow

/LivePersonTest*

Disallow

/defaultMultiSites.aspx

Disallow

/ScriptResource.axd

Disallow

/WebResource.axd

Disallow

/docs/

Disallow

/publish/

Disallow

/ClickTaleCache.ashx

Disallow

/apps/

Disallow

/ProductDetailHandler

Disallow

/webshop/

Disallow

/pro_1.asp

Disallow

/pro_detail.asp

Disallow

/newscontentgeneric.WIKA?AxID=462

Disallow

/newscontentgeneric.WIKA?AxID=467

Disallow

/newscontentgeneric.WIKA?AxID=469

Disallow

/newscontentgeneric.WIKA?AxID=473

Disallow

/newscontentgeneric.WIKA?AxID=474

Disallow

/newscontentgeneric.WIKA?AxID=475

Disallow

/newscontentgeneric.WIKA?AxID=470

Disallow

/newscontentgeneric.WIKA?AxID=471

Disallow

/newscontentgeneric.WIKA?AxID=472

Disallow

/newscontentgeneric.WIKA?AxID=468

Disallow

/newscontentgeneric.WIKA?AxID=452

Disallow

/newscontentgeneric.WIKA?AxID=457

Disallow

/newscontentgeneric.WIKA?AxID=458

Disallow

/newscontentgeneric.WIKA?AxID=459

Disallow

/newscontentgeneric.WIKA?AxID=460

Disallow

/newscontentgeneric.WIKA?AxID=461

Disallow

/newscontentgeneric.WIKA?AxID=462

Disallow

/newscontentgeneric.WIKA?AxID=463

Disallow

/newscontentgeneric.WIKA?AxID=466

Disallow

/newscontentgeneric.WIKA?AxID=453

Disallow

/newscontentgeneric.WIKA?AxID=454

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier v4.0

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier+v4.0

Rule	Path
Disallow	/

Rule

Path

Disallow

microsoft+office+protocol+discovery

Rule	Path
Disallow	/

Rule

Path

Disallow

infometrics-bot, http://www.infometrics.de

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

betabot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

boardreader

Rule	Path
Disallow	/

Rule

Path

Disallow

bpimagewalker

Rule	Path
Disallow	/

Rule

Path

Disallow

checkmarknetwork

Rule	Path
Disallow	/

Rule

Path

Disallow

clockwork data vault

Rule	Path
Disallow	/

Rule

Path

Disallow

coccoc

Rule	Path
Disallow	/

Rule

Path

Disallow

crazywebcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

domain re-animator

Rule	Path
Disallow	/

Rule

Path

Disallow

domainstatsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

easouspider

Rule	Path
Disallow	/

Rule

Path

Disallow

ezooms

Rule	Path
Disallow	/

Rule

Path

Disallow

findxbot

Rule	Path
Disallow	/

Rule

Path

Disallow

getintent

Rule	Path
Disallow	/

Rule

Path

Disallow

grammarly

Rule	Path
Disallow	/

Rule

Path

Disallow

guardcrwlr

Rule	Path
Disallow	/

Rule

Path

Disallow

hubspot links crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

istellabot

Rule	Path
Disallow	/

Rule

Path

Disallow

kocmohabt

Rule	Path
Disallow	/

Rule

Path

Disallow

linkscrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

linkspammer

Rule	Path
Disallow	/

Rule

Path

Disallow

linkwalker

Rule	Path
Disallow	/

Rule

Path

Disallow

ltx71

Rule	Path
Disallow	/

Rule

Path

Disallow

mail.ru_bot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

nettrack

Rule	Path
Disallow	/

Rule

Path

Disallow

okhttp

Rule	Path
Disallow	/

Rule

Path

Disallow

openlinkprofiler

Rule	Path
Disallow	/

Rule

Path

Disallow

paperlibot

Rule	Path
Disallow	/

Rule

Path

Disallow

pingdom

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

pu_in

Rule	Path
Disallow	/

Rule

Path

Disallow

pulsepoint

Rule	Path
Disallow	/

Rule

Path

Disallow

qwantify

Rule	Path
Disallow	/

Rule

Path

Disallow

rankvalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

salesintelligent

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

screaming frog seo spider

Rule	Path
Disallow	/

Rule

Path

Disallow

searchie

Rule

Path

Disallow

semrushbot

Rule

Path

Disallow

semrushbot-sa

Rule

Path

Disallow

seokicks-robot

Rule

Path

Disallow

socialrankiobot

Rule

Path

Disallow

spbot

Rule

Path

Disallow

surveybot

Rule

Path

Disallow

uptimebot

Rule

Path

Disallow

vegi bot

Rule

Path

Disallow

vericitecrawler

Rule

Path

Disallow

webmeup-crawler

Rule

Path

Disallow

weborama-fetcher

Rule

Path

Disallow

Other Records

Field

Value

sitemap

http://www.wika.cn/sitemap_www_wika_cn.xml

Comments

Reference to sitemap XML
Disallow LivePerson Chat Test application
Disallow General CMS structure not needed for crawlers
Disallow News entry of microsite "HLM"
Disallow special crawler making us problems
Disallow other crawlers made us CPU load
Ezooms and dotbot
User-agent: link checker
Disallow: /
User-agent: linkcheck
Disallow: /
User-agent: Link Sleuth
Disallow: /

Warnings

2 invalid lines.

wika.cnrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

webcopier

webcopier v4.0

webcopier+v4.0

microsoft+office+protocol+discovery

infometrics-bot, http://www.infometrics.de

ahrefsbot

betabot

blexbot

boardreader

bpimagewalker

checkmarknetwork

clockwork data vault

coccoc

crazywebcrawler

domain re-animator

domainstatsbot

dotbot

dotbot

easouspider

ezooms

findxbot

getintent

grammarly

guardcrwlr

hubspot links crawler

istellabot

kocmohabt

linkscrawler

linkdexbot

linkspammer

linkwalker

ltx71

mail.ru_bot

mj12bot

nettrack

okhttp

openlinkprofiler

paperlibot

pingdom

proximic

pu_in

pulsepoint

qwantify

rankvalbot

salesintelligent

scrapy

screaming frog seo spider

searchie

semrushbot

semrushbot-sa

seokicks-robot

socialrankiobot

spbot

surveybot

uptimebot

vegi bot

vericitecrawler

webmeup-crawler

weborama-fetcher

Other Records

Comments

Warnings

wika.cn
robots.txt