wika.cn
robots.txt

Robots Exclusion Standard data for wika.cn

Resource Scan

Scan Details

Site Domain wika.cn
Base Domain wika.cn
Scan Status Ok
Last Scan 2024-07-01T07:25:21+00:00
Next Scan 2024-07-15T07:25:21+00:00

Last Scan

Scanned 2024-07-01T07:25:21+00:00
URL https://wika.cn/robots.txt
Redirect https://www.wika.cn/robots.txt
Redirect Domain www.wika.cn
Redirect Base wika.cn
Domain IPs 81.20.83.9
Redirect IPs 163.181.81.215
Response IP 163.181.201.222
Found Yes
Hash 1351f5e0d0024c621e2df82a43b91bc1b251e4bd0ab83aafbb7862fe5d05dd18
SimHash 039d896ec670

Groups

*

Rule Path
Allow /
Disallow /LivePersonTest*
Disallow /defaultMultiSites.aspx
Disallow /ScriptResource.axd
Disallow /WebResource.axd
Disallow /docs/
Disallow /publish/
Disallow /ClickTaleCache.ashx
Disallow /apps/
Disallow /ProductDetailHandler
Disallow /webshop/
Disallow /pro_1.asp
Disallow /pro_detail.asp
Disallow /newscontentgeneric.WIKA?AxID=462
Disallow /newscontentgeneric.WIKA?AxID=467
Disallow /newscontentgeneric.WIKA?AxID=469
Disallow /newscontentgeneric.WIKA?AxID=473
Disallow /newscontentgeneric.WIKA?AxID=474
Disallow /newscontentgeneric.WIKA?AxID=475
Disallow /newscontentgeneric.WIKA?AxID=470
Disallow /newscontentgeneric.WIKA?AxID=471
Disallow /newscontentgeneric.WIKA?AxID=472
Disallow /newscontentgeneric.WIKA?AxID=468
Disallow /newscontentgeneric.WIKA?AxID=452
Disallow /newscontentgeneric.WIKA?AxID=457
Disallow /newscontentgeneric.WIKA?AxID=458
Disallow /newscontentgeneric.WIKA?AxID=459
Disallow /newscontentgeneric.WIKA?AxID=460
Disallow /newscontentgeneric.WIKA?AxID=461
Disallow /newscontentgeneric.WIKA?AxID=462
Disallow /newscontentgeneric.WIKA?AxID=463
Disallow /newscontentgeneric.WIKA?AxID=466
Disallow /newscontentgeneric.WIKA?AxID=453
Disallow /newscontentgeneric.WIKA?AxID=454
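
Because the "*" group combines a blanket "Allow /" with many narrower Disallow rules, the outcome for a given path depends on rule precedence. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex, and is_allowed are illustrative, and RULES holds only a subset of the rules listed above.

```python
import re

# Subset of the "*" group rules from this report (illustrative, not the full list).
RULES = [
    ("allow", "/"),
    ("disallow", "/LivePersonTest*"),
    ("disallow", "/docs/"),
    ("disallow", "/webshop/"),
    ("disallow", "/newscontentgeneric.WIKA?AxID=462"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # A longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/en/products.WIKA", "/docs/manual.pdf", "/LivePersonTestPage.aspx"]:
        print(p, is_allowed(p))
```

Under this reading, "Allow /" acts only as a default: any path that also matches a longer Disallow pattern stays blocked. Parsers that apply the first matching rule instead would treat the same file differently.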

webcopier

Rule Path
Disallow /

webcopier v4.0

Rule Path
Disallow /

webcopier+v4.0

Rule Path
Disallow /

microsoft+office+protocol+discovery

Rule Path
Disallow /

infometrics-bot, http://www.infometrics.de

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

betabot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

bpimagewalker

Rule Path
Disallow /

checkmarknetwork

Rule Path
Disallow /

clockwork data vault

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

crazywebcrawler

Rule Path
Disallow /

domain re-animator

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

findxbot

Rule Path
Disallow /

getintent

Rule Path
Disallow /

grammarly

Rule Path
Disallow /

guardcrwlr

Rule Path
Disallow /

hubspot links crawler

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

kocmohabt

Rule Path
Disallow /

linkscrawler

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

linkspammer

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

nettrack

Rule Path
Disallow /

okhttp

Rule Path
Disallow /

openlinkprofiler

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

pingdom

Rule Path
Disallow /

proximic

Rule Path
Disallow /

pu_in

Rule Path
Disallow /

pulsepoint

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

rankvalbot

Rule Path
Disallow /

salesintelligent

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

searchie

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

socialrankiobot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

vericitecrawler

Rule Path
Disallow /

webmeup-crawler

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /
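
The groups above mean that a named crawler is governed by its own group, while any crawler without one falls back to "*". A minimal sketch of that selection, assuming case-insensitive exact matching on the crawler's product token; GROUPS is a hypothetical, abbreviated model of this report and select_group is an illustrative name.

```python
# Hypothetical, abbreviated model of the groups in this report:
# each key is a lower-cased user-agent token, each value its rule list.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/docs/")],
    "ahrefsbot": [("disallow", "/")],
    "semrushbot": [("disallow", "/")],
    "mj12bot": [("disallow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Return the rule list for a crawler's product token.

    The token is compared case-insensitively; crawlers with no
    dedicated group use the "*" group.
    """
    return groups.get(product_token.strip().lower(), groups["*"])


if __name__ == "__main__":
    print(select_group("AhrefsBot"))   # dedicated group: blocked site-wide
    print(select_group("Googlebot"))   # no dedicated group: falls back to "*"
```

A crawler that matches a dedicated group does not also inherit the "*" rules; only the selected group applies.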

Other Records

Field Value
sitemap http://www.wika.cn/sitemap_www_wika_cn.xml

Comments

  • Reference to sitemap XML
  • Disallow LivePerson Chat Test application
  • Disallow General CMS structure not needed for crawlers
  • Disallow News entry of microsite "HLM"
  • Disallow special crawler making us problems
  • Disallow other crawlers made us CPU load
  • Ezooms and dotbot
  • User-agent: link checker
  • Disallow: /
  • User-agent: linkcheck
  • Disallow: /
  • User-agent: Link Sleuth
  • Disallow: /

Warnings

  • 2 invalid lines.