,An accelerated K-means clustering algorithm using selection and erasure rules

来源 :浙江大学学报(英文版)(C辑:计算机与电子) | 被引量 : 0次 | 上传用户:roytseng
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
The K-means method is a well-known clustering algorithm with an extensive range of applications,such as biological classification,disease analysis,data mining,and image compression.However,the plain K-means method is not fast when the number of clusters or the number of data points becomes large.A modified K-means algorithm was presented by Fahim et al.(2006).The modified algorithm produced clusters whose mean square error was very similar to that of the plain K-means,but the execution time was shorter.In this study,we try to further increase its speed.There are two rules in our method:a selection rule,used to acquire a good candidate as the initial center to be checked,and an erasure rule,used to delete one or many unqualified centers each time a specified condition is satisfied.Our clustering results are identical to those of Fahim et al.(2006).However,our method further cuts computation time when the number of clusters increases.The mathematical reasoning used in our design is included.
其他文献
棉花是世界重要的经济作物之一,也是主要的自然纤维作物。棉纤维是由棉花胚珠外珠被表皮层的单细胞发育而成,是纺织工业的原料,有重要的经济价值。湘杂棉2号是我国上世纪90年代选育的棉花杂交种,在棉花产量方面具有较高的杂种优势。亲本中12是高产、优质、抗病的棉花品种,荆8891是高产棉花品系。本研究是以本实验室利用以湘杂棉2号构建的重组自交系群体以及永久性F2群体,采用cDNA-AFLP技术构建的叶片转录
该文以华中农业大学玉米研究室合成的广基玉米基础群体WBMC、WLSC、LBMC、LLSC及组建它们的主体亲本群体巫溪14(W)、兰花早(L)、BSSS(B)、Lancaster(Lan)、黑黄九(M)、Suwan2
A new efficient parallel finite-difference time-domain (FDTD) meshing algorithm, based on the ray tracing technique, is proposed in this paper. This algorithm c