Improvement and Parallelism of k-Means Clustering Algorithm

来源 :清华大学学报自然科学版(英文版) | 被引量 : 0次 | 上传用户:zilianyy
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
The k-means clustering algorithm is one of the most commonly used algorithms for clustering analysis. The traditional k-means algorithm is, however, inefficient while working on large numbers of data sets and improving the algorithm efficiency remains a problem. This paper focuses on the efficiency issues of cluster algorithms. A refined initial cluster centers method is designed to reduce the number of iterative procedures in the algorithm. A parallel k-means algorithm is also studied for the problem of the operation limitation of a single processor machine when given huge data sets. The analytical results demonstrate that these improvements can greatly enhance the efficiency of the k-means algorithm, i.e., allow the grouping of a large number of data sets more accurately and more quickly. The analysis has theoretical and practical importance for work on the improvement and parallelism of cluster algorithms.
其他文献
The preparation of calcium phosphate (CP) coating on alumina ceramics using electric pulse stimulating method has been investigated. The cup-shaped alumina cera
提出了一种基于连续小波变换的时频域滤波方法,用于频响函数估计前的信号预处理.采用Morlet小波构造一种FIR滤波器对信号滤波,不会引起相位失真.提出了一种改进的小波基以满
The power supply system of ion source for the Neutral Beam Injector (NBI) in the HT-7 superconducting tokamak is based on a single injector with one ion source
An idea is presented about the development of a data processing and analysis system for ICF experiments, which is based on an object oriented framework. The des
The cascade algorithm plays an important role in computer graphics and wavelet analysis. For any initial function φ0, a cascade sequence (φn)∞n=1 is construc
In this paper, the adsorption isotherms of two disperse dyes, C.I. Disperse Red 60 and C.I. Disperse orange 76,on two kinds of PU fibers at 90℃ were measured r
本文通过对荣华二采区10
Let E2+pp (resp.E2+p2) be a (2+p)-dimensional pseudo-Euclidean space with the index p (resp.2). The maximal spacelike or minimal timelike translation surfaces M
It is theoretically shown that excitonic Doppler-Rabi oscillations can occur in an organic slab moving along the axis of a high-Q cavity. Due to the √N enhance
Let G be a simple connected graph of order n ≥ 6. The third edge-connectivity of G is defined as the minimum cardinality over all the sets of edges, if any, wh