Squeezer: An Efficient Algorithm for Clustering Categorical Data

来源 :计算机科学技术学报 | 被引量 : 0次 | 上传用户:dexiaolu
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This paper presents a new efficient algorithm for clustering categorical data,Squeezer, which can produce high quality clustering results and at the same time deservegood scalability. The Squeezer algorithm reads each tuple t in sequence, either assigning tto an existing cluster (initially none), or creating t as a new cluster, which is determined bythe similarities between t and clusters. Due to its characteristics, the proposed algorithm isextremely suitable for clustering data streams, where given a sequence of points, the objective isto maintain consistently good clustering of the sequence so far, using a small amount of memoryand time. Outliers can also be handled efficiently and directly in Squeezer. Experimental resultson real-life and synthetic datasets verify the superiority of Squeezer.
其他文献
本文是2002年7月在长春召开的第八届全国量子化学学术会议上的大会发言。内容如下:(1)20世纪的化学取得了辉煌的成就,应该获得社会的认同。(2)20世纪发明了七大技术,第一是合成化学
本文通过对荣华二采区10
采用光外差-磁旋转-速度调制吸收光谱技术, 在可见光波段范围16800~17573 cm-1, 对N2+的A 2Πu-X 2Σ+g(12,6)、(11,5)、(7,2)带和B 2Σ+u-X 2Σ+g (1,5)带进行了测量和分析,
Concentration variations of suspended solids (SS), total phosphorus (TP), dissolved total phosphorus (DTP), dissolved reactive phosphorus (SRP), and algae avail
The copolymerization of DL-LA and 3-BMG was carried out in bulk with stannous 2-ethylhexanoate as the catalyst. A series of copolymers with pendant protected gr
We have calculated the orbital parameters for 90 stars in Chen et al. and updated the kinematic data for stars in Edvardsson et al. by using the accurate Hippar
Transesterification reaction of lactose with divinyladipate in pyridine catalyzed by an alkaline protease from Bacillus subtilis at 50(C for 3 days gave 6(-O-vi
The complex [La2(-ox){Cr(bipy)((μ-ox)(ox)}4(H2O)6]·12.3(H2O) (C58H68.6N8O54.3Cr4La)2 1 has been obtained from the reaction of La(Ⅲ) salt with [Cr(bipy)(ox)2]
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7
The interaction between poly(methymethacrylate) (PMMA) and poly(vinyl chloride) (PVC) has been studied in dilute urea solutions of dimethylformamide (DMF) at 28