Improvement of K-Means Clustering Algorithm with Better Initial Centers Based on Variance of Dimensi

Source: 2015 National Conference on Theoretical Computer Science (2015全国理论计算机科学学术年会) | Citations: 0
  In this paper, a novel approach for initializing the clustering centers of the K-Means algorithm is presented. The method is based on the variance of each dimension, which is used as the key for a full permutation of the data. The result of this permutation, ordered by primary and secondary keys, is divided into k subsets to initialize the clustering centers. Four international datasets are used to test the effectiveness of the algorithm, which is also examined by numerical simulation. Experiments suggest that the initial clustering centers chosen by the proposed optimization method are very close to the centers reached at final convergence after clustering iteration. Compared with the traditional K-Means clustering algorithm, this algorithm makes the selection of initial clustering centers more rational and improves the accuracy of the clustering results, which are also more stable.
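  The initialization idea in the abstract can be illustrated with a minimal sketch. It is one possible reading of the abstract, not the authors' implementation: the "full permutation" is interpreted here as a lexicographic sort whose primary key is the highest-variance dimension and whose secondary keys follow in decreasing variance, and the mean of each of the k subsets is assumed to serve as an initial center (the abstract does not say how a center is derived from a subset). The function name `variance_sorted_init` is hypothetical.

```python
from statistics import fmean, pvariance


def variance_sorted_init(points, k):
    """Hypothetical helper: choose k initial centers via a variance-ordered sort.

    points: list of equal-length lists of floats; k: number of clusters.
    """
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    dims = range(len(points[0]))
    # Rank dimensions from highest to lowest variance.
    order = sorted(
        dims, key=lambda d: pvariance([p[d] for p in points]), reverse=True
    )
    # Sort points lexicographically: the highest-variance dimension is the
    # primary key, the next one the secondary key, and so on.
    ranked = sorted(points, key=lambda p: tuple(p[d] for d in order))
    # Split the sorted points into k contiguous, nearly equal subsets.
    n = len(ranked)
    bounds = [i * n // k for i in range(k + 1)]
    # Assumption (not stated in the abstract): the mean of each subset
    # is used as one initial center.
    return [
        [fmean([p[d] for p in ranked[lo:hi]]) for d in dims]
        for lo, hi in zip(bounds, bounds[1:])
    ]


if __name__ == "__main__":
    data = [[1.0, 0.0], [2.0, 0.0], [10.0, 1.0], [11.0, 1.0]]
    print(variance_sorted_init(data, 2))  # [[1.5, 0.0], [10.5, 1.0]]
```

  The returned centers would then be passed to an ordinary K-Means loop in place of random starting points. Because the sort is deterministic, repeated runs start from the same centers, which is consistent with the stability claim in the abstract.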