论文部分内容阅读
以社会标签在网络资源聚类中的作用为研究目标,筛选标注资源的社会标签作为特征项,采用K-means聚类算法对文本资源进行聚类,并在小规模测试集上得到较好效果。详细讨论基于社会标签的文本聚类中标签筛选、聚类方法等关键技术的实现过程。通过实验证明:基于社会标签的文本聚类是一种较传统关键词进行聚类更为有效的一种聚类方法,能够提高文本聚类的效果。
In this paper, we use the social tag as a social networking resource in clustering of network resources as the research objective, select the social tagging resource as the feature item, and use K-means clustering algorithm to cluster the textual resources, and get good results on the small-scale test set . This paper discusses in detail the implementation of key technologies such as tag filtering and clustering in text clustering based on social tags. Experiments show that the text tagging based on social tags is a more effective clustering method than the traditional keyword clustering and can improve the effect of text clustering.