PSVM: a preference-enhanced SVM model using preference data for classification

来源 :Science China(Information Sciences) | 被引量 : 0次 | 上传用户:leaffan1985
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Classification is an essential task in data mining, machine learning and pattern recognition areas.Conventional classification models focus on distinctive samples from different categories. There are fine-grained differences between data instances within a particular category. These differences form the preference information that is essential for human learning, and, in our view, could also be helpful for classification models. In this paper, we propose a preference-enhanced support vector machine(PSVM), that incorporates preference-pair data as a specific type of supplementary information into SVM. Additionally, we propose a two-layer heuristic sampling method to obtain effective preference-pairs, and an extended sequential minimal optimization(SMO)algorithm to fit PSVM. To evaluate our model, we use the task of knowledge base acceleration-cumulative citation recommendation(KBA-CCR) on the TREC-KBA-2012 dataset and seven other datasets from UCI,Stat Lib and mldata.org. The experimental results show that our proposed PSVM exhibits high performance with official evaluation metrics. Classification is an essential task in data mining, machine learning and pattern recognition areas. Conventional classification models focus on distinctive samples from different categories. There are fine-grained differences between data instances within a particular category. These differences form the preference information that is essential for human learning, and, in our view, could also be helpful for classification models. In this paper, we propose a preference-enhanced support vector machine (PSVM), that incorporates preference-pair data as a specific type of supplementary information into SVM . To, we use the task of knowledge base acceleration-cumulative citation recommendation (KBA-CCR) on the TREC-KBA-2012 dataset and seven other datasets from UCI, Stat Lib and mldata.org. The experiment al results show that our proposed PSVM exhibits high performance with official evaluation metrics.
其他文献
随着社会的不断进步,城市的快速发展,人口的增长和人们生活水平的提高,城市生活废弃物管理已成为主要的都市问题,是城市环境主要污染源之一。随着城市化的快速发展和人民生活水平
矿产资源是经济社会发展不可或缺的生活生产资料,是实施可持续发展战略的物质保障。我国虽已经初步建立了采矿权制度,但随着其制度内部各类问题的不断扩大,使我国矿产资源浪费及
水资源是人类赖以生存与发展的重要资源,是不可替代的资源。随着经济的不断发展,世界各国都面临着水资源短缺和水资源污染,如何保护水资源是当今社会所面临的一个重要课题。在我
有限责任合伙(我国称之为特殊普通合伙)企业,是20世纪90年代初才诞生一种新型企业形态。有限责任合伙最初起源于采取合伙形式执业的专业人士限制自身法律责任的现实需求,其鲜
学位
期刊
一转眼间,从事安监工作已经十多年了.这十余年间,有相当长的一段时间,我也被基层安监人员是不是“背锅侠”这一命题所困扰.甚至在不同时期,根据不同个案,尝试探讨并寻求过这
期刊
资源型城市在发展中普遍面临资源利用率低、环境污染严重和经济发展乏力的困难,对此学术界提出调整产业结构以促使资源型城市转型的研究。本文以西部资源型城市为研究对象,从法
离子膜应用性能受杂质影响明显,盐水中存有的多样性杂质如果未经有效处理,那么离子膜的使用时间得不到保证,使用性能也会相应降低,进而影响企业的经济效益和成产工序.本文首
以有限元数值计算方法 ,依据攀枝花朱家包包陡坡实验段工艺、结构参数 ,对坡度、车辆节数、行车速度、道床厚度、列车制动、防爬桩刚度等因素对爬行力的影响进行了分析比较 ,
<正>基层安监人员成为"背锅侠",是一件令人痛心的事情。2016年,丰城发电厂"11·24"特大责任事故,首当其冲被追究责任的刘某某、郭某某、胡某某,3人的身份都是安全监理,引发热