一种基于局部流形结构的无监督特征学习方法(英文)

来源 :自动化学报 | 被引量 : 0次 | 上传用户:lovefuture888
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Unsupervised feature selection is fundamental in statistical pattern recognition, and has drawn persistent attention in the past several decades. Recently, much work has shown that feature selection can be formulated as nonlinear dimensionality reduction with discrete constraints. This line of research emphasizes utilizing the manifold learning techniques, where feature selection and learning can be studied based on the manifold assumption in data distribution. Many existing feature selection methods such as Laplacian score, SPEC(spectrum decomposition of graph Laplacian), TR(trace ratio) criterion, MSFS(multi-cluster feature selection) and EVSC(eigenvalue sensitive criterion) apply the basic properties of graph Laplacian, and select the optimal feature subsets which best preserve the manifold structure defined on the graph Laplacian. In this paper, we propose a new feature selection perspective from locally linear embedding(LLE), which is another popular manifold learning method. The main difficulty of using LLE for feature selection is that its optimization involves quadratic programming and eigenvalue decomposition, both of which are continuous procedures and different from discrete feature selection. We prove that the LLE objective can be decomposed with respect to data dimensionalities in the subset selection problem, which also facilitates constructing better coordinates from data using the principal component analysis(PCA) technique. Based on these results, we propose a novel unsupervised feature selection algorithm,called locally linear selection(LLS), to select a feature subset representing the underlying data manifold. The local relationship among samples is computed from the LLE formulation, which is then used to estimate the contribution of each individual feature to the underlying manifold structure. These contributions, represented as LLS scores, are ranked and selected as the candidate solution to feature selection. We further develop a locally linear rotation-selection(LLRS) algorithm which extends LLS to identify the optimal coordinate subset from a new space. Experimental results on real-world datasets show that our method can be more effective than Laplacian eigenmap based feature selection methods. Unsupervised feature selection is fundamental in statistical pattern recognition, and has taken drawn persistent attention in the past several decades. Recently, much work has shown that feature selection can be formulated as nonlinear dimensionality reduction with discrete constraints. This line of research emphasizes utilizing the manifold learning Techniques, where feature selection and learning can be studied based on the manifold assumption in data distribution. Many existing feature selection methods such as Laplacian score, SPEC (spectrum decomposition of graph Laplacian), TR (trace ratio) criterion, MSFS feature selection) and EVSC (eigenvalue sensitive criterion) apply the basic properties of graph Laplacian, and select the optimal feature subsets which best preserve the manifold structure on the graph Laplacian. In this paper, we propose a new feature selection perspective from locally linear embedding (LLE), which is another popular manifold learning method main difficulty of using LLE for feature selection is that it simplifies structures quadratic programming and eigenvalue decomposition, both of which are continuous procedures and different from discrete feature selection. We prove that the LLE objective can be decomposed with respect to data dimensionalities in the subset selection based on data using the principal component analysis (PCA) technique. based on these results, we propose a novel unsupervised feature selection algorithm, called locally linear selection (LLS), to select a feature subset on the underlying data manifold. The local relationship among samples is computed from the LLE formulation, which is then used to estimate the contribution of each individual feature to the underlying manifold structure. These contributions, represented as LLS scores, are ranked and selected as the candidate solution to feature selection. We have develop a locally linearrotation-selection (LLRS) algorithm which extends LLS to identify the optimal coordinate subset from a new space. Experimental results on real-world datasets show that our method can be more effective than Laplacian eigenmap based feature selection methods.
其他文献
该论文内容如下;1)综述.2)以芴 酮,肼基二硫代酸甲脂和肼基二硫代酸苄脂为原 料合成了席夫碱配体芴酮缩肼基硫代酸甲脂HL和芴 酮给肼基硫代酸苄脂HL并对它们进行了红外,元
盐在水中的溶解是很普遍的物理化学过程,对这些过程机理,尤其是微观机理的研究,对化学反应,生物化学,大气化学乃至日常生活都具有重要意义。团簇是由几个至几百个原子或分子组成的
该文从豆科植物豆薯(Pachyrrhizus erosus)的种子内,经磷酸缓冲液粗提,SP-Sepharose Fast Flow柱,CM-Sepharose Fast Flow柱和Sephacryl S-200柱层析,得到了一种新的蛋白组分
共轭聚合物(CPs)以其优良的光捕获和光信号放大能力在生物识别、成像及治疗方面得到广泛应用,为生物大分子识别和检测提供了一种高效的均相分析平台。CPs一般通过静电力或疏水
学位
本文通过对荣华二采区10
期刊
该项目主要研究了一步法合成低分子量聚乳酸,以降低聚乳酸生产成本,并将其改性,将改性物与药物共混,评价药物释放情况,探讨控释农药、化肥的可降解高分子基材的合成方法.结果
在社会主义市场经济条件下 ,正确认识市场经济对党员行为的双重影响 ,正确认识处理发展市场经济与党员坚持党性原则、规范党员行为的关系 ,使党员在发展社会主义市场经济过程
邓小平坚持了辩证否定的思想方法,认为社会主义和资本主义是既相互联系又有本质区别的。社会主义可以利用资本主义,克服资本主义的消极影响,同时,社会主义也必将要代替资本主
本论文利用流变学和显微学方法研究了纳米填料对溶聚丁苯橡胶/低异丙烯基含量聚异戊二烯(SSBR/LPI)弹性体共混体系相行为的影响,探讨了剪切场下及剪切停止后,填料的含量和几何结