分类分析中基于信息论准则的特征选取

来源 :自动化学报 | 被引量 : 0次 | 上传用户:baijiw
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Feature selection aims to reduce the dimensionality of patts for classificatory analysis by selecting the most informative instead of irrelevant and/or redundant features. In this study, two novel information-theoretic measures for feature ranking are presented: one is an improved formula to estimate the conditional mutual information between the candidate feature fi and the target class C given the subset of selected features S, i. e., I(C; fi|S), under the assumption that information of features is distributed uniformly; the other is a mutual information (MI) based constructive criterion that is able to capture both irrelevant and redundant input features under arbitrary distributions of information of features. With these two measures, two new feature selection algorithms,called the quadratic MI-based feature selection (QMIFS) approach and the MI-based constructive criterion (MICC) approach,respectively, are proposed, in which no parameters like β in Battitis MIFS and (Kwak and Choi)s MIFS-U methods need to be preset. Thus, the intractable problem of how to choose an appropriate value for β to do the tradeoff between the relevance to the target classes and the redundancy with the already-selected features is avoided completely. Experimental results demonstrate the good performances of QMIFS and MICC on both synthetic and benchmark data sets.
其他文献
提出了一个利用耦合双原子同时与大失谐的双光子Jaynes-Cummings模相互作用实现量子信息转移的方案.通过控制原子与腔场的相互作用时间及量子位的旋转操作角,可以实现原子与
西藏吉如斑岩铜矿位于冈底斯斑岩铜矿带的中段.锆石SHRIMP U-Pb和辉钼矿Re-Os年代学研究表明,与成矿相关的黑云母二长花岗岩成岩年龄为48.68±0.49Ma,成矿事件发生在48.30~50.
基于微观sdIBM-2方案和实验单粒子能量值,在最普遍的哈密顿量下,用两组不同的核子.核子等效相互作用参数,分别很好地再现了102Ru核的振动带能谱和转动带能谱及其演化过程.微
使用气相沉积SiO2和普通光刻以及湿法腐蚀方法,在c面蓝宝石上开出不同尺寸的正方形窗口,在窗口区域中露出衬底,然后使用氢化物气相外延(HVPE)方法选区外延GaN薄膜.采用光学显
建立了电磁驱动平面飞片的一维磁流体力学模型,考虑了焦耳加热的影响,并对Sandia实验室Z装置上开展的一个实验进行了模拟计算,与实验结果的比较表明,计算给出的样品自由面速
采用分子静力学结合量子修正Sutten-Chen型多体势研究了Ni单晶体在受单向拉伸和压缩载荷作用下的弹性响应.考虑了三种加载方式,即[001],[011]和[111]单向加载.模拟的结果表明
采用强度调制光电流谱(IMPS)和强度调制光电压谱(IMVS)研究电池内部电子传输机理和电子背反应动力学特性.利用理论表达式对不同TiO2多孔膜厚度(d)的电池实验数据进行了拟合,
理论上分析并从实验上验证了一种利用均匀相位掩模板写入啁啾光纤光栅的方法:将光纤弯曲,由于光纤离掩模板的距离不同从而使光纤光栅的周期轴向渐变,由此产生啁啾。分析了这
以1-苯基-3-甲基-5-苯氧基-吡唑-4-甲醛和1-苯基-3-甲基-5-对甲苯氧基-吡唑-4-甲醛为原料,在碱性条件下与苯乙酮(取代苯乙酮)发生羟醛缩合,合成出10种新型含吡唑基的查尔酮;
采用李代数方法研究线性三原子分子在强红外激光场中的多光子激发及其控制,实现了态选择激发,并讨论了激光脉冲对控制的影响. Using the Lie algebra method, we study the