GMDH-Based Outlier Detection Model in Classification Problems

来源 :系统科学与复杂性学报(英文版) | 被引量 : 0次 | 上传用户:169
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In many practical classification problems,datasets would have a portion of outliers,which could greatly affect the performance of the constructed models.In order to address this issue,we apply the group method of data handing (GMDH) neural network in outlier detection.This study builds a GMDH-based outlier detection (GOD) model.This model first implements feature selection in the training set L using GMDH neural network.Then a new training set L’can be obtained by mapping the selected key feature subset.Next,a linear regression model can be constructed in the set L’by ordinary least squares estimation.Further,it eliminates a sample from the set L’randomly every time,and then rebuilds a linear regression model.Finally,outlier detection is realized by calculating Cook’s distance for each sample.Four different customer classification datasets are used to conduct experiments.Results show that GOD model can effectively eliminate outliers,and compared with the five existing outlier detection models,it generally performs significantly better.This indicates that eliminating outliers can effectively enhance classification accuracy of the trained classification model.
其他文献
在国外,房地产与股票、黄金一起,被列入三大投资热门。其中,房地产投资是一种比较稳健的理财方式。但是,房市犹如股市,云谲波诡,变幻莫测,稍有不慎,便会吃亏。因此,初涉房市者,要想稳操
本文简要介绍了消防员热应激问题的现状、热应激的评价指标、综合热应激指数在评价空军飞行员等人员热应激的应用,同时展开了综合热应激指数用以评价消防员热应激的讨论.本文
This paper presents a new algorithm for computing the topology of an algebraic space curve.Based on an efficient weak generic position-checking method and a met
认识孙雁是在她的公司,上海市延安西路的一幢写字楼的15楼。记者刚走进办公区就看到这样一幕场景:女孩子们嬉笑着,追着那名叫旺财的明星猪,旺财在办公桌间穿梭,碰桌子绊绳子
In computer algebra,it remains to be challenging to establish general computational the ories for determining the equivalence of indexed polynomials.In previous
本文阐述了电弧焊弧光所造成的危害种类及其严重程度,并围绕自动变光焊接面罩的基本结构、工作原理、性能指标、防护效果等,重点介绍了防治电弧焊弧光危害的这种专门利器.
分析产学研合作网络的成因有助于促进产学研合作,提高创新产出的效率.文章以2018年浙江省高校、研究机构和企业的专利数据构建产学研合作网络,使用指数随机图模型对整个网络
传统光学显微镜的分辨率受到波长和物镜数值孔径的限制,难以再有很大的提高.近年来,逐渐发展成熟的鬼成像技术为进一步提高显微成像系统的分辨率带来了新的方法,它通过对探测
春天的夜空,空气十分清爽,小雨过后顶破泥土的嫩芽就在星光的照耀下悄悄生长,第二天天亮后行走的路人因它而迷惑,怎么一夜之间都变得这么绿了? 月亮悄然飘落,离开了夜晚的城
食用变性淀粉是普通淀粉经化学或酶处理而成,原有性状,如水溶性、粘度及流变性等均发生变化,更适应食品工业不同工艺需要。变性淀粉不仅具有营养价值,可提供热能,还可作食品