One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning

Source: Frontiers of Information Technology & Electronic Engineering (English edition)
Traditional machine learning methods are sensitive to skewed class distributions and do not account for the characteristics of multiclass imbalance problems, so skewed multiclass data poses a major challenge to learning algorithms. To tackle this issue, we propose a new decision tree splitting criterion based on the one-against-all-based Hellinger distance (OAHD). OAHD has two key elements. First, the one-against-all scheme is integrated into the computation of the Hellinger distance, thereby extending the Hellinger distance decision tree to the multiclass imbalance problem. Second, a modified Gini index is designed that takes into account the distribution and the number of distinct classes. We also give theoretical proofs of the properties of OAHD, including skew insensitivity and the ability to seek a purer node in the decision tree. Finally, we collect 20 public real-world imbalanced data sets from the Knowledge Extraction based on Evolutionary Learning (KEEL) repository and the University of California, Irvine (UCI) repository. Experimental results show that OAHD significantly outperforms five other well-known decision trees in terms of Precision, F-measure, and multiclass area under the receiver operating characteristic curve (MAUC), and the Friedman and Nemenyi tests support the advantage of OAHD over these five decision trees.
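The abstract does not give the OAHD formula, so the following is only a rough sketch of the general idea, not the authors' criterion. It uses the standard two-class Hellinger distance of a split (as in Hellinger distance decision trees) and applies it once per class in a one-against-all fashion. The function names and the choice to average the per-class distances are assumptions made for illustration, and the modified Gini index is not modeled.

```python
import math
from collections import Counter


def hellinger_binary(labels, branches, positive):
    """Hellinger distance of a split, treating `positive` vs. all other classes.

    labels   -- class label of each sample
    branches -- branch id each sample falls into under the candidate split
    """
    n_pos = sum(1 for y in labels if y == positive)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        return 0.0  # only one side present: the split cannot separate anything

    pos_in, neg_in = Counter(), Counter()
    for y, b in zip(labels, branches):
        if y == positive:
            pos_in[b] += 1
        else:
            neg_in[b] += 1

    # Within-class branch fractions make the measure independent of class priors.
    total = 0.0
    for b in set(branches):
        diff = math.sqrt(pos_in[b] / n_pos) - math.sqrt(neg_in[b] / n_neg)
        total += diff * diff
    return math.sqrt(total)


def one_against_all_hellinger(labels, branches):
    """Average the binary distance over every class-vs-rest decomposition.

    Averaging is an assumption of this sketch; the paper may aggregate differently.
    """
    classes = sorted(set(labels))
    return sum(hellinger_binary(labels, branches, c) for c in classes) / len(classes)


# Replicating one class leaves the score unchanged (skew insensitivity).
labels = [0, 0, 1, 1]
branches = ["L", "R", "R", "R"]
skewed_labels = [0, 0] + [1, 1] * 5
skewed_branches = ["L", "R"] + ["R", "R"] * 5
balanced_score = one_against_all_hellinger(labels, branches)
skewed_score = one_against_all_hellinger(skewed_labels, skewed_branches)
```

Because each term uses the fraction of a class that lands in a branch rather than raw counts, `balanced_score` and `skewed_score` agree up to floating-point error, which illustrates the skew insensitivity mentioned in the abstract.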