A Semi-Random Multiple Decision-Tree Algorithm for Mining Data Streams

Source: Journal of Computer Science and Technology (English edition)
Mining streaming data is an active research topic in data mining. When classifying data streams, traditional decision-tree algorithms such as ID3 and C4.5 are relatively inefficient in both time and space because of the characteristics of streaming data. Random decision trees offer advantages in both respects. This paper proposes SRMTDS (Semi-Random Multiple decision Trees for Data Streams), an incremental algorithm for mining data streams that is based on random decision trees. SRMTDS uses the Hoeffding bound inequality to choose the minimum number of split examples, a heuristic method that computes information gain to obtain the split thresholds of numerical attributes, and a Naive Bayes classifier to estimate the class labels at tree leaves. An extensive experimental study shows that SRMTDS outperforms VFDTc, a state-of-the-art decision-tree algorithm for classifying data streams, in time, space, accuracy, and noise tolerance.
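The abstract does not spell out how SRMTDS applies the Hoeffding bound, so the following is only a minimal sketch of the standard inequality as it is commonly used in stream decision trees. For a random variable with range R, after n independent observations the true mean lies within ε = sqrt(R² ln(1/δ) / (2n)) of the observed mean with probability at least 1 − δ. Solving for n gives the smallest number of examples that guarantees a chosen ε. The function names and the parameter values in the usage lines are illustrative assumptions, not taken from the paper.

```python
import math


def hoeffding_epsilon(value_range: float, delta: float, n: int) -> float:
    """Hoeffding bound: deviation epsilon that holds with probability 1 - delta.

    value_range is the range R of the observed quantity, and n is the
    number of independent observations seen so far.
    """
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def min_split_examples(value_range: float, delta: float, epsilon: float) -> int:
    """Smallest n for which the Hoeffding bound is no larger than epsilon.

    Obtained by solving epsilon = sqrt(R^2 ln(1/delta) / (2n)) for n.
    Hypothetical helper name; the paper's own procedure may differ.
    """
    n = value_range ** 2 * math.log(1.0 / delta) / (2.0 * epsilon ** 2)
    return math.ceil(n)


# Illustrative values only (assumptions, not from the paper):
# a quantity with range R = 1, confidence 1 - 1e-7, tolerance 0.05.
n_min = min_split_examples(value_range=1.0, delta=1e-7, epsilon=0.05)
eps_at_n_min = hoeffding_epsilon(value_range=1.0, delta=1e-7, n=n_min)
```

Tightening ε or lowering δ raises the required number of examples, which is the trade-off a stream learner faces when deciding how long to wait before splitting a node.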