Cross Project Defect Prediction via Balanced Distribution Adaptation Based Transfer Learning

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:yueyue7373
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Defect prediction assists the rational allocation of testing resources by detecting the potentially defective software modules before releasing products. When a project has no historical labeled defect data, cross project defect prediction (CPDP) is an altative technique for this scenario. CPDP utilizes labeled defect data of an extal project to construct a classification model to predict the module labels of the current project. Transfer leing based CPDP methods are the current mainstream. In general, such methods aim to minimize the distribution differences between the data of the two projects. However, previous methods mainly focus on the marginal distribution difference but ignore the conditional distribution difference, which will lead to unsatisfactory performance. In this work, we use a novel balanced distribution adaptation (BDA) based transfer leing method to narrow this gap. BDA simultaneously considers the two kinds of distribution differences and adaptively assigns different weights to them. To evaluate the effectiveness of BDA for CPDP performance, we conduct experiments on 18 projects from four datasets using six indicators (i.e., F-measure, g-means, Balance, AUC, EARecall, and EAF-measure). Compared with 12 baseline methods, BDA achieves average improvements of 23.8%, 12.5%, 11.5%, 4.7%, 34.2%, and 33.7% in terms of the six indicators respectively over four datasets.
其他文献
目的探讨螺旋CT门静脉成像(SCTP)在肝脏疾病诊断中的应用价值.方法 81例疑有肝脏疾病的患者进行了SCTP检查,观察了肝脏疾病的SCTP表现及肝外门静脉侧枝循环情况;在重建横断面
对乌兰布和沙漠绿洲几种典型结构和配置的农田防护林带进行风速观测表明,其防风效益与林带高度、结构、疏透度等因子有关,稀疏型林带防风效果最佳,疏透型次之,通风型防风效果
基于森林经理学任务上的系统调控性、内容上的综合性及定位上的宏观指导性,阐述了森林经理是景观生态学在森林景观中的实现手段,景观生态学为森林经理工作提供景观水平的理论
护患关系是指护士与患者在医疗护理等活动中自然形成的一种帮助与被帮助的关系,不仅局限于护患之间而是一种多方面、多层次的专业性互动关系,同时还是一种治疗性关系.其中以
应用TDP(Thermal Dissipation Probe)技术对大青山古路板林场的30a生油松人工林树干液流以及不同林分密度下的树木蒸腾耗水规律进行了研究。结果表明:1)在生长季内,树干径向
植物种群的生态位宽度与种群之间的生态位相似性比例及生态位重叠度反映了森林的更新生态特征。采用定量分析方法对子午岭油松林主要种群的更新生态位进行分析研究,结果表明:
期刊
为有效预测预报小麦条锈病,采用跨平台动态网页语言JAVA和数据库管理系统MySQL,以近30年积累资料构建的模型为基础,结合专家经验,建立了陇东、陇南和关中地区小麦条锈病远程