Cross-project software defect prediction based on multi-source data sets

来源 :中国邮电高校学报(英文版) | 被引量 : 0次 | 上传用户:alwbgs
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Cross-project defect prediction (CPDP) uses one or more source projects to build a defect prediction model and applies the model to the target project.There is usually a big difference between the data distribution of the source project and the target project,which makes it difficult to construct an effective defect prediction model.In order to alleviate the problem of negative migration between the source project and the target project in CPDP,this paper proposes an integrated transfer adaptive boosting (TrAdaBoost) algorithm based on multi-source data sets(MSITrA).The algorithm uses an existing two-stage data filtering algorithm to obtain source project data related to the target project from multiple source items,and then uses the integrated TrAdaBoost algorithm proposed in the paper to build a CPDP model.The experimental results of Promise\'s 15 public data sets show that:1) The cross-project software defect prediction model proposed in this paper has better performance in all tested CPDP methods;2) In the within-project software defect prediction (WPDP) experiment,the proposed CPDP method has achieved the better experimental results than the tested WPDP method.
其他文献
At present,there is an urgent need for blockchain interoperability technology to realize interconnection between various blockchains,data communication and value transfer between blockchains,so as to break the\'value silo\'phenomenon of each blockchai
With the rapid development of vehicle-based applications,entertainment videos have gained popularity for passengers on public vehicles.Therefore,how to provide high quality video service for passengers in typical public transportation scenarios is an esse
Robust minimum class variance twin support vector machine (RMCV-TWSVM) presented previously gets better classification performance than the classical TWSVM.The RMCV-TWSVM introduces the class variance matrix of positive and negative samples into the const
Predicting user states in future and rendering visual feedbacks accordingly can effectively reduce the visual experienced delay in the tactile Internet (TI).However,most works omit the fact that different parts in an image may have distinct prediction req
Threshold proxy re-encryption(PRE)authorizes the data access right of data subject to multiple proxies,who authorize the right again to delegatee to accomplish the end-to-end data encryption process from storage to authorization.Based on threshold PRE alg
Aiming at the sensor faults of near-space hypersonic vehicles (NSHV),a fault identification method based on the extended state observer and kernel extreme learning machine (ESO-KELM) is proposed in this paper.The method is generated by a combination of th
Delegated proof-of-stake(DPOS)consensus mechanism is widely adopted in blockchain platforms,but problems exist in its current applications.In order to explore the security risks in the voting attack of the DPOS consensus mechanism,an extensive game model
Data island and information opacity are two major problems in collaborative administration.Blockchain has the potential to provide a trustable and transparent environment encouraging data sharing among administration members.However,the blockchain only st
从大豆油精炼生产工艺出发,考察γ-生育酚、磷、金属离子等影响大豆油回色的微量成分在精炼过程中的含量变化及其与辅料添加量、脱臭条件对储存期成品大豆油回色的影响.结果 表明:γ-生育酚在脱臭工段损失最大,建议脱臭温度在250℃以下,而辅料和汽提蒸汽压力的微调对其影响不大;水化磷脂基本在脱胶工段可以彻底脱除,酸炼脱胶主要脱除非水化磷脂,建议将磷含量控制的关键环节设置在脱胶工段,以降低脱色工段白土吸附除磷的压力和生产成本;金属离子基本可以在正常的脱酸、脱色工段利用皂脚和脱色剂吸附脱除;γ-生育酚损失率、脱臭油的磷
Existing algorithms of news recommendations lack in depth analysis of news texts and timeliness.To address these issues,an algorithm for news recommendations based on time factor and word embedding(TFWE)was proposed to improve the interpretability and pre