Comparison of Supervised Clustering Methods for the Analysis of DNA Microarray Expression Data

Source: 中国农业科学(英文版) (Agricultural Sciences in China, English edition)
Several typical supervised clustering methods, namely Gaussian mixture model-based supervised clustering (GMM), k-nearest-neighbor (KNN), binary support vector machines (SVMs), and multiclass support vector machines (MC-SVMs), were employed to classify computer-simulated data and two real microarray expression datasets. The numbers of false positives (FP), false negatives (FN), true positives (TP), and true negatives (TN), together with clustering accuracy and the Matthews correlation coefficient (MCC), were compared among these methods. The results are as follows: (1) In classifying thousands of gene expression data, the two GMM methods achieve the highest clustering accuracy and the smallest overall FP+FN error counts, under the assumption that the whole set of microarray data is a finite mixture of multivariate Gaussian distributions. Furthermore, when the number of training samples is very small, the GMM-II method is more accurate than the GMM-I method. (2) In general, the MC-SVMs are more robust and more practical: they are less sensitive to the curse of dimensionality, are second only to the GMM methods in clustering accuracy on thousands of gene expression data, and are more robust than the other techniques to a small number of high-dimensional gene expression samples. (3) Among the MC-SVMs, OVO and DAGSVM perform better on large sample sizes, whereas the five MC-SVM methods perform very similarly on moderate sample sizes. When sample sizes are small, OVR, WW, and CS yield better results. It is therefore recommended that at least two candidate methods, chosen on the basis of the features of the real data and the experimental conditions, be run and compared to obtain a better clustering result.
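The evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It uses the standard textbook definitions of accuracy and MCC for a two-class problem. The function name `binary_metrics`, the example counts, and the convention of returning an MCC of 0 when the denominator vanishes are assumptions made for illustration; they are not taken from the paper.

```python
import math


def binary_metrics(tp: int, tn: int, fp: int, fn: int) -> tuple[float, float]:
    """Return (accuracy, MCC) from binary confusion-matrix counts.

    Hypothetical helper for illustration; not the authors' code.
    """
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("confusion matrix is empty")

    # Accuracy: fraction of all samples that were classified correctly.
    accuracy = (tp + tn) / total

    # MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))

    # Assumption: report 0.0 when a row or column of the matrix is empty,
    # which is a common convention for the otherwise undefined case.
    mcc = numerator / denominator if denominator > 0 else 0.0
    return accuracy, mcc


if __name__ == "__main__":
    # Made-up counts, used only to show the call.
    acc, mcc = binary_metrics(tp=90, tn=80, fp=20, fn=10)
    print(f"accuracy={acc:.3f}, MCC={mcc:.3f}, FP+FN={20 + 10}")
```

Unlike accuracy, MCC uses all four counts symmetrically, so it stays informative when one class is much larger than the other.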