Gene association study with SVM, MLP and cross-validation for the diagnosis of diseases

来源 :自然科学进展(英文版) | 被引量 : 0次 | 上传用户:yellowuncle
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Gene association study is one of the major challenges of biochip technology both for gene diagnosis where only a gene subset is responsible for some diseases, and for the treatment of the curse of dimensionality which occurs especially in DNA microarray datasets where there are more than thousands of genes and only a few number of experiments (samples). This paper presents a gene selection method by training linear support vector machine (SVM)/nonlinear MLP (multilayer perceptron) classifiers and testing them with cross-validation for finding a gene subset which is optimal/suboptimal for the diagnosis of binary/multiple disease types. Genes are selected with linear SVM classifier for the diagnosis of each binary disease types pair and tested by leave-one-out cross-validation; then, genes in the gene subset initialized by the union of them are deleted one by one by removing the gene which brings the greatest decrease of the generalization power, for samples, on the gene subset after removal, where generalization is measured by training MLPs with leave-one-out and leave-four-out cross-validations. The proposed method was tested with experiments on real DNA microarray MIT data and NCI data. The result shows that it outperforms conventional SNR method in the separability of the data with expression levels on selected genes. For real DNA microarray MIT/NCI data, which is composed of 7129/2308 effective genes with only 72/64 labeled samples belonging to 2/4 disease classes, only 11/6 genes are selected to be diagnostic genes. The selected genes are tested by the classification of samples on these genes with SVM/MLP with leave-one-out/both leave-one-out and leave-four-out cross-validations. The result of no misclassification indicates that the selected genes can be really considered as diagnostic genes for the diagnosis of the corresponding diseases.
其他文献
The role of interleukin 25 (IL-25) in a number of human diseases still has not been extensively studied, here we attempt to evaluate the role of recombinant IL-
再生纸是以废纸做原料,将其打碎、去色制浆后再通过高科技手段,经过多种复杂工序加工生产出来的纸张.本文介绍了废纸造纸废水的水质水量,提出了经济可行的治理措施.
In previous research, chimerical BPI23-Fcγ1 gene which consisted of human bactericidal/permeability increasing protein (BPI) gene of encoding the functional N
This study investigated the changes of CD4+CD25+ regulatory T cells (Tregs) in peripheral blood of patients with hepatocellular carcinoma before and after trans
In this study, a recurrent massive phyllodes tumor of the breast was surgically removed and the grafting was used to repair the local skin defects. A 29-y femal
To investigate the role of platelet membrane glycoprotein (GP) Ib/Ⅸ/Ⅴ complex and its subunit GP Ibа in patients with hemorrhagic thrombopathy (HT), the expr
Killer immunoglobulin-like receptor (KIR) genes can regulate the activation of NK and T cells upon interaction with HLA class I molecules. Hepatitis B virus (HB
B cell activating factor belonging to TNF superfamily (BAFF) is a critical regulator of B cell maturation and survival. In this present study, the expression ch
The remodeling process of synapses and eurotransmitter receptors of facial nucleus were observed. Models were set up by facial-facial anastomosis in rat. At pos
近几年,保护性耕作在推广部门和技术人员的辛勤努力下得到的快速发展,取得了较好的效果.三门峡市在该技术推广中也产生了良好的社会、经济、生态效益.特就保护性耕作的全国层