Correction for Population Stratification in Random Forest Analysis

来源 :第六届全国生物信息学与系统生物学学术大会暨国际生物信息学前沿研讨会 | 被引量 : 0次 | 上传用户:die0410
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Background: Population structure (PS), including population stratification and admixture, is a significant confounder in genome-wide association studies (GWAS) as it may produce spurious associations.Random forest (RF) has been increasingly applied in GWAS data analysis because of its advantage in analyzing high dimensional genetic data.RF creates importance measures for SNPs which are helpful for feature selections.However, if population structure is not appropriately corrected, RF tends to give high importance to disease-unrelated SNPs with different frequencies of allele or genotype among subpopulations, leading to inaccurate results.
其他文献
Introduction: Preservation of important microorganisms as seed stocks is necessary for industrial and scientific applications.Freeze drying is a favorite technique for long term storage of microorgani
会议
The metagenomics is a field that directly analyzes species identification, community structure, ecological function and metabolism of microbes in their native environment.In general, 454pyrosequencing
Microbial information can collect a huge amount of data related to specialized studies as:morphology, physiology, taxonomy, growth, description, biosafety, etc.The design and implementation of a datab
Background: As an important epigenetic marker, context-specific DNA methylation plays a critical role in regulating gene transcription, thereby involved in many biological processes.However, previousl
Background: Currently in vogue, the main antitumor therapy is an antiangiogenic therapy, that widely thought could lead to deprivation of oxygen and nutrients in fast growing tumor cells.However, by d
Background: Over 100 genome-scale metabolic networks(GSMNs) have been published in recent years.Even for the same organism various metabolic models have been reconstructed by different research groups
Background: Transcription factors (TFs) play key roles in gene expression regulation, so do the transcription cofactors and chromatin remodeling factors.Identifying them are primary and crucial steps
Background: The identification and analysis of tissue specificity of genes and gene expressions have a direct and profound impact on further understanding of a wide array of problems of much significa
Background: Identification of differential expressed genes can expose disease disordered pathways.Currently, Most calculated methods only distinguish researches based on different disease types.Howeve
Inferring gene regulatory networks based on gene expression profiles is an import topic in computational systems biology.Though dozens methods has been proposed in the last decade, current algorithms