Comparison of dimension reduction-based logistic regression models for case-control genome-wide asso

来源 :The Journal of Biomedical Research | 被引量 : 0次 | 上传用户:jiangcongzhi
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
With recent advances in biotechnology, genome-wide association study(GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression(LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression(PC-LR), partial least squares-based logistic regression(PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor?mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism(SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis.On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data. With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single- locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR ), partial least squares-based logistic regression (PLS-LR), have recently much attention in the analysis of high dimensional genomic data. However, the perfor? mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reaso In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR , especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.
其他文献
通过实验对三维电极中粒子电极进行了研究 .结果表明 ,三维电极中粒子电极的用量、粒子电极的粒径大小、粒子电极导电性等对三维电极降解废水的COD效率有很大的影响 .此研究
2003年,我们的产品进入了家乐福、沃尔玛这些大卖场,4月份我们开始生产采诗面膜,当月就成为国内面膜销量的第一位,可以说采诗的成功有目共睹。但是当年10月23号晚上的突发事
患儿伟伟,男,3岁半其母代诉:患儿长期便秘,每日必须使用“开塞露”方能排便。小儿足月顺产、饮食、睡眠、小便、发育、体格检查均无异常。按小儿推拿常规方法治疗5次无效。停诊一周
1991年,侯耀华开始从事直流电源开关柜的开发、研 制工作。实践出真知,在这个广阔的领域,一次次的风雨冲刷,使侯耀华由一只展翅欲飞的雏鹰成长为历经风雨的雄鹰。1998年底,上
建设创业文化,是日益引起人们关注的重大课题。创业文化的薄弱,是制约东北三省经济发展的重要因素。建设创业文化,推动全民创业,对于促进东北老工业基地的振兴与发展,具有重
针对双层客车司机控制台工效学布局问题,采用层次聚类分析中组间平均联接法对控制台60个显控器件以重要程度、使用频率为计算变量进行分类且输出凝聚顺序表及分类谱系图体现
用动态疲劳试验法研究了3Y-TZP和3Y-TZP/Al_2O_3(20wt%)陶瓷在空气中的室温动态疲劳,并讨论了疲劳慢裂纹扩展特性。另外利用动态疲劳数据对两种陶瓷的平均寿命进行了预测。两种陶瓷材料在室温下均存在着
该文在分析了目前使用的旋转机械故障的模糊诊断和诊断专家系统存在不足的基础上,提出了一种故障模糊诊断的层次结构模型。该模型是运用领域专家的浅知识,根据各种故障征兆反映
温度、湿度、光照及储存时间等会对药品储存产生一定影响,本文从影响药品储存的因素入手,分析了药品储存过程中应注意的问题,并提出解决方法。 Temperature, humidity, ligh
2007年7月,国家食品药品监督管理局公布了新的《药品注册管理办法》,该办法将于2007年10月1日正式施行。众所周知,新的《药品注册管 In July 2007, the State Food and Drug