论文部分内容阅读
利用10种组蛋白修饰和1种组蛋白变体的ChIP-seq数据,统计了K562细胞系和GM12878细胞系中11种组蛋白修饰在癌基因启动子区域的分布.比较了两个细胞系中致癌基因和抑癌基因启动子区域的组蛋白修饰分布密度,并计算了11种组蛋白修饰与基因表达的相关性,分别在致癌基因和抑癌基因中构建了组蛋白修饰和基因表达的多元线性回归方程,从而发现K562细胞系中癌症发生过程中组蛋白修饰的调控特征.最后对K562细胞系中超级增强子调控的致癌基因和抑癌基因进行GO功能分析,研究这些基因的生物学功能,进一步找出K562细胞系中癌症发生的5个关键基因GATA1、GATA2、H3F3B、LYL1和SH_2B3.
The distribution of 11 histone modifications in the oncogene promoter region of K562 cell line and GM12878 cell line was calculated using ChIP-seq data of 10 histone modifications and 1 histone variant. Oncogenes and tumor suppressor gene promoter region of the histone modification distribution density, and calculated 11 kinds of histone modifications and gene expression correlation, respectively, in the oncogene and tumor suppressor genes were constructed histone modification and gene expression of multiple Linear regression equation to find out the regulatory features of histone modification during cancer development in K562 cell line.Finally, the GO function analysis of oncogenes and tumor suppressor genes regulated by super enhancer in K562 cell line was carried out to study the biological function of these genes , To further identify the five key genes GATA1, GATA2, H3F3B, LYL1 and SH_2B3 in cancer cell line K562.