论文部分内容阅读
大豆种子含油量高低和油脂合成途径密切相关,油脂合成途径复杂,涉及诸多蛋白和酶,为此对大豆油脂储存蛋白进行生物信息学分析。大豆全基因组数据下载于JGI数据库,生物数据库查询结合Perl程序处理获取大豆油脂储存基因和蛋白,在大豆基因组中确定1 264个与油脂合成相关的基因,其中23个基因与油脂储存有密切的联系。利用Protparam、SOPMA、Prot Comp、Signal P软件对23个基因的蛋白序列、蛋白基本理化性质及二级结构、亚细胞定位、信号肽等进行生物信息学分析。结果表明:23个油脂储存基因不均匀分布在12条染色体上;23个蛋白序列氨基酸数目为165~1 012个;等电点为5.90~10.03;外显子数目为5~16个;二级结构预测显示无规则卷曲和α-螺旋为主要构成成分;蛋白亚细胞定位主要位于内质网、质膜和胞外。用MEGA6软件内置的Clustal W程序对大豆中油脂储存基因的蛋白序列进行比对分析,采用邻接法(neighbor-joining,NJ)构建系统发育树,结果显示大豆油脂储存基因的亲缘关系和进化差异。
Soybean oil content and oil synthesis is closely related to the path of oil synthesis is complex, involving many proteins and enzymes, for soybean oil storage protein bioinformatics analysis. Soybean genome-wide data were downloaded from JGI database, biological database query combined with Perl program to obtain soybean oil storage genes and proteins, and 1 264 genes related to oil synthesis were identified in soybean genome. Among them, 23 genes were closely associated with lipid storage . The bioinformatics analysis of 23 protein sequences, basic physical and chemical properties and secondary structure, subcellular localization, signal peptide and so on were performed by using Protparam, SOPMA, Prot Comp and SignalP software. The results showed that the 23 lipid storage genes were unevenly distributed on 12 chromosomes; the number of amino acids in 23 protein sequences was 165 to 1 012; the isoelectric point was 5.90 to 10.03; the number of exons was 5 to 16; Structural prediction showed that random coil and α-helix were the main components. Protein subcellular localization was mainly located in the endoplasmic reticulum, plasma membrane and extracellular matrix. The protein sequence of soybean oil storage genes was analyzed by Clustal W program built in MEGA6 software. The phylogenetic tree was constructed by neighbor-joining (NJ), which showed the genetic relationship and evolution difference of soybean oil storage genes.