Pan-genome: a new sight for bacterial genome dynamics

来源 :第五届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:xiatiandegushi1989
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  The reducing cost of DNA sequencing and the increasing number of bacterial genome sequences have provided a chance to study the profile and evolution of microbial genome from population sight.Pan-genome, the genetic repertoire of a given species, comprises three parts including core genome, dispensable genome and strains-specific genome.Core genome represents those genes shared by all strains, and these genes are mainly responsible for basic process of life and species major traits.Dispensable genome includes those genes shared by two or more, but not all strains, and strains-specific genome are genes shared by only one strains.In general, dispensable genes and strain-specific genes function in contribute to the species diversity and selective advantages, such as adaptation to different niche and antibiotic resistance or colonization of a new host, and sometimes strains-specific genes are related to HGT.Since the pan-genome was present in Streptococcus agalactiae genome analysis for the first time, pan-genome analysis was employed in more than forty-three bacterial species or genus.Currently, the key issue for pangenome analysis is how to select appropriate mathematical models to depict the bacterial pan-genome feature and detect homologs (gene or genome region) from multiple genome sequences.Many algorithms and pipelines were designed to identify orthologs genes and homologs regions from multiple species, such as inparanoid/multiparanoid, Mauve, PGAP and so on.Species pan-genome profile was indicated from the relationship between the strains number and new detected genes, strains specific genes and the whole gene pool size under various mathematical models.Another important aspect is to combine specific problems in biology with pan-genome analysis.During these years, pan-genome analysis has shed a new sight into the dynamic variation of bacterial genes composition during differentiation and adaption to different niche, and facilitated the identification of strain-specific features, such as virulence genes, resistance gene and other genes function in extreme environment and special metabolic pathway.So, pan-genome has rather significant consequences for the way to study bacterial evolution, adaption and population structure, and it would be also helpful for controlling epidemic disease, developing vaccine, and enhancing the effect of industrial microbiology .
其他文献
Background: Antifreeze proteins(AFPs), also known as thermal hysteresis proteins, are ice-binding proteins.AFPs can adsorb to ice crystal surface and inhibit the growth of ice crystals in solution.So
Background: MicroRNAs (miRNAs) are a set of short (19~24nt) non-coding RNAs that play significant roles as posttranscriptional regulators in animals and plants.The ab initio prediction methods show ex
Background: The research of protein thermostability is very important both in understanding the mechanism of protein unfolding and industrial application.There have been many strategies suggested to i
Background: Human complement receptor type 2 (CR2/CD21), a cell surface protein highly expressed on B cells and follicular dendritic cells, is a member of the regulators of complement activation prote
Background: MicroRNAs (miRNAs) are a class of small non-coding RNAs, which negatively regulate protein coding genes at the posttranscriptional level.These tiny regulators have been associated to almos
Background: T-cell acute lymphoblastic leukemia (T-ALL) is an aggressive hematological malignancy, understanding of its gene expression regulation and molecular mechanisms still remain elusive.MicroRN
Background: Profilin is involved in motility and invasion of apicomplexan (protozoan) parasites and is used for invading host cells.In 2005, mouse Toll-like receptor (TLR) 11 was found to initiate an
Background: As a member of the E6AP carboxy terminus(HECT) domain-containing family ofubiquitin E3 ligase, Nedd4 is known to be a unique E3 protein containing the overall structure which is highly con
会议
Background: Since various diseases and therapeutic approaches are correlated with protein subcellular localization, effective medical approaches require delivery of the drug to the appropriate subcell
Background: Machine learning methods are widely used in the field of bioinformatics, for example, to discover important genes for specific disease or phenotype, to classify proteins based on their str