Computational identification of functional phosphorylation events from the phosphoproteomic data

来源 :第七届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:markhai
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Post-translational modifications (PTMs),such as phosphorylation,acetylation and ubiquitination,play essential roles in the regulation of almost all of biological processes and cellular signaling pathways.Recent progresses in phosphoproteomics have identified nearly 500,000 phosphorylation sites.However,how to efficiently retrieve useful information from flood of data is still a great challenge.To integrate both large-scale and small-scale experimental data,we developed a number of databases,such as dbPPT containing 82,175 phosphorylation sites (p-sites) in 31,012 proteins for 20 plant species (http://dbppt.biocuckoo.org/),dbPSP containing 7,391 p-sites in 3,750 prokaryotic proteins (http://dbpsp.biocuckoo.org/),and dbPAF containing 483,001 p-sites of 54,148 proteins for human,animals and fungi (http://dbpaf.biocuckoo.org/).To predict potentially site-specific kinase-substrate relations (ssKSRs) or kinase-specific p-sites,we further improved our algorithm of Group-based Prediction System (GPS),by using over 6,000 known kinase-specific p-sites as the training data set.To generate an accurate classification map of protein kinases,we also collected 1,855 known kinases and 347 known phosphatases from the literature,further identified 50,433 kinases and 11,296 phosphatases in 84 eukaryotic species,and constructed the database of EKPD (http://ekpd.biocuckoo.org/).Using the updated GPS algorithm and EKPD information,GPS 3.0 beta provides both online service and local packages at http://gps.biocuckoo.org/,and can predict ssKSRs for protein kinases of 84 eukaryotes,including 464 human kinases.In addition,to greatly reduce false positive predictions,we integrated protein-protein interactions between kinases and substrates and further developed the in vivo GPS (iGPS) tool for the prediction of ssKSRs.
其他文献
  We present a computational study of transcriptomic data of 6,000+ tissue samples of 14 cancer types,aiming to address the fundamental question: what may dri
会议
  Background: Single-cell RNA sequencing (scRNA-seq) is an emerging technology that enables high resolution detection of heterogeneities between cells.One imp
  Background: Over the past decade,owing to rapid advances in the next-generation sequencing (NGS) technology,the cost of DNA sequencing has been reduced by o
  Nematodes are considered major pests in agriculture.In recent years,overdose usage of chemical pesticides causes environmental damages and becomes a thread
  The Encyclopedia of DNA Elements (ENCODE) Consortium has generated hundreds of high throughput genomic datasets with the goal of cataloging functional eleme
  Gene fusions are often driving events in human cancers.The simultaneous availability of whole genome and RNA sequencing data for a large panel of cancer sam
  The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor,grey wo
  Gastric cancer (GC) is one of the most common malignancies and the one of the greatest threats to human health,accounting for over 10% of tumour-related mor
  Mitochondria are the important organelles that affect tumor initiation,growth,survival,and metastasis.The mitochondrial proteostasis was largely maintained
  DNA replication begins at replication origins in all three domains of life.In bacteria,replication typically initiates from a single replication origin (ori