论文部分内容阅读
The significant advances in microarray and proteomics analyses have resulted in an exponential increase in potential new targets and have promised to shed light on the identification of disease markers and cellular pathways.Here,we build an integrative platform, the encyclopedia of hepatocellular carcinoma genes online, dubbed EHCO (http://bioagent.iis.sinica.edu.tw/EHCO2), to systematically collect, organize and compare the pileup of unsorted HCC-related studies by using natural language processing and softbots.Among the 13 gene set collections, ranging across PubMed, SAGE, microarray, and proteomics data, there are ~4,000 genes; however, more than 65% genes are only included once, suggesting that tremendous efforts need to be exerted to characterize the relationship between HCC and these genes.Of these HCC inventories, protein binding represents the largest proportion (~50%) from Gene Ontology analysis.To further highlight the potential new targets in the inferred network from EHCO, we combine comparative genomics and interactomics approaches to analyze 120 evolutionary conserved and overexpressed genes in HCC.47 out of 120 queries can form a highly interactive network with 18 queries serving as hubs.Of these hubs, we demonstrate the protein interaction networks center on PIN 1.Knockdown PIN1 reduces the growth rate of the HCC cell line.This architectural map may represent the first step toward the attempt to decipher the hepatocarcinogenesis at the systems level.Targeting hubs and/or disruption of the network formation might reveal novel strategy for HCC treatment.