General trends of protein expression and localization reveal the rule of bio-complexity realization

来源 :第五届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:jiwei5520
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Background: During evolution, proteins containing newly emerged domains and the increasing proportion of multi-domain proteins in the full Genome-Encoded Proteome (GEP) have substantially contributed to increasing biological complexity.However, it is not known how these two potential structural factors are preferentially utilized at given physiological states, until our most recent study found that multidomain proteins or only older domain-containing proteins are significantly overrepresented in certain-state proteomes (CSPs, i.e., all the proteins expressed at certain physiological states) compared with GEP, which indicates biological complexity under certain conditions i3 more significantly realized by diverse domain organization than by the emergence of new types of domain.To further explore the regular utilization pattern in the view of protein spatial distribution at levels ranging from organelles to organs, we investigated the relationships between protein domain characters and protein subcelluar localization width (SLW) or tissue specificity (TS).Methods: To calculate SLW, we collected high confident subcellular location information in Swiss-Prot, GOA, human protein atlas (HPA), MitoP2, Human Liver Organelle Proteome (HLOP) and related references and references.Domain identification is based on RPS-BLAST against Pfam and SMART (E<0.001).TS is represented by the number of OTCs (organ, tissue or cell types) in which the gene is expressed.Results: The integrated atlas contains subcellular localization information with high confidence of more than 13000 human proteins, including the subcellular locations for different splice isoforms as complete as possible.Based on this atlas, we found that SLW positively correlates with DN value, while it negatively correlates with DA value, that is, multi-domain proteins or only older domain-containing proteins tend to be localized at more subcellular components.In addition, SLW negatively correlate with TS, i.e.proteins with more subcellular locations tend to be expressed widely across OTCs.The above findings are consistent with our previous report that multi-domain proteins or only older domain-containing proteins tend to be expressed widely across OTCs.Conclusions: At given physiological states, biological complexity is more dependent on the diverse domain organization than the new types of domain.The relationships between DN, DA and SLW, TS confirm this conclusion in the view of spatial distribution of proteins,and reveal the consistency of complexity of protein molecular function and the functional requirement at cell or organism levels .
其他文献
  Background: Protein phosphorylation is one of the pervasive and most important protein posttranslational modifications, which regulates the dynamic behavior
  Background: Module (community) structure is a common and important property of many types of networks such as social networks and biological networks.Severa
  Motivation: Genetic and pharmacological perturbations are powerful systems biology tools to study cellular signal transduction pathways.Here, we report a fr
会议
会议
会议
  Background: The ultra intercellular heterogeneity in tumor is one major causes for the failure of cancer therapy, e.g.drug resistance and/or cancer relapse.
会议
会议
  Background: Small insertions and deletions (INDELs) compose of the second largest category of genetic variants (next to single nucleotide polymorphism) in t
  Background: In recent years, secreted proteins have been identified as markers for disease typing and staging or the development of drugs.Computational iden