Comparison and evaluation of network clustering algorithms applied to genetic interaction networks

Source: 5th National Conference on Bioinformatics and Systems Biology (第五届全国生物信息学与系统生物学学术大会)
  The goal of network clustering algorithms is to detect dense clusters in a network, which provides a first step towards understanding large-scale biological networks. With numerous recent advances in biotechnology, large-scale genetic interaction data are widely available, but there is limited understanding of which clustering algorithms are most effective. To address this problem, we conducted a systematic study to compare and evaluate six clustering algorithms for analyzing genetic interaction networks, and investigated the factors that influence the choice of algorithm. The algorithms considered in this comparison are hierarchical clustering, topological overlap matrix, bi-clustering, Markov clustering, Bayesian discriminant analysis based community detection, and the variational Bayes approach to modularity. Both experimentally identified and synthetically constructed networks were used in this comparison. The accuracy of the algorithms was measured by the Jaccard index, comparing predicted gene modules with benchmark gene sets. The results suggest that the best choice of algorithm depends on the network topology and the evaluation criteria. Hierarchical clustering proved best at predicting protein complexes; Bayesian discriminant analysis based community detection proved best on epistatic miniarray profile (EMAP) datasets; and the variational Bayes approach to modularity was noticeably better than the other algorithms on genome-scale networks.
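
  As an illustration of the accuracy measure named above, the following is a minimal sketch of scoring predicted gene modules against benchmark gene sets with the Jaccard index. The abstract does not say how per-module scores are aggregated, so the "best match per predicted module, then average" scheme is an assumption made here for illustration, and the function names and gene labels are hypothetical.

```python
from typing import List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard index |A ∩ B| / |A ∪ B|; taken as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def mean_best_jaccard(predicted: List[Set[str]], benchmark: List[Set[str]]) -> float:
    """Average, over predicted modules, of the best Jaccard match in the benchmark.

    Assumption: this aggregation is one common choice, not necessarily
    the one used in the study.
    """
    if not predicted or not benchmark:
        return 0.0
    best = [max(jaccard(module, ref) for ref in benchmark) for module in predicted]
    return sum(best) / len(best)


# Hypothetical gene labels, for illustration only.
predicted_modules = [{"A", "B", "C"}, {"D", "E"}]
benchmark_sets = [{"A", "B", "C", "D"}, {"E", "F"}]

score = mean_best_jaccard(predicted_modules, benchmark_sets)
print(f"mean best-match Jaccard: {score:.3f}")
```

  In this toy case the first module matches its closest benchmark set with a Jaccard index of 3/4 and the second with 1/3, so the averaged score is about 0.542. A score of 1 would mean every predicted module exactly reproduces some benchmark set.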