Ensemble Classification for Gene Expression Data based on Parallel Clustering

Source: The 2nd China Computer Federation (CCF) Bioinformatics Conference
Excerpt from the paper
  Analysis of large-scale gene expression data is a research hotspot in bioinformatics. It can be used to diagnose human and animal diseases and to study abnormal phenomena in plant growth. This paper proposes a biological knowledge integration method based on parallel clustering to select gene subsets effectively. Gene Ontology is used to obtain the biological function similarity between genes, which is then combined with the gene expression data. A parallelized affinity propagation algorithm is used to cluster the fused data, since it can not only yield more biologically meaningful subsets but also avoid losing potentially valuable genes, as can happen with simple preliminary gene selection. Based on the clustering result, a neighborhood rough set is used to select representative genes, which are then used to train a classifier for each cluster.
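  The fusion step described above can be illustrated with a minimal sketch. The excerpt does not state how the Gene Ontology similarity and the expression data are combined, so the weighted blend below, the weight `alpha`, the use of Pearson correlation, and the function names are all assumptions made for illustration, not the authors' method. The `majority_vote` helper is likewise only one hypothetical way to combine the per-cluster classifiers into an ensemble.

```python
import math
from collections import Counter


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # a flat profile carries no correlation information
    return cov / (sx * sy)


def fuse_similarity(expr, go_sim, alpha=0.5):
    """Blend expression similarity with GO functional similarity.

    expr   : list of per-gene expression profiles (one list per gene)
    go_sim : square matrix of GO-based similarities, assumed to lie in [0, 1]
    alpha  : hypothetical weight on the expression term (an assumption)
    """
    n = len(expr)
    fused = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            # Rescale correlation from [-1, 1] to [0, 1] so both terms share a range.
            expr_sim = (pearson(expr[i], expr[j]) + 1.0) / 2.0
            fused[i][j] = alpha * expr_sim + (1.0 - alpha) * go_sim[i][j]
    return fused


def majority_vote(predictions):
    """Combine per-cluster classifier outputs by simple majority (an assumption)."""
    return Counter(predictions).most_common(1)[0][0]


# Toy data: genes 0 and 1 are co-expressed and functionally similar; gene 2 is not.
expr = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0], [3.0, 2.0, 1.0]]
go_sim = [[1.0, 0.8, 0.1], [0.8, 1.0, 0.2], [0.1, 0.2, 1.0]]
fused = fuse_similarity(expr, go_sim, alpha=0.5)
label = majority_vote(["tumor", "normal", "tumor"])
```

  A fused matrix of this kind could then be passed to any affinity propagation implementation that accepts a precomputed similarity matrix, for example scikit-learn's `AffinityPropagation(affinity="precomputed")`. The parallelization and the neighborhood rough set selection mentioned in the abstract are not covered by this sketch.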