【摘 要】
:
In gene expression profiling studies,including single-cell RNA sequencing (scRNA-seq)analyses,the identification and characterization of co-expressed genes provides critical information on cell identity and function.Gene co-expression clustering in scRNA-
【机 构】
:
Center for Cancer Genomics and Precision Oncology,Wake Forest Baptist Comprehensive Cancer Center,Wa
【出 处】
:
基因组蛋白质组与生物信息学报(英文版)
论文部分内容阅读
In gene expression profiling studies,including single-cell RNA sequencing (scRNA-seq)analyses,the identification and characterization of co-expressed genes provides critical information on cell identity and function.Gene co-expression clustering in scRNA-seq data presents certain challenges.We show that commonly used methods for single-cell data are not capable of identifying co-expressed genes accurately,and produce results that substantially limit biological expectations of co-expressed genes.Herein,we present single-cell Latent-variable Model (scLM),a gene co-clustering algorithm tailored to single-cell data that performs well at detecting gene clusters with significant biologic context.Importantly,scLM can simultaneously cluster multiple single-cell data-sets,i.e.,consensus clustering,enabling users to leverage single-cell data from multiple sources for novel comparative analysis.scLM takes raw count data as input and preserves biological variation without being influenced by batch effects from multiple datasets.Results from both simulation data and experimental data demonstrate that scLM outperforms the existing methods with considerably improved accuracy.To illustrate the biological insights of scLM,we apply it to our in-house and public experimental scRNA-seq datasets.scLM identifies novel functional gene modules and refines cell states,which facilitates mechanism discovery and understanding of complex biosystems such as cancers.A user-friendly R package with all the key features of the scLM method is available at https://github.com/QSong-github/scLM.
其他文献
冠状病毒(Coronavirus)是具有包膜的正单链RNA病毒,基因组大小介于26 000与32 000 nt之间,编码刺突蛋白(S)、包膜蛋白(E)、膜蛋白(M)和核壳蛋白(N)等四种结构蛋白、复制酶(ORF1a/b)与若干辅助蛋白,部分病毒还具有血细胞凝集素酯酶(HE),这些蛋白除维持病毒结构,还有促进感染与抵抗宿主免疫反应等功能,其中刺突蛋白可与宿主细胞表面的受体结合,使病毒包膜和宿主细胞的膜融合以感染细胞.冠状病毒的感染会影响细胞的许多信号转导途径,引发免疫反应,是一类可感染哺乳动物与鸟类的病毒.
楝酰胺类化合物存在于楝属植物中,因其独特的化学结构而具有杀虫、抗炎及抗癌的活性.目前研究发现,楝酰胺类化合物对多种癌症如肺癌、肾癌、胰腺癌、恶性外周神经鞘瘤等都具有独特的细胞凋亡作用,而对正常细胞无毒害作用.楝酰胺类化合物抗肿瘤的机制主要有:抑制癌细胞翻译起始、调控细胞周期、诱导肿瘤细胞凋亡、抑制细胞增殖、降低药物细胞毒性等.因此,楝酰胺及其衍生物作为潜在抗癌药物也有极大的应用前景,成为近年来的研究热点.本综述总结了楝属植物中的次生代谢产物-楝酰胺类化合物的发现过程、结构特征和抗癌活性,重点阐述了其在癌症
The recent advancement of single-cell RNA sequencing (scRNA-seq) technologies facilitates the study of cell lineages in developmental processes and cancer.In this study,we developed a computational method,called redPATH,to reconstruct the pseudo developme
Single-cell mass cytometry (SCMC) combines features of traditional flow cytometry (i.e.,fluorescence-activated cell sorting) with mass spectrometry,making it possible to measure several parameters at the single-cell level for a complex analysis of biologi
Single-cell RNA sequencing (scRNA-seq) is generally used for profiling transcriptome of individual cells.The droplet-based 10X Genomics Chromium (10X) approach and the plate-based Smart-seq2 full-length method are two frequently used scRNA-seq platforms,y
Successful pregnancy in placental mammals substantially depends on the establishment of maternal immune tolerance to the semi-allogenic fetus.Disorders in this process are tightly asso-ciated with adverse pregnancy outcomes including recurrent miscarriage
The rapid advancement of single-cell technologies has shed new light on the complex mechanisms of cellular heterogeneity.However,compared to bulk RNA sequencing (RNA-seq),single-cell RNA-seq (scRNA-seq) suffers from higher noise and lower coverage,which b
One of the major challenges in single-cell data analysis is the determination of cellular developmental trajectories using single-cell data.Although substantial studies have been conducted in recent years,more effective methods are still strongly needed t
Accurate identification of cell types from single-cell RNA sequencing (scRNA-seq) data plays a critical role in a variety of scRNA-seq analysis studies.This task corresponds to solving an unsupervised clustering problem,in which the similarity measurement
Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis.Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification.However,comprehensive evaluations