Imputing single-cell RNA-seq data by considering cell heterogeneity and prior expression of dropouts

来源 :分子细胞生物学报(英文版) | 被引量 : 0次 | 上传用户:ffff2155
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to determine expression patterns of thousands of individual cells.However,the analysis of scRNA-seq data remains a computational challenge due to the high technical noise such as the presence of dropout events that lead to a large proportion of zeros for expressed genes.Taking into account the cell heterogene-ity and the relationship between dropout rate and expected expression level,we present a cell sub-population based bounded low-rank (PBLR) method to impute the dropouts of scRNA-seq data.Through application to both simulated and real scRNA-seq datasets,PBLR is shown to be effective in recovering dropout events,and it can dramatically improve the low-dimensional repre-sentation and the recovery of gene-gene relationships masked by dropout events compared to several state-of-the-art methods.Moreover,PBLR also detects accurate and robust cell sub-populations automatically,shedding light on its flexibility and general-ity for scRNA-seq data analysis.
其他文献
Protein modification by small ubiquitin-like modifier(SUMO)is an important regulatory mechanism for multiple cellular pro-cesses.Although the canonical pathway
Tumour vasculature is known to be aberrant,tortuous and erratic which can have significant implications for fluid flow.Fluid dynamics in tumour tissue plays an
SARS-CoV-2 is a kind of 'smart' virus that generates complex and dynamic crosstalk with hosts.Even with thousands of publications,the researchers have only unco
期刊
环渤中地区新构造运动强烈,新近系是主要的含油层系.新近系勘探早期集中于凸起区和陡坡带,而斜坡带和凹陷区由于缺乏油气运移条件的系统性研究导致其勘探成效较差.基于前人研
Targeted double-strand breaks (DSBs)in genomes can be introduced efficiently by endonucleases (Urnov et al.,2010;Jinek et al.,2012;Joung and Sander,2013),includ
期刊
本文研究了一类基于非负实参数的新型Chlodovsky算子,用Ditzian-Totik光滑模与二阶连续模得到了逼近定理,然后研究了该算子对Lipschitz类函数的逼近误差上界,最后得到了该算
采用塔河油田缝洞型油藏稀油和常规稠油两种代表性油样,通过开展向前多次接触相平衡实验模拟蒸发气驱过程、向后多次接触相平衡实验模拟凝析气驱过程,研究了氮气与塔河油田原
储层损害和流体敏感性等概念的提出使油气储层保护成为油气勘探与开发领域的重要研究方向,储层保护技术从系统工程理论出发,研究工程作业与地质对象的适应性问题,为油气田开
页岩气井压后裂缝描述对页岩气井生产动态预测、生产后期重复压裂设计以及加密井井眼轨迹设计等具有重要意义.以涪陵页岩气田开发和地质参数为基础,基于高效离散裂缝网络(EDF
Coronavirus disease 2019(COVID-19)caused by coronavirus SARS-CoV-2 infec-tion has now evolved into a worldwide cri-sis that triggers substantial morbidity and m
期刊