File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1038/s41467-022-34595-w
- Scopus: eid_2-s2.0-85142375367
- WOS: WOS:000888056100011
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Leveraging data-driven self-consistency for high-fidelity gene expression recovery
Title | Leveraging data-driven self-consistency for high-fidelity gene expression recovery |
---|---|
Authors | |
Issue Date | 21-Nov-2022 |
Publisher | Nature Research |
Citation | Nature Communications, 2022, v. 13, n. 1 How to Cite? |
Abstract | Single cell RNA sequencing is a promising technique to determine the states of individual cells and classify novel cell subtypes. In current sequence data analysis, however, genes with low expressions are omitted, which leads to inaccurate gene counts and hinders downstream analysis. Recovering these omitted expression values presents a challenge because of the large size of the data. Here, we introduce a data-driven gene expression recovery framework, referred to as self-consistent expression recovery machine (SERM), to impute the missing expressions. Using a neural network, the technique first learns the underlying data distribution from a subset of the noisy data. It then recovers the overall expression data by imposing a self-consistency on the expression matrix, thus ensuring that the expression levels are similarly distributed in different parts of the matrix. We show that SERM improves the accuracy of gene imputation with orders of magnitude enhancement in computational efficiency in comparison to the state-of-the-art imputation techniques. Recovering dropout-affected gene expression values is a challenging problem in bioinformatics. Here, the authors propose a data-driven framework, that first learns the underlying data distribution and then recovers the expression values by imposing a self-consistency on the expression matrix. |
Persistent Identifier | http://hdl.handle.net/10722/331868 |
ISSN | 2023 Impact Factor: 14.7 2023 SCImago Journal Rankings: 4.887 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Islam, MT | - |
dc.contributor.author | Wang, J | - |
dc.contributor.author | Ren, H | - |
dc.contributor.author | Li, X | - |
dc.contributor.author | Khuzani, MB | - |
dc.contributor.author | Sang, S | - |
dc.contributor.author | Yu, L | - |
dc.contributor.author | Shen, L | - |
dc.contributor.author | Zhao, W | - |
dc.contributor.author | Xing, L | - |
dc.date.accessioned | 2023-09-28T04:59:14Z | - |
dc.date.available | 2023-09-28T04:59:14Z | - |
dc.date.issued | 2022-11-21 | - |
dc.identifier.citation | Nature Communications, 2022, v. 13, n. 1 | - |
dc.identifier.issn | 2041-1723 | - |
dc.identifier.uri | http://hdl.handle.net/10722/331868 | - |
dc.description.abstract | <p>Single cell RNA sequencing is a promising technique to determine the states of individual cells and classify novel cell subtypes. In current sequence data analysis, however, genes with low expressions are omitted, which leads to inaccurate gene counts and hinders downstream analysis. Recovering these omitted expression values presents a challenge because of the large size of the data. Here, we introduce a data-driven gene expression recovery framework, referred to as self-consistent expression recovery machine (SERM), to impute the missing expressions. Using a neural network, the technique first learns the underlying data distribution from a subset of the noisy data. It then recovers the overall expression data by imposing a self-consistency on the expression matrix, thus ensuring that the expression levels are similarly distributed in different parts of the matrix. We show that SERM improves the accuracy of gene imputation with orders of magnitude enhancement in computational efficiency in comparison to the state-of-the-art imputation techniques.</p><p>Recovering dropout-affected gene expression values is a challenging problem in bioinformatics. Here, the authors propose a data-driven framework, that first learns the underlying data distribution and then recovers the expression values by imposing a self-consistency on the expression matrix.</p> | - |
dc.language | eng | - |
dc.publisher | Nature Research | - |
dc.relation.ispartof | Nature Communications | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.title | Leveraging data-driven self-consistency for high-fidelity gene expression recovery | - |
dc.type | Article | - |
dc.identifier.doi | 10.1038/s41467-022-34595-w | - |
dc.identifier.scopus | eid_2-s2.0-85142375367 | - |
dc.identifier.volume | 13 | - |
dc.identifier.issue | 1 | - |
dc.identifier.eissn | 2041-1723 | - |
dc.identifier.isi | WOS:000888056100011 | - |
dc.identifier.issnl | 2041-1723 | - |