File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: NG-SEM: an effective non-Gaussian structural equation modeling framework for gene regulatory network inference from single-cell RNA-seq data

TitleNG-SEM: an effective non-Gaussian structural equation modeling framework for gene regulatory network inference from single-cell RNA-seq data
Authors
Keywordsgene expression
gene regulatory network
single cell
Issue Date1-Nov-2023
PublisherOxford University Press
Citation
Briefings in Bioinformatics, 2023, v. 24, n. 6 How to Cite?
Abstract

Inference of gene regulatory network (GRN) from gene expression profiles has been a central problem in systems biology and bioinformatics in the past decades. The tremendous emergency of single-cell RNA sequencing (scRNA-seq) data brings new opportunities and challenges for GRN inference: the extensive dropouts and complicated noise structure may also degrade the performance of contemporary gene regulatory models. Thus, there is an urgent need to develop more accurate methods for gene regulatory network inference in single-cell data while considering the noise structure at the same time. In this paper, we extend the traditional structural equation modeling (SEM) framework by considering a flexible noise modeling strategy, namely we use the Gaussian mixtures to approximate the complex stochastic nature of a biological system, since the Gaussian mixture framework can be arguably served as a universal approximation for any continuous distributions. The proposed non-Gaussian SEM framework is called NG-SEM, which can be optimized by iteratively performing Expectation-Maximization algorithm and weighted least-squares method. Moreover, the Akaike Information Criteria is adopted to select the number of components of the Gaussian mixture. To probe the accuracy and stability of our proposed method, we design a comprehensive variate of control experiments to systematically investigate the performance of NG-SEM under various conditions, including simulations and real biological data sets. Results on synthetic data demonstrate that this strategy can improve the performance of traditional Gaussian SEM model and results on real biological data sets verify that NG-SEM outperforms other five state-of-the-art methods.


Persistent Identifierhttp://hdl.handle.net/10722/347273
ISSN
2023 Impact Factor: 6.8
2023 SCImago Journal Rankings: 2.143

 

DC FieldValueLanguage
dc.contributor.authorZhao, Jiaying-
dc.contributor.authorWong, Chi Wing-
dc.contributor.authorChing, Wai Ki-
dc.contributor.authorCheng, Xiaoqing-
dc.date.accessioned2024-09-20T00:31:07Z-
dc.date.available2024-09-20T00:31:07Z-
dc.date.issued2023-11-01-
dc.identifier.citationBriefings in Bioinformatics, 2023, v. 24, n. 6-
dc.identifier.issn1467-5463-
dc.identifier.urihttp://hdl.handle.net/10722/347273-
dc.description.abstract<p>Inference of gene regulatory network (GRN) from gene expression profiles has been a central problem in systems biology and bioinformatics in the past decades. The tremendous emergency of single-cell RNA sequencing (scRNA-seq) data brings new opportunities and challenges for GRN inference: the extensive dropouts and complicated noise structure may also degrade the performance of contemporary gene regulatory models. Thus, there is an urgent need to develop more accurate methods for gene regulatory network inference in single-cell data while considering the noise structure at the same time. In this paper, we extend the traditional structural equation modeling (SEM) framework by considering a flexible noise modeling strategy, namely we use the Gaussian mixtures to approximate the complex stochastic nature of a biological system, since the Gaussian mixture framework can be arguably served as a universal approximation for any continuous distributions. The proposed non-Gaussian SEM framework is called NG-SEM, which can be optimized by iteratively performing Expectation-Maximization algorithm and weighted least-squares method. Moreover, the Akaike Information Criteria is adopted to select the number of components of the Gaussian mixture. To probe the accuracy and stability of our proposed method, we design a comprehensive variate of control experiments to systematically investigate the performance of NG-SEM under various conditions, including simulations and real biological data sets. Results on synthetic data demonstrate that this strategy can improve the performance of traditional Gaussian SEM model and results on real biological data sets verify that NG-SEM outperforms other five state-of-the-art methods.</p>-
dc.languageeng-
dc.publisherOxford University Press-
dc.relation.ispartofBriefings in Bioinformatics-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectgene expression-
dc.subjectgene regulatory network-
dc.subjectsingle cell-
dc.titleNG-SEM: an effective non-Gaussian structural equation modeling framework for gene regulatory network inference from single-cell RNA-seq data-
dc.typeArticle-
dc.identifier.doi10.1093/bib/bbad369-
dc.identifier.pmid37864293-
dc.identifier.scopuseid_2-s2.0-85175498643-
dc.identifier.volume24-
dc.identifier.issue6-
dc.identifier.eissn1477-4054-
dc.identifier.issnl1467-5463-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats