File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Error Analysis of Three-Layer Neural Network Trained With PGD for Deep Ritz Method

TitleError Analysis of Three-Layer Neural Network Trained With PGD for Deep Ritz Method
Authors
Keywordsconvergence rate
deep Ritz method
Neural network
over-parametrization
projected gradient descent
Issue Date2025
Citation
IEEE Transactions on Information Theory, 2025, v. 71, n. 7, p. 5512-5538 How to Cite?
AbstractMachine learning is a rapidly advancing field with diverse applications across various domains. One prominent area of research is the utilization of deep learning techniques for solving partial differential equations (PDEs). In this work, we specifically focus on employing a three-layer tanh neural network within the framework of the deep Ritz method (DRM) to solve second-order elliptic equations with three different types of boundary conditions. We perform projected gradient descent (PDG) to train the three-layer network and we establish its global convergence. To the best of our knowledge, we are the first to provide a comprehensive error analysis of using overparameterized networks to solve PDE problems, as our analysis simultaneously includes estimates for approximation error, generalization error, and optimization error. We present error bound in terms of the sample size n and our work provides guidance on how to set the network depth, width, step size, and number of iterations for the projected gradient descent algorithm. Importantly, our assumptions in this work are classical and we do not require any additional assumptions on the solution of the equation. This ensures the broad applicability and generality of our results.
Persistent Identifierhttp://hdl.handle.net/10722/363031
ISSN
2023 Impact Factor: 2.2
2023 SCImago Journal Rankings: 1.607

 

DC FieldValueLanguage
dc.contributor.authorJiao, Yuling-
dc.contributor.authorLai, Yanming-
dc.contributor.authorWang, Yang-
dc.date.accessioned2025-10-10T07:44:09Z-
dc.date.available2025-10-10T07:44:09Z-
dc.date.issued2025-
dc.identifier.citationIEEE Transactions on Information Theory, 2025, v. 71, n. 7, p. 5512-5538-
dc.identifier.issn0018-9448-
dc.identifier.urihttp://hdl.handle.net/10722/363031-
dc.description.abstractMachine learning is a rapidly advancing field with diverse applications across various domains. One prominent area of research is the utilization of deep learning techniques for solving partial differential equations (PDEs). In this work, we specifically focus on employing a three-layer tanh neural network within the framework of the deep Ritz method (DRM) to solve second-order elliptic equations with three different types of boundary conditions. We perform projected gradient descent (PDG) to train the three-layer network and we establish its global convergence. To the best of our knowledge, we are the first to provide a comprehensive error analysis of using overparameterized networks to solve PDE problems, as our analysis simultaneously includes estimates for approximation error, generalization error, and optimization error. We present error bound in terms of the sample size n and our work provides guidance on how to set the network depth, width, step size, and number of iterations for the projected gradient descent algorithm. Importantly, our assumptions in this work are classical and we do not require any additional assumptions on the solution of the equation. This ensures the broad applicability and generality of our results.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Information Theory-
dc.subjectconvergence rate-
dc.subjectdeep Ritz method-
dc.subjectNeural network-
dc.subjectover-parametrization-
dc.subjectprojected gradient descent-
dc.titleError Analysis of Three-Layer Neural Network Trained With PGD for Deep Ritz Method-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TIT.2025.3570730-
dc.identifier.scopuseid_2-s2.0-105005285117-
dc.identifier.volume71-
dc.identifier.issue7-
dc.identifier.spage5512-
dc.identifier.epage5538-
dc.identifier.eissn1557-9654-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats