Showing results 1 to 3 of 3
Title | Author(s) | Issue Date | |
---|---|---|---|
Closing the generalization gap of adaptive gradient methods in training deep neural networks Proceeding/Conference:Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence | 2020 | ||
Gradient descent optimizes over-parameterized deep ReLU networks Journal:Machine Learning | 2020 | ||
On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization Journal:Transactions on Machine Learning Research | 16-Mar-2024 |