| Title | Proceeding/Conference | Issue Date |
|---|---|---|
| A THEORETICAL ANALYSIS ON FEATURE LEARNING IN NEURAL NETWORKS: EMERGENCE FROM INPUTS AND ADVANTAGE OVER FIXED FEATURES | ICLR 2022 - 10th International Conference on Learning Representations | 2022 |
| Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix | The 13th International Conference on Learning Representations (ICLR) (24/04/2025-28/04/2025, Singapore) | 24-Apr-2025 |
| Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent | The 28th International Conference on Artificial Intelligence and Statistics (03/05/2025-05/05/2025, Mai Khao) | 3-May-2025 |
| Curse of Attention: A Kernel-Based Perspective for Why Transformers Fail to Generalize on Time Series Forecasting and Beyond | Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California) | 24-Mar-2025 |
| Deep Online Fused Video Stabilization | Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022 | 2022 |
| Differential Privacy Mechanisms in Neural Tangent Kernel Regression | IEEE/CVF Winter Conference on Applications of Computer Vision 2025 (28/02/2025-04/03/2025, Tucson, Arizona) | 28-Feb-2025 |
| Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability | First Conference on Language Modeling (07/10/2024-12/10/2024, Philadelphia) | 7-Oct-2024 |
| Domain Generalization via Nuclear Norm Regularization | Proceedings of Machine Learning Research | 2024 |
| Fast John Ellipsoid Computation with Differential Privacy Optimization | Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California) | 24-Mar-2025 |
| Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple Inputs | The 28th International Conference on Artificial Intelligence and Statistics (03/05/2025-05/05/2025, Mai Khao) | 3-May-2025 |
| HSR-Enhanced Sparse Attention Acceleration | Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California) | 24-Mar-2025 |
| Looped ReLU MLPs May Be All You Need as Practical Programmable Computers | The 28th International Conference on Artificial Intelligence and Statistics (03/05/2025-05/05/2025, Mai Khao) | 3-May-2025 |
| The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity | Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California) | 24-Mar-2025 |
| When and How Does Known Class Help Discover Unknown Ones? Provable Understanding Through Spectral Analysis | Proceedings of Machine Learning Research | 2023 |
| When Can We Solve the Weighted Low Rank Approximation Problem in Truly Subquadratic Time? | The 28th International Conference on Artificial Intelligence and Statistics (03/05/2025-05/05/2025, Mai Khao) | 3-May-2025 |
