Browsing by Author Shi, Zhenmei

Jump to: 0-9 A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Showing results 1 to 15 of 15
TitleAuthor(s)Issue Date
A THEORETICAL ANALYSIS ON FEATURE LEARNING IN NEURAL NETWORKS: EMERGENCE FROM INPUTS AND ADVANTAGE OVER FIXED FEATURES
Proceeding/Conference:ICLR 2022 - 10th International Conference on Learning Representations
2022
 
Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix
Proceeding/Conference:The 13th International Conference on Learning Representations (ICLR) (24/04/2025-28/04/2025, Singapore)
24-Apr-2025
 
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent
Proceeding/Conference:The 28th International Conference on Artificial Intelligence and Statistics. (03/05/2025-05/05/2025, Mai Khao)
3-May-2025
 
Curse of Attention: A Kernel-Based Perspective for Why Transformers Fail to Generalize on Time Series Forecasting and Beyond
Proceeding/Conference:Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California)
24-Mar-2025
Deep Online Fused Video Stabilization
Proceeding/Conference:Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022
2022
 
Differential Privacy Mechanisms in Neural Tangent Kernel Regression
Proceeding/Conference:IEEE/CVF Winter Conference on Applications of Computer Vision 2025 (28/02/2025-04/03/2025, Tucson, Arizona)
28-Feb-2025
 
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability*
Proceeding/Conference:First Conference on Language Modeling (07/10/2024-12/10/2024, Philadelphia)
7-Oct-2024
Domain Generalization via Nuclear Norm Regularization
Proceeding/Conference:Proceedings of Machine Learning Research
2024
 
Fast John Ellipsoid Computation with Differential Privacy Optimization
Proceeding/Conference:Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California)
24-Mar-2025
 
Fourier Circuits in Neural Networks and Transformers: A Case Study of Modular Arithmetic with Multiple Inputs
Proceeding/Conference:The 28th International Conference on Artificial Intelligence and Statistics. (03/05/2025-05/05/2025, Mai Khao)
3-May-2025
 
HSR-Enhanced Sparse Attention Acceleration
Proceeding/Conference:Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California)
24-Mar-2025
 
Looped ReLU MLPs May Be All You Need as Practical Programmable Computers
Proceeding/Conference:The 28th International Conference on Artificial Intelligence and Statistics. (03/05/2025-05/05/2025, Mai Khao)
3-May-2025
 
The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity
Proceeding/Conference:Conference on Parsimony and Learning 2025 (24/03/2025-27/03/2025, Stanford University, California)
24-Mar-2025
2023
 
When Can We Solve the Weighted Low Rank Approximation Problem in Truly Subquadratic Time?
Proceeding/Conference:The 28th International Conference on Artificial Intelligence and Statistics. (03/05/2025-05/05/2025, Mai Khao)
3-May-2025