File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Subsample Ridge Ensembles: Equivalences and Generalized Cross-Validation

TitleSubsample Ridge Ensembles: Equivalences and Generalized Cross-Validation
Authors
Issue Date2023
Citation
Proceedings of Machine Learning Research, 2023, v. 202, p. 8585-8631 How to Cite?
AbstractWe study subsampling-based ridge ensembles in the proportional asymptotics regime, where the feature size grows proportionally with the sample size such that their ratio converges to a constant. By analyzing the squared prediction risk of ridge ensembles as a function of the explicit penalty λ and the limiting subsample aspect ratio ϕs (the ratio of the feature size to the subsample size), we characterize contours in the (λ, ϕs)-plane at any achievable risk. As a consequence, we prove that the risk of the optimal full ridgeless ensemble (fitted on all possible subsamples) matches that of the optimal ridge predictor. In addition, we prove strong uniform consistency of generalized cross-validation (GCV) over the subsample sizes for estimating the prediction risk of ridge ensembles. This allows for GCV-based tuning of full ridgeless ensembles without sample splitting and yields a predictor whose risk matches optimal ridge risk.
Persistent Identifierhttp://hdl.handle.net/10722/365522

 

DC FieldValueLanguage
dc.contributor.authorDu, Jin Hong-
dc.contributor.authorPatil, Pratik-
dc.contributor.authorKuchibhotla, Arun Kumar-
dc.date.accessioned2025-11-05T09:41:14Z-
dc.date.available2025-11-05T09:41:14Z-
dc.date.issued2023-
dc.identifier.citationProceedings of Machine Learning Research, 2023, v. 202, p. 8585-8631-
dc.identifier.urihttp://hdl.handle.net/10722/365522-
dc.description.abstractWe study subsampling-based ridge ensembles in the proportional asymptotics regime, where the feature size grows proportionally with the sample size such that their ratio converges to a constant. By analyzing the squared prediction risk of ridge ensembles as a function of the explicit penalty λ and the limiting subsample aspect ratio ϕ<inf>s</inf> (the ratio of the feature size to the subsample size), we characterize contours in the (λ, ϕ<inf>s</inf>)-plane at any achievable risk. As a consequence, we prove that the risk of the optimal full ridgeless ensemble (fitted on all possible subsamples) matches that of the optimal ridge predictor. In addition, we prove strong uniform consistency of generalized cross-validation (GCV) over the subsample sizes for estimating the prediction risk of ridge ensembles. This allows for GCV-based tuning of full ridgeless ensembles without sample splitting and yields a predictor whose risk matches optimal ridge risk.-
dc.languageeng-
dc.relation.ispartofProceedings of Machine Learning Research-
dc.titleSubsample Ridge Ensembles: Equivalences and Generalized Cross-Validation-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.scopuseid_2-s2.0-85174386933-
dc.identifier.volume202-
dc.identifier.spage8585-
dc.identifier.epage8631-
dc.identifier.eissn2640-3498-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats