File Download
Supplementary

postgraduate thesis: Nonconvex penalised regression and post-selection least squares under high dimensions : a local asymptotic perspective

TitleNonconvex penalised regression and post-selection least squares under high dimensions : a local asymptotic perspective
Authors
Advisors
Advisor(s):Lee, SMS
Issue Date2023
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Xu, X. [徐小雅]. (2023). Nonconvex penalised regression and post-selection least squares under high dimensions : a local asymptotic perspective. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractIn the realm of high-dimensional linear regression, nonconvex penalised estimators have enjoyed increasing popularity due to their much acclaimed oracle property, which holds under assumptions weaker than those typically required for convex penalised estimators to enjoy the same property. However, validity of such oracle property of nonconvex penalisation and the accompanying inference tools is questionable in the presence of many weak signals and/or a few moderate signals, which may incur substantial biases. To address this issue, the thesis aims at developing theoretically valid, computationally feasible, procedures for estimation and inference about the relationships between strong signals and response variables. We first provide a more holistic assessment of the selection and convergence properties of nonconvex penalised estimators from a local asymptotic perspective, under a framework which accommodates existence of many weak signals or a few moderate signals and, heavy tail conditions on covariates and random errors. We then show that post-selection least squares estimation has the beneficial effect of removing the bias incurred by nonconvex penalisation of moderate signals. Post-selection least squares estimators acquire convergence properties more desirable than nonconvex penalised estimators and, in the case of multiple solutions to the nonconvex optimisation program, are ratewise more robust against the choice of selected sets. In particular, the post-selection least squares method is found to improve on nonconvex penalised estimation, especially under heavy-tailed settings. Then, trustworthy bootstrap inference procedures based on nonconvex penalised estimators and their post-selection OLS estimators are developed to estimate their distribution under this flexible framework. The revised bootstrap method draws the bootstrap samples from the post-selection OLS model, and is shown to be valid under weaker conditions. Owing to its desirable theoretical properties, the residual bootstrap method based on post-selection least squares estimators is accurate generally, and can be as effective as normal approximation, even without assuming strong conditions on signal strength. Empirical results obtained from large-scale simulation studies corroborate our theoretical findings.
DegreeDoctor of Philosophy
SubjectRegression analysis
Least squares
Dept/ProgramStatistics and Actuarial Science
Persistent Identifierhttp://hdl.handle.net/10722/330264

 

DC FieldValueLanguage
dc.contributor.advisorLee, SMS-
dc.contributor.authorXu, Xiaoya-
dc.contributor.author徐小雅-
dc.date.accessioned2023-08-31T09:18:17Z-
dc.date.available2023-08-31T09:18:17Z-
dc.date.issued2023-
dc.identifier.citationXu, X. [徐小雅]. (2023). Nonconvex penalised regression and post-selection least squares under high dimensions : a local asymptotic perspective. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/330264-
dc.description.abstractIn the realm of high-dimensional linear regression, nonconvex penalised estimators have enjoyed increasing popularity due to their much acclaimed oracle property, which holds under assumptions weaker than those typically required for convex penalised estimators to enjoy the same property. However, validity of such oracle property of nonconvex penalisation and the accompanying inference tools is questionable in the presence of many weak signals and/or a few moderate signals, which may incur substantial biases. To address this issue, the thesis aims at developing theoretically valid, computationally feasible, procedures for estimation and inference about the relationships between strong signals and response variables. We first provide a more holistic assessment of the selection and convergence properties of nonconvex penalised estimators from a local asymptotic perspective, under a framework which accommodates existence of many weak signals or a few moderate signals and, heavy tail conditions on covariates and random errors. We then show that post-selection least squares estimation has the beneficial effect of removing the bias incurred by nonconvex penalisation of moderate signals. Post-selection least squares estimators acquire convergence properties more desirable than nonconvex penalised estimators and, in the case of multiple solutions to the nonconvex optimisation program, are ratewise more robust against the choice of selected sets. In particular, the post-selection least squares method is found to improve on nonconvex penalised estimation, especially under heavy-tailed settings. Then, trustworthy bootstrap inference procedures based on nonconvex penalised estimators and their post-selection OLS estimators are developed to estimate their distribution under this flexible framework. The revised bootstrap method draws the bootstrap samples from the post-selection OLS model, and is shown to be valid under weaker conditions. Owing to its desirable theoretical properties, the residual bootstrap method based on post-selection least squares estimators is accurate generally, and can be as effective as normal approximation, even without assuming strong conditions on signal strength. Empirical results obtained from large-scale simulation studies corroborate our theoretical findings.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshRegression analysis-
dc.subject.lcshLeast squares-
dc.titleNonconvex penalised regression and post-selection least squares under high dimensions : a local asymptotic perspective-
dc.typePG_Thesis-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineStatistics and Actuarial Science-
dc.description.naturepublished_or_final_version-
dc.date.hkucongregation2023-
dc.identifier.mmsid991044717471203414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats