File Download
Supplementary

postgraduate thesis: A bootstrap procedure for unsupervised classification based on variable and observation resampling

TitleA bootstrap procedure for unsupervised classification based on variable and observation resampling
Authors
Issue Date2017
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Liu, J. [刘俊宏]. (2017). A bootstrap procedure for unsupervised classification based on variable and observation resampling. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractIn this thesis, we first investigate the performance of bagged sparse 2-means clustering using a conventional bagging procedure. We next propose a cross bagging procedure, which bootstraps the variables and observations simultaneously, for sparse unsupervised classification problems. Formulated under a general framework, the procedure admits a variety of plugged-in classifiers, which can be determined case by case. Our procedure comprises two main stages. The main goal in the first stage is to collect class probabilities and variable importance. Another round of modified bagging is implemented in the follow-up stage, aiming to re-classify the original learning set so that the final partition is more accurate than that obtained by a single-round bagging. We illustrate the potential applicability of cross bagging by studying its behavior when sparse 2-means clustering is taken to be the sparse unsupervised classifier and k-nearest-neighbor classifier is taken as the supervised classifier. Several simulations are reported to demonstrate the performances of cross bagged sparse 2-means clustering, which are compared with those of sparse 2-means clustering and its bagged version. The results show that, under the aforementioned setting, cross bagged sparse 2-means clustering approximates Bayes rule the best when n is greater or close to p. However, this conclusion may not be true if n is much smaller than p.
DegreeMaster of Philosophy
SubjectBootstrap (Statistics)
Cluster analysis
Mathematical statistics
Dept/ProgramStatistics and Actuarial Science
Persistent Identifierhttp://hdl.handle.net/10722/239967
HKU Library Item IDb5846396

 

DC FieldValueLanguage
dc.contributor.authorLiu, Junhong-
dc.contributor.author刘俊宏-
dc.date.accessioned2017-04-08T23:13:19Z-
dc.date.available2017-04-08T23:13:19Z-
dc.date.issued2017-
dc.identifier.citationLiu, J. [刘俊宏]. (2017). A bootstrap procedure for unsupervised classification based on variable and observation resampling. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/239967-
dc.description.abstractIn this thesis, we first investigate the performance of bagged sparse 2-means clustering using a conventional bagging procedure. We next propose a cross bagging procedure, which bootstraps the variables and observations simultaneously, for sparse unsupervised classification problems. Formulated under a general framework, the procedure admits a variety of plugged-in classifiers, which can be determined case by case. Our procedure comprises two main stages. The main goal in the first stage is to collect class probabilities and variable importance. Another round of modified bagging is implemented in the follow-up stage, aiming to re-classify the original learning set so that the final partition is more accurate than that obtained by a single-round bagging. We illustrate the potential applicability of cross bagging by studying its behavior when sparse 2-means clustering is taken to be the sparse unsupervised classifier and k-nearest-neighbor classifier is taken as the supervised classifier. Several simulations are reported to demonstrate the performances of cross bagged sparse 2-means clustering, which are compared with those of sparse 2-means clustering and its bagged version. The results show that, under the aforementioned setting, cross bagged sparse 2-means clustering approximates Bayes rule the best when n is greater or close to p. However, this conclusion may not be true if n is much smaller than p.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshBootstrap (Statistics)-
dc.subject.lcshCluster analysis-
dc.subject.lcshMathematical statistics-
dc.titleA bootstrap procedure for unsupervised classification based on variable and observation resampling-
dc.typePG_Thesis-
dc.identifier.hkulb5846396-
dc.description.thesisnameMaster of Philosophy-
dc.description.thesislevelMaster-
dc.description.thesisdisciplineStatistics and Actuarial Science-
dc.description.naturepublished_or_final_version-
dc.identifier.mmsid991022013549703414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats