File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Supervised machine learning with Delaunay triangulation learner and collaborative gradient boosting
Title | Supervised machine learning with Delaunay triangulation learner and collaborative gradient boosting |
---|---|
Authors | |
Advisors | Advisor(s):Yin, G |
Issue Date | 2021 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Liu, Y. [刘业鸿]. (2021). Supervised machine learning with Delaunay triangulation learner and collaborative gradient boosting. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | In this thesis, we discuss some statistical machine learning methods and their ensemble learning approachs with theories and applications.
Firstly, we propose a new piecewise linear learner, called the Delaunay triangulation learner (DTL). Based on the data samples in a p-dimensional feature space, the Delaunay triangulation algorithm provides a unique triangulation of the space. The triangulation separates the convex hull of the samples into a series of disjoint p-simplices, where the samples are the vertices of the p-simplices. The DTL is constructed by fitting the responses through linear interpolation functions on each of the Delaunay simplices, and thus it approximates the whole functional by a piecewise linear function. In its ensemble learning approaches, we propose the bagging DTLs, random crystal and the boosting DTL, where the DTLs are constructed on the subspaces of the features, and the feature interactions are captured by Delaunay triangle meshes. Extensive numerical studies are conducted to compare the proposed DTL and its ensembles with tree-based counterparts. The DTL methods show competitive performances in various settings, and particularly for smooth functionals the DTL demonstrates its superiority over others.
Secondly, we propose a collaborative gradient boosting (CGB) algorithm, which can leverage the full potential of boosting multiple types of base learners by using the intrinsic regularization. Inspired by the idea of coopetition (i.e., cooperation and competition), we allow the base learners of different types to collaborate with as well as competing against each other in gradient descent steps. In the CGB algorithm, different types of base learners typically learn from each other toward better model accuracy, i.e., poor-performing base learners would approach better ones. Extensive numerical studies show that the CGB dramatically improves the prediction accuracy by utilizing different types of learners. Our method yields a general strategy that performs better on real data from a wide range of domains.
Finally, we introduce a new concept of the average holding price (AHP) in stock market. We show that, under certain assumptions on the investors’ behavior, the average holding price of a stock can be estimated, based on the historical trading prices and volumes. In contrast to the moving average of the stock prices, the AHP can serve as a benchmark for estimating the average profit/loss level of the stock holders. The numerical algorithm for the AHP depends on an recursive equation, which enables us to compute the real-time AHP from any time point of the stock trading. As illustrated in the examples, some trading strategies can also be built upon the AHP. |
Degree | Doctor of Philosophy |
Subject | Machine learning |
Dept/Program | Statistics and Actuarial Science |
Persistent Identifier | http://hdl.handle.net/10722/298879 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Yin, G | - |
dc.contributor.author | Liu, Yehong | - |
dc.contributor.author | 刘业鸿 | - |
dc.date.accessioned | 2021-04-16T11:16:36Z | - |
dc.date.available | 2021-04-16T11:16:36Z | - |
dc.date.issued | 2021 | - |
dc.identifier.citation | Liu, Y. [刘业鸿]. (2021). Supervised machine learning with Delaunay triangulation learner and collaborative gradient boosting. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/298879 | - |
dc.description.abstract | In this thesis, we discuss some statistical machine learning methods and their ensemble learning approachs with theories and applications. Firstly, we propose a new piecewise linear learner, called the Delaunay triangulation learner (DTL). Based on the data samples in a p-dimensional feature space, the Delaunay triangulation algorithm provides a unique triangulation of the space. The triangulation separates the convex hull of the samples into a series of disjoint p-simplices, where the samples are the vertices of the p-simplices. The DTL is constructed by fitting the responses through linear interpolation functions on each of the Delaunay simplices, and thus it approximates the whole functional by a piecewise linear function. In its ensemble learning approaches, we propose the bagging DTLs, random crystal and the boosting DTL, where the DTLs are constructed on the subspaces of the features, and the feature interactions are captured by Delaunay triangle meshes. Extensive numerical studies are conducted to compare the proposed DTL and its ensembles with tree-based counterparts. The DTL methods show competitive performances in various settings, and particularly for smooth functionals the DTL demonstrates its superiority over others. Secondly, we propose a collaborative gradient boosting (CGB) algorithm, which can leverage the full potential of boosting multiple types of base learners by using the intrinsic regularization. Inspired by the idea of coopetition (i.e., cooperation and competition), we allow the base learners of different types to collaborate with as well as competing against each other in gradient descent steps. In the CGB algorithm, different types of base learners typically learn from each other toward better model accuracy, i.e., poor-performing base learners would approach better ones. Extensive numerical studies show that the CGB dramatically improves the prediction accuracy by utilizing different types of learners. Our method yields a general strategy that performs better on real data from a wide range of domains. Finally, we introduce a new concept of the average holding price (AHP) in stock market. We show that, under certain assumptions on the investors’ behavior, the average holding price of a stock can be estimated, based on the historical trading prices and volumes. In contrast to the moving average of the stock prices, the AHP can serve as a benchmark for estimating the average profit/loss level of the stock holders. The numerical algorithm for the AHP depends on an recursive equation, which enables us to compute the real-time AHP from any time point of the stock trading. As illustrated in the examples, some trading strategies can also be built upon the AHP. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Machine learning | - |
dc.title | Supervised machine learning with Delaunay triangulation learner and collaborative gradient boosting | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Statistics and Actuarial Science | - |
dc.description.nature | published_or_final_version | - |
dc.date.hkucongregation | 2021 | - |
dc.identifier.mmsid | 991044360596703414 | - |