A novel boosting approach to actuarial ratemaking

Lee, Chun King; 李春景

Appears in Collections:
- HKU Theses Online
- Statistics & Actuarial Science: Theses

postgraduate thesis: A novel boosting approach to actuarial ratemaking

Title	A novel boosting approach to actuarial ratemaking
Authors	Lee, Chun King 李春景
Issue Date	2021
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Lee, C. K. [李春景]. (2021). A novel boosting approach to actuarial ratemaking. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Advanced machine learning techniques have been applied successfully to supervised learning problems in many industries. Actuarial practice stands in contrast: most ratemaking exercises still rely on generalized linear modeling and averaging. This is mainly driven by the technical challenges of deploying machine learning engines in an IT-friendly environment, non-transparent model output, and a lack of regulatory acceptance. Yet because the insurance industry is competitive in many markets, accurate pricing of risks is of utmost importance for insurance companies to stay relevant. This thesis offers a novel and comprehensive approach for actuaries to use boosting in ratemaking, one that addresses accuracy, transparency, robustness, and consistency. These characteristics are crucial for actuaries to fulfill the statement of principles in ratemaking, which requires a rating plan to be reasonable and not excessive, inadequate, or unfairly discriminatory. The contributions of this thesis are threefold. The first part compares traditional generalized linear models with various machine learning alternatives and confirms that machine learning techniques deliver a significant improvement in accuracy, which gives actuaries a strong motivation to adopt the more complex modeling methods. It then introduces delta boosting, a novel boosting mechanism with an actuarial focus, which is proven to be the optimal boosting algorithm for most common loss functions. Poisson regression is known to underestimate the tail risk of the heavy-tailed data commonly observed in claims. In pursuit of distributions that better describe claiming behavior, actuaries have turned to negative binomial, zero-inflated Poisson, and generalized Poisson regressions and found promising improvements.
Part two of this thesis introduces a novel delta boosting implementation of negative binomial and zero-inflated Poisson regression. It establishes the boosting formulae and introduces two approaches to solving for multiple parameters during estimation. In particular, the formula for zero-inflated Poisson regression gives actuaries an intuitive way to separate the perfect state (no propensity to claim) from the non-perfect state. Compared with some popular mechanisms for handling imbalanced data, the resulting algorithm is a one-shot model that requires neither post-modeling recalibration nor judgmental distance metrics. This part also demonstrates the flaws of common approaches to handling partial exposures and suggests that proper handling of exposures can improve prediction accuracy. Boosting and random forests are both members of the ensemble family of algorithms: boosting reduces bias, whereas random forests reduce model variance. The third part of the thesis proposes a novel algorithm called boosting forest, which combines the features of both techniques, and proves its merits. The random forest component allows deeper trees to extract complex interactions in the data and averages out the noise, whereas the boosting component effectively brings the model to the local minimum by reducing bias.
Degree	Doctor of Philosophy
Subject	Boosting (Algorithms); Actuarial science
Dept/Program	Statistics and Actuarial Science
Persistent Identifier	http://hdl.handle.net/10722/325804
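
Illustrative note on the zero-inflated Poisson model mentioned in the abstract. The sketch below is not taken from the thesis and is not delta boosting; it only writes out the textbook zero-inflated Poisson probability mass function to show how a "perfect" state (never claims) mixes with an ordinary Poisson state. The function names, the parameter names (rate, exposure, p_zero), and the convention that the Poisson mean equals rate times exposure while p_zero is held fixed are assumptions made for illustration, not the thesis's treatment of partial exposures.

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """P(N = k) for a Poisson claim count with the given positive mean."""
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))


def zip_pmf(k: int, rate: float, exposure: float, p_zero: float) -> float:
    """P(N = k) under a zero-inflated Poisson model.

    With probability p_zero the policy is in the perfect state and never
    claims; otherwise the count is Poisson with mean rate * exposure
    (an assumed, simplified exposure convention).
    """
    base = (1.0 - p_zero) * poisson_pmf(k, rate * exposure)
    return p_zero + base if k == 0 else base


def zip_neg_log_lik(counts, exposures, rate, p_zero):
    """Negative log-likelihood of observed counts under the model above."""
    return -sum(
        math.log(zip_pmf(k, rate, e, p_zero))
        for k, e in zip(counts, exposures)
    )


if __name__ == "__main__":
    # Hypothetical numbers: both models have mean claim count 0.1.
    zip_tail = 1.0 - zip_pmf(0, 0.2, 1.0, 0.5) - zip_pmf(1, 0.2, 1.0, 0.5)
    poi_tail = 1.0 - poisson_pmf(0, 0.1) - poisson_pmf(1, 0.1)
    print("P(N >= 2), zero-inflated Poisson:", zip_tail)
    print("P(N >= 2), Poisson, same mean:   ", poi_tail)
```

With the mean held equal, the zero-inflated model places more probability both on zero claims and on two or more claims than the plain Poisson does, which is consistent with the abstract's remark that Poisson regression tends to understate tail risk.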

DC Field	Value	Language
dc.contributor.author	Lee, Chun King	-
dc.contributor.author	李春景	-
dc.date.accessioned	2023-03-02T16:32:58Z	-
dc.date.available	2023-03-02T16:32:58Z	-
dc.date.issued	2021	-
dc.identifier.citation	Lee, C. K. [李春景]. (2021). A novel boosting approach to actuarial ratemaking. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/325804	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Boosting (Algorithms)	-
dc.subject.lcsh	Actuarial science	-
dc.title	A novel boosting approach to actuarial ratemaking	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Statistics and Actuarial Science	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2022	-
dc.identifier.mmsid	991044649996703414	-
