File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Analysis of change-point in covariate effects in survival models
Title | Analysis of change-point in covariate effects in survival models |
---|---|
Authors | |
Advisors | Advisor(s):Lam, KF |
Issue Date | 2020 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Lee, C. Y. [李峻賢]. (2020). Analysis of change-point in covariate effects in survival models. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | This thesis focuses on the statistical analysis of survival data where the effect of one continuous explanatory variable on the response function is not linear as in standard generalized linear models. It has wide applications in the field of clinical medicine and epidemiology. The thesis comprises three parts, each devoted to resolving a statistical problem on nonlinear effects encountered in medical applications.
The first part of this thesis focuses on addressing the nuisance nonlinear effect of a continuous explanatory variable. For example, age is known to be an important prognostic factor in many cancer studies. The age effect on the risk is more or less constant and stays at a low level for young patients aged 60 or below, but it increases with age for patients aged above 60. While the primary interest of most cancer studies is the treatment effect, a valid statistical inference procedure, that appropriately adjusts for the nonlinear effect of the nuisance factor age, is warranted. A class of partly linear models is considered to address this problem.
The nuisance nonlinear effect can be estimated nonparametrically using polynomial splines as in most partly linear models. In medicine, the severity of some diseases are often characterized by some proxy measures, like obesity characterized by BMI, cardiovascular diseases characterized by diastolic blood pressure, diabetes mellitus characterized by blood sugar level. The disease status classifications are for preliminary diagnostic purposes based on previous experience that subjects with a measurement exceeding the threshold value is of strong evidence of having the disease. These threshold values are often called change-points in statistical models in a sense that the trends of the effects of measurements are very different before and after the change-points. Unlike the first part, the change-point associated factor is not nuisance while statistical inference on the change-points is the main objective of the study. The second part of the thesis probes into testing for the existence of a change-point in the effect of a continuous surrogate measurement. Three tests are proposed and their performance are studied both theoretically and empirically.
For multi-level disease status classifications, the work above is extended to study the multiple change-points problem. For example, normal fasting blood glucose level for non-diabetics is between the two change-points 70 and 130 mg/dL. Confidence interval estimation method for the change-point locations is not available in the literature. The third part of the thesis aims to fill the research gap by proposing a sequential test for determining the optimal number of change-points in the effect of the surrogate measurement on the risk function. Given the optimal number of change-points, a bootstrap method is proposed for confidence interval estimation of the change-point locations. |
Degree | Doctor of Philosophy |
Subject | Survival analysis (Biometry) |
Dept/Program | Statistics and Actuarial Science |
Persistent Identifier | http://hdl.handle.net/10722/297531 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Lam, KF | - |
dc.contributor.author | Lee, Chun Yin | - |
dc.contributor.author | 李峻賢 | - |
dc.date.accessioned | 2021-03-21T11:38:02Z | - |
dc.date.available | 2021-03-21T11:38:02Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | Lee, C. Y. [李峻賢]. (2020). Analysis of change-point in covariate effects in survival models. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/297531 | - |
dc.description.abstract | This thesis focuses on the statistical analysis of survival data where the effect of one continuous explanatory variable on the response function is not linear as in standard generalized linear models. It has wide applications in the field of clinical medicine and epidemiology. The thesis comprises three parts, each devoted to resolving a statistical problem on nonlinear effects encountered in medical applications. The first part of this thesis focuses on addressing the nuisance nonlinear effect of a continuous explanatory variable. For example, age is known to be an important prognostic factor in many cancer studies. The age effect on the risk is more or less constant and stays at a low level for young patients aged 60 or below, but it increases with age for patients aged above 60. While the primary interest of most cancer studies is the treatment effect, a valid statistical inference procedure, that appropriately adjusts for the nonlinear effect of the nuisance factor age, is warranted. A class of partly linear models is considered to address this problem. The nuisance nonlinear effect can be estimated nonparametrically using polynomial splines as in most partly linear models. In medicine, the severity of some diseases are often characterized by some proxy measures, like obesity characterized by BMI, cardiovascular diseases characterized by diastolic blood pressure, diabetes mellitus characterized by blood sugar level. The disease status classifications are for preliminary diagnostic purposes based on previous experience that subjects with a measurement exceeding the threshold value is of strong evidence of having the disease. These threshold values are often called change-points in statistical models in a sense that the trends of the effects of measurements are very different before and after the change-points. Unlike the first part, the change-point associated factor is not nuisance while statistical inference on the change-points is the main objective of the study. The second part of the thesis probes into testing for the existence of a change-point in the effect of a continuous surrogate measurement. Three tests are proposed and their performance are studied both theoretically and empirically. For multi-level disease status classifications, the work above is extended to study the multiple change-points problem. For example, normal fasting blood glucose level for non-diabetics is between the two change-points 70 and 130 mg/dL. Confidence interval estimation method for the change-point locations is not available in the literature. The third part of the thesis aims to fill the research gap by proposing a sequential test for determining the optimal number of change-points in the effect of the surrogate measurement on the risk function. Given the optimal number of change-points, a bootstrap method is proposed for confidence interval estimation of the change-point locations. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Survival analysis (Biometry) | - |
dc.title | Analysis of change-point in covariate effects in survival models | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Statistics and Actuarial Science | - |
dc.description.nature | published_or_final_version | - |
dc.date.hkucongregation | 2020 | - |
dc.identifier.mmsid | 991044351381003414 | - |