File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Nutrition data science for lung cancer prevention and macrotrends
Title | Nutrition data science for lung cancer prevention and macrotrends |
---|---|
Authors | |
Advisors | |
Issue Date | 2022 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Tao, J. [陶鋆]. (2022). Nutrition data science for lung cancer prevention and macrotrends. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | According to the GLOBOCAN estimates, approximately 1.8 million deaths are attributable to lung cancer annually. While smoking is a well-known risk factor for lung cancer, some observational studies have identified associations between nutrition factors and lung cancer risk. However, the results are not always consistent. This thesis uncovers the association between nutrition factors and lung cancer using multiple data sources from the single nutrient, the gene-diet interaction and the macrotrend perspectives.
From the single nutrient perspective, the role of dietary carbohydrates on lung cancer risk was investigated using data from the PLCO Cancer Screening Trial. Dietary carbohydrates and their food sources were categorized into quartiles by quantity and the lowest quartile was set as the reference. Multivariable binary logistic model, multinomial logistic model, and Cox regression model with time-varying coefficients were used to study the associations between dietary carbohydrates and lung cancer risk. The results showed that low-GI carbohydrates from fruits, vegetables and whole grains protected against lung cancer, while refined carbohydrates from soft drinks increased the risk of lung cancer.
The association between dietary sodium and lung cancer risk was explored by multivariable Cox regression model using data from the PLCO Trial and the NIH-AARP cohort. The results indicated an inverse association between moderate sodium intake and lung cancer risk. The association needs further validation before inviting reflection on the sodium restriction policy.
From the gene-diet interaction perspective, current statistical methods include the constrained maximum likelihood estimation, the unconstrained maximum likelihood estimation, and the empirical Bayesian (EB) estimation. The realistic simulation was conducted based on the single nucleotide polymorphisms (SNPs) from the PLCO GWAS data. The operating characteristics of the three statistical methods were tested in the simulated datasets. The results showed that the EB method could trade-off the type I error and statistical power for gene-diet interaction analysis. In the PLCO GWAS dataset, SNP rs7175421 was found to significantly interact with whole grain intake for lung cancer risk by the EB method. Future research can consider our workflow to select the appropriate statistical method for gene-diet interaction studies.
From the macrotrend perspective, the global data of dietary factors, country income, and lung cancer incidence were extracted from the Global Dietary Database, the World Bank DataBank, and the 2019 Global Burden of Disease Study. Linear mixed model was used to study the trends of dietary sodium, fiber and whole grains by country income levels from 1990 to 2018. In addition, the linear trends of dietary sodium and lung cancer incidence were modelled by linear regression for individual countries. The study found increasing trends of dietary sodium, fiber and whole grains in middle-income countries. Moreover, none of the country-level income groups demonstrated strong agreement on trends of dietary sodium and lung cancer incidence.
The dissertation investigates the relationship between dietary factors and lung cancer incidence by harnessing a holistic view of individual diet, gene-diet interaction, and dietary macrotrends at the country level. This work leverages multiple data sources to conduct nutrition data science investigations for lung cancer prevention. |
Degree | Doctor of Philosophy |
Subject | Lungs - Cancer - Nutritional aspects Lungs - Cancer - Prevention Carbohydrates |
Dept/Program | Public Health |
Persistent Identifier | http://hdl.handle.net/10722/335137 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Lam, WWT | - |
dc.contributor.advisor | Pang, HMH | - |
dc.contributor.author | Tao, Jun | - |
dc.contributor.author | 陶鋆 | - |
dc.date.accessioned | 2023-11-13T07:44:51Z | - |
dc.date.available | 2023-11-13T07:44:51Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Tao, J. [陶鋆]. (2022). Nutrition data science for lung cancer prevention and macrotrends. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/335137 | - |
dc.description.abstract | According to the GLOBOCAN estimates, approximately 1.8 million deaths are attributable to lung cancer annually. While smoking is a well-known risk factor for lung cancer, some observational studies have identified associations between nutrition factors and lung cancer risk. However, the results are not always consistent. This thesis uncovers the association between nutrition factors and lung cancer using multiple data sources from the single nutrient, the gene-diet interaction and the macrotrend perspectives. From the single nutrient perspective, the role of dietary carbohydrates on lung cancer risk was investigated using data from the PLCO Cancer Screening Trial. Dietary carbohydrates and their food sources were categorized into quartiles by quantity and the lowest quartile was set as the reference. Multivariable binary logistic model, multinomial logistic model, and Cox regression model with time-varying coefficients were used to study the associations between dietary carbohydrates and lung cancer risk. The results showed that low-GI carbohydrates from fruits, vegetables and whole grains protected against lung cancer, while refined carbohydrates from soft drinks increased the risk of lung cancer. The association between dietary sodium and lung cancer risk was explored by multivariable Cox regression model using data from the PLCO Trial and the NIH-AARP cohort. The results indicated an inverse association between moderate sodium intake and lung cancer risk. The association needs further validation before inviting reflection on the sodium restriction policy. From the gene-diet interaction perspective, current statistical methods include the constrained maximum likelihood estimation, the unconstrained maximum likelihood estimation, and the empirical Bayesian (EB) estimation. The realistic simulation was conducted based on the single nucleotide polymorphisms (SNPs) from the PLCO GWAS data. The operating characteristics of the three statistical methods were tested in the simulated datasets. The results showed that the EB method could trade-off the type I error and statistical power for gene-diet interaction analysis. In the PLCO GWAS dataset, SNP rs7175421 was found to significantly interact with whole grain intake for lung cancer risk by the EB method. Future research can consider our workflow to select the appropriate statistical method for gene-diet interaction studies. From the macrotrend perspective, the global data of dietary factors, country income, and lung cancer incidence were extracted from the Global Dietary Database, the World Bank DataBank, and the 2019 Global Burden of Disease Study. Linear mixed model was used to study the trends of dietary sodium, fiber and whole grains by country income levels from 1990 to 2018. In addition, the linear trends of dietary sodium and lung cancer incidence were modelled by linear regression for individual countries. The study found increasing trends of dietary sodium, fiber and whole grains in middle-income countries. Moreover, none of the country-level income groups demonstrated strong agreement on trends of dietary sodium and lung cancer incidence. The dissertation investigates the relationship between dietary factors and lung cancer incidence by harnessing a holistic view of individual diet, gene-diet interaction, and dietary macrotrends at the country level. This work leverages multiple data sources to conduct nutrition data science investigations for lung cancer prevention. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Lungs - Cancer - Nutritional aspects | - |
dc.subject.lcsh | Lungs - Cancer - Prevention | - |
dc.subject.lcsh | Carbohydrates | - |
dc.title | Nutrition data science for lung cancer prevention and macrotrends | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Public Health | - |
dc.description.nature | published_or_final_version | - |
dc.date.hkucongregation | 2023 | - |
dc.identifier.mmsid | 991044625587803414 | - |