File Download
Supplementary

postgraduate thesis: Genetic landscape of copy number variations in congenital complex diseases

TitleGenetic landscape of copy number variations in congenital complex diseases
Authors
Issue Date2021
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Zhuang, X. [庄雪寒]. (2021). Genetic landscape of copy number variations in congenital complex diseases. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractCopy number variations (CNVs) are imbalanced structural variants characterized by alterations in the number of copies of DNA segments larger than 50bp. CNVs can disrupt coding sequences, alter gene dosage, and affect gene regulation. They are known to underlie Mendelian diseases and represent an essential portion of missing heritability in human complex diseases. It is therefore crucial to uncover the disease-associated CNVs and affected genes; however, CNV detection by current tools is prone to error and evaluating CNV accuracy remains challenging. Tetralogy of Fallot (TOF) and Hirschsprung disease (HSCR) are two rare congenital disorders, with the nonsyndromic form serving as models of complex diseases. Previous studies based on microarray or target sequencing technologies have discovered many CNVs associated with TOF or HSCR. Nevertheless, the genetic role of CNVs in these two complex diseases remains largely uncertain. To further explore the contribution of CNVs to TOF and HSCR susceptibility, especially those encompassing non-coding regulatory regions and those with small size (≤1kb) that were largely underestimated in previous studies due to technical limitations, two CNV studies were performed based on the whole genome sequencing (WGS) data of 150 nonsyndromic TOF case-parent trios, as well as the WGS data of 443 short-segment HSCR patients and 493 ethnically matched controls, respectively. CNVs were detected based on a newly developed framework, CNV-JACG, which was created for Judging the Accuracy of CNVs and Genotyping using paired-end WGS data. It is based on a random forest model trained on 21 distinctive features characterizing the CNV region and its breakpoints. CNV-JACG was demonstrated to have superior sensitivity and precision compared to the latest method, particularly for the small CNVs (≤1kb). The de novo CNV rate (including mosaic) in this TOF cohort was comparable with previous reports. In particular, a few candidate genes (HAND2, SORBS2, SMCHD1, MYOM1, PTPRM) were discovered through the integration of epigenomic and expression data. Among the rare inherited CNVs, compound heterozygous variants consisting of rare inherited deletions and missense point mutations were also identified wherein the well-documented congenital heart disease associated gene DNAH11 was involved. Four large-scale rare CNVs were uniquely identified in HSCR patients, three of them disrupted known enteric nervous system associated genes, EDNRB, SPRY2 and UCHL1. The category-wide association study found that duplications encompassing the enhancer/H3K27ac regions of the neural crest cell expressed/constrained genes were enriched in HSCR patients. Through new tool development and CNV analysis of both family and population WGS data, this thesis presents a useful tool for assessing the accuracy of CNVs to meet the ever-growing needs for uncovering the missing heritability linked to CNVs. The CNVs and affected genes discovered in this study extend the current understanding of the contribution of CNVs to TOF and HSCR susceptibility, with the potential of informing therapeutic decision-making.
DegreeDoctor of Philosophy
SubjectTetralogy of Fallot - Genetic aspects
Hirschsprung's disease - Genetic aspects
Dept/ProgramSurgery
Persistent Identifierhttp://hdl.handle.net/10722/299164

 

DC FieldValueLanguage
dc.contributor.authorZhuang, Xuehan-
dc.contributor.author庄雪寒-
dc.date.accessioned2021-04-29T02:24:26Z-
dc.date.available2021-04-29T02:24:26Z-
dc.date.issued2021-
dc.identifier.citationZhuang, X. [庄雪寒]. (2021). Genetic landscape of copy number variations in congenital complex diseases. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/299164-
dc.description.abstractCopy number variations (CNVs) are imbalanced structural variants characterized by alterations in the number of copies of DNA segments larger than 50bp. CNVs can disrupt coding sequences, alter gene dosage, and affect gene regulation. They are known to underlie Mendelian diseases and represent an essential portion of missing heritability in human complex diseases. It is therefore crucial to uncover the disease-associated CNVs and affected genes; however, CNV detection by current tools is prone to error and evaluating CNV accuracy remains challenging. Tetralogy of Fallot (TOF) and Hirschsprung disease (HSCR) are two rare congenital disorders, with the nonsyndromic form serving as models of complex diseases. Previous studies based on microarray or target sequencing technologies have discovered many CNVs associated with TOF or HSCR. Nevertheless, the genetic role of CNVs in these two complex diseases remains largely uncertain. To further explore the contribution of CNVs to TOF and HSCR susceptibility, especially those encompassing non-coding regulatory regions and those with small size (≤1kb) that were largely underestimated in previous studies due to technical limitations, two CNV studies were performed based on the whole genome sequencing (WGS) data of 150 nonsyndromic TOF case-parent trios, as well as the WGS data of 443 short-segment HSCR patients and 493 ethnically matched controls, respectively. CNVs were detected based on a newly developed framework, CNV-JACG, which was created for Judging the Accuracy of CNVs and Genotyping using paired-end WGS data. It is based on a random forest model trained on 21 distinctive features characterizing the CNV region and its breakpoints. CNV-JACG was demonstrated to have superior sensitivity and precision compared to the latest method, particularly for the small CNVs (≤1kb). The de novo CNV rate (including mosaic) in this TOF cohort was comparable with previous reports. In particular, a few candidate genes (HAND2, SORBS2, SMCHD1, MYOM1, PTPRM) were discovered through the integration of epigenomic and expression data. Among the rare inherited CNVs, compound heterozygous variants consisting of rare inherited deletions and missense point mutations were also identified wherein the well-documented congenital heart disease associated gene DNAH11 was involved. Four large-scale rare CNVs were uniquely identified in HSCR patients, three of them disrupted known enteric nervous system associated genes, EDNRB, SPRY2 and UCHL1. The category-wide association study found that duplications encompassing the enhancer/H3K27ac regions of the neural crest cell expressed/constrained genes were enriched in HSCR patients. Through new tool development and CNV analysis of both family and population WGS data, this thesis presents a useful tool for assessing the accuracy of CNVs to meet the ever-growing needs for uncovering the missing heritability linked to CNVs. The CNVs and affected genes discovered in this study extend the current understanding of the contribution of CNVs to TOF and HSCR susceptibility, with the potential of informing therapeutic decision-making.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshTetralogy of Fallot - Genetic aspects-
dc.subject.lcshHirschsprung's disease - Genetic aspects-
dc.titleGenetic landscape of copy number variations in congenital complex diseases-
dc.typePG_Thesis-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineSurgery-
dc.description.naturepublished_or_final_version-
dc.date.hkucongregation2021-
dc.identifier.mmsid991044362001003414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats