File Download

There are no files associated with this item.

Supplementary

Conference Paper: A new machine-learning based method to accurately assess copy number variants from whole genome sequencing data and its application on the analysis of the Hirschsprung disease genome

TitleA new machine-learning based method to accurately assess copy number variants from whole genome sequencing data and its application on the analysis of the Hirschsprung disease genome
Authors
KeywordsCopy number/structural variation
Gastrointestinal system
Genome-wide association
Identification of disease genes
Bioinformatics
Issue Date2019
PublisherAmerican Society of Human Genetics.
Citation
American Society of Human Genetics (ASHG) 2019 Annual Meeting, Houston, USA, 15-19 October 2019 How to Cite?
AbstractCopy Number Variants (CNVs) are defined as DNA segments larger than 50 base-pairs with copy number changes. CNVs discovery is essential for uncovering genes/risk factors for a wide range of diseases, including Hirschsprung disease(HSCR; colon aganglionosis). Incidentally, gross deletions encompassing RETor EDNRBlead to the discovery of these two main HSCR genes. Importantly, our previous CNV analysis of HSCR patients identified the association of CNVs encompassing NRG3with HSCR, therefore vindicating a role for CNVs in HSCR. As CNV calling is still a challenge, we have devised and developed a new machine-learning based method CNV-JACG to accurately assess CNVs. Initially, in order to get the most comprehensive CNVs, we used four complementary CNV discovery methods namely CNVnator, Delly, Lumpy and Seeksv to call CNVs. We used 9 in-house trios to produce training dataset, 11 pairs of in-house duplicated samples for validation, and 2 trios from 1000 Genomes Project for evaluation. For deletions, after CNV-JACG assessment, the concordance between each pair of 11 duplicated samples increases from 63% to 88%, and the Mendelian inconsistent rate of the 2 trios decreases from 25% to 6%. In the benchmark sample NA12878, 84.7% of CNV-JACG predicted CNVs are consistent with previously published result. CNV-JACG has been used for the CNV analysis of whole genome sequencing data (pair-end 150bp, ~30X) generated from 443 HSCR patients and 493 matched controls. Here we will present the performance of our new program together with the genome-wide association results.
DescriptionPoster presentation - Session: Bioinformatics and Computational Approaches - no. PgmNr 1470
Persistent Identifierhttp://hdl.handle.net/10722/284264

 

DC FieldValueLanguage
dc.contributor.authorZhuang, X-
dc.contributor.authorYe, R-
dc.contributor.authorSham, PC-
dc.contributor.authorGarcia-Barcelo, MM-
dc.contributor.authorTang, SM-
dc.contributor.authorTam, PKH-
dc.date.accessioned2020-07-20T05:57:21Z-
dc.date.available2020-07-20T05:57:21Z-
dc.date.issued2019-
dc.identifier.citationAmerican Society of Human Genetics (ASHG) 2019 Annual Meeting, Houston, USA, 15-19 October 2019-
dc.identifier.urihttp://hdl.handle.net/10722/284264-
dc.descriptionPoster presentation - Session: Bioinformatics and Computational Approaches - no. PgmNr 1470-
dc.description.abstractCopy Number Variants (CNVs) are defined as DNA segments larger than 50 base-pairs with copy number changes. CNVs discovery is essential for uncovering genes/risk factors for a wide range of diseases, including Hirschsprung disease(HSCR; colon aganglionosis). Incidentally, gross deletions encompassing RETor EDNRBlead to the discovery of these two main HSCR genes. Importantly, our previous CNV analysis of HSCR patients identified the association of CNVs encompassing NRG3with HSCR, therefore vindicating a role for CNVs in HSCR. As CNV calling is still a challenge, we have devised and developed a new machine-learning based method CNV-JACG to accurately assess CNVs. Initially, in order to get the most comprehensive CNVs, we used four complementary CNV discovery methods namely CNVnator, Delly, Lumpy and Seeksv to call CNVs. We used 9 in-house trios to produce training dataset, 11 pairs of in-house duplicated samples for validation, and 2 trios from 1000 Genomes Project for evaluation. For deletions, after CNV-JACG assessment, the concordance between each pair of 11 duplicated samples increases from 63% to 88%, and the Mendelian inconsistent rate of the 2 trios decreases from 25% to 6%. In the benchmark sample NA12878, 84.7% of CNV-JACG predicted CNVs are consistent with previously published result. CNV-JACG has been used for the CNV analysis of whole genome sequencing data (pair-end 150bp, ~30X) generated from 443 HSCR patients and 493 matched controls. Here we will present the performance of our new program together with the genome-wide association results.-
dc.languageeng-
dc.publisherAmerican Society of Human Genetics. -
dc.relation.ispartofAmerican Society of Human Genetics (ASHG) 2019 Annual Meeting-
dc.subjectCopy number/structural variation-
dc.subjectGastrointestinal system-
dc.subjectGenome-wide association-
dc.subjectIdentification of disease genes-
dc.subjectBioinformatics-
dc.titleA new machine-learning based method to accurately assess copy number variants from whole genome sequencing data and its application on the analysis of the Hirschsprung disease genome-
dc.typeConference_Paper-
dc.identifier.emailSham, PC: pcsham@hku.hk-
dc.identifier.emailGarcia-Barcelo, MM: mmgarcia@hku.hk-
dc.identifier.emailTang, SM: claratang@hku.hk-
dc.identifier.emailTam, PKH: paultam@hku.hk-
dc.identifier.authoritySham, PC=rp00459-
dc.identifier.authorityGarcia-Barcelo, MM=rp00445-
dc.identifier.authorityTang, SM=rp02105-
dc.identifier.authorityTam, PKH=rp00060-
dc.identifier.hkuros311104-
dc.publisher.placeUnited States-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats