File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: A multi-task convolutional deep neural network for variant calling in single molecule sequencing

TitleA multi-task convolutional deep neural network for variant calling in single molecule sequencing
Authors
Issue Date2019
PublisherNature Research (part of Springer Nature): Fully open access journals. The Journal's web site is located at http://www.nature.com/ncomms/index.html
Citation
Nature Communications, 2019, v. 10, article no. 998 How to Cite?
AbstractThe accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5–15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2 h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source (https://github.com/aquaskyline/Clairvoyante), with modules to train, utilize and visualize the model.
Persistent Identifierhttp://hdl.handle.net/10722/272475
ISSN
2021 Impact Factor: 17.694
2020 SCImago Journal Rankings: 5.559
PubMed Central ID
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLuo, R-
dc.contributor.authorSedlazeck, FJ-
dc.contributor.authorLam, TW-
dc.contributor.authorSchatz, MC-
dc.date.accessioned2019-07-20T10:43:01Z-
dc.date.available2019-07-20T10:43:01Z-
dc.date.issued2019-
dc.identifier.citationNature Communications, 2019, v. 10, article no. 998-
dc.identifier.issn2041-1723-
dc.identifier.urihttp://hdl.handle.net/10722/272475-
dc.description.abstractThe accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5–15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2 h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source (https://github.com/aquaskyline/Clairvoyante), with modules to train, utilize and visualize the model.-
dc.languageeng-
dc.publisherNature Research (part of Springer Nature): Fully open access journals. The Journal's web site is located at http://www.nature.com/ncomms/index.html-
dc.relation.ispartofNature Communications-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titleA multi-task convolutional deep neural network for variant calling in single molecule sequencing-
dc.typeArticle-
dc.identifier.emailLuo, R: rbluo@cs.hku.hk-
dc.identifier.emailLam, TW: twlam@cs.hku.hk-
dc.identifier.authorityLuo, R=rp02360-
dc.identifier.authorityLam, TW=rp00135-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1038/s41467-019-09025-z-
dc.identifier.pmid30824707-
dc.identifier.pmcidPMC6397153-
dc.identifier.scopuseid_2-s2.0-85062266874-
dc.identifier.hkuros299464-
dc.identifier.volume10-
dc.identifier.spagearticle no. 998-
dc.identifier.epagearticle no. 998-
dc.identifier.isiWOS:000459988600010-
dc.publisher.placeUnited Kingdom-
dc.identifier.issnl2041-1723-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats