File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.isci.2020.101128
- Scopus: eid_2-s2.0-85084479170
- PMID: 32422594
- WOS: WOS:000536743600083
- Find via

Supplementary
- Citations:
- Appears in Collections:
Article: CONNET: Accurate Genome Consensus in Assembling Nanopore Sequencing Data via Deep Learning
| Title | CONNET: Accurate Genome Consensus in Assembling Nanopore Sequencing Data via Deep Learning |
|---|---|
| Authors | |
| Keywords | Bioinformatics Genomics Sequence Analysis |
| Issue Date | 2020 |
| Publisher | Cell Press. The Journal's web site is located at https://www.cell.com/iscience.home |
| Citation | iScience, 2020, v. 23, p. article no. 101128 How to Cite? |
| Abstract | Single-molecule sequencing technologies produce much longer reads compared with next-generation sequencing, greatly improving the contiguity of de novo as- sembly of genomes. However, the relatively high error rates in long reads make it challenging to obtain high-quality assemblies. A computationally intensive consensus step is needed to resolve the discrepancies in the reads. Efficient consensus tools have emerged in the recent past, based on partial-order align- ment. In this study, we discovered that the spatial relationship of alignment pileup is crucial to high-quality consensus and developed a deep learning-based consensus tool, CONNET, which outperforms the fastest tools in terms of both accuracy and speed. We tested CONNET using a 903 dataset of E. coli and a 373 human dataset. In addition to achieving high-quality consensus results, CONNET is capable of delivering phased diploid genome consensus. Diploid consensus on the above-mentioned human assembly further reduced 12% of the consensus errors made in the haploid results. |
| Persistent Identifier | http://hdl.handle.net/10722/284227 |
| ISSN | 2023 Impact Factor: 4.6 2023 SCImago Journal Rankings: 1.497 |
| PubMed Central ID | |
| ISI Accession Number ID |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Zhang, Y | - |
| dc.contributor.author | Liu, CM | - |
| dc.contributor.author | Leung, HCM | - |
| dc.contributor.author | Luo, R | - |
| dc.contributor.author | Lam, TW | - |
| dc.date.accessioned | 2020-07-20T05:57:04Z | - |
| dc.date.available | 2020-07-20T05:57:04Z | - |
| dc.date.issued | 2020 | - |
| dc.identifier.citation | iScience, 2020, v. 23, p. article no. 101128 | - |
| dc.identifier.issn | 2589-0042 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/284227 | - |
| dc.description.abstract | Single-molecule sequencing technologies produce much longer reads compared with next-generation sequencing, greatly improving the contiguity of de novo as- sembly of genomes. However, the relatively high error rates in long reads make it challenging to obtain high-quality assemblies. A computationally intensive consensus step is needed to resolve the discrepancies in the reads. Efficient consensus tools have emerged in the recent past, based on partial-order align- ment. In this study, we discovered that the spatial relationship of alignment pileup is crucial to high-quality consensus and developed a deep learning-based consensus tool, CONNET, which outperforms the fastest tools in terms of both accuracy and speed. We tested CONNET using a 903 dataset of E. coli and a 373 human dataset. In addition to achieving high-quality consensus results, CONNET is capable of delivering phased diploid genome consensus. Diploid consensus on the above-mentioned human assembly further reduced 12% of the consensus errors made in the haploid results. | - |
| dc.language | eng | - |
| dc.publisher | Cell Press. The Journal's web site is located at https://www.cell.com/iscience.home | - |
| dc.relation.ispartof | iScience | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.subject | Bioinformatics | - |
| dc.subject | Genomics | - |
| dc.subject | Sequence Analysis | - |
| dc.title | CONNET: Accurate Genome Consensus in Assembling Nanopore Sequencing Data via Deep Learning | - |
| dc.type | Article | - |
| dc.identifier.email | Liu, CM: imcx@hku.hk | - |
| dc.identifier.email | Leung, HCM: cmleung3@hku.hk | - |
| dc.identifier.email | Luo, R: rbluo@cs.hku.hk | - |
| dc.identifier.email | Lam, TW: twlam@cs.hku.hk | - |
| dc.identifier.authority | Leung, HCM=rp00144 | - |
| dc.identifier.authority | Luo, R=rp02360 | - |
| dc.identifier.authority | Lam, TW=rp00135 | - |
| dc.description.nature | published_or_final_version | - |
| dc.identifier.doi | 10.1016/j.isci.2020.101128 | - |
| dc.identifier.pmid | 32422594 | - |
| dc.identifier.pmcid | PMC7229283 | - |
| dc.identifier.scopus | eid_2-s2.0-85084479170 | - |
| dc.identifier.hkuros | 310900 | - |
| dc.identifier.volume | 23 | - |
| dc.identifier.spage | article no. 101128 | - |
| dc.identifier.epage | article no. 101128 | - |
| dc.identifier.isi | WOS:000536743600083 | - |
| dc.publisher.place | United States | - |
| dc.identifier.issnl | 2589-0042 | - |
