File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1093/gigascience/gix129
- Scopus: eid_2-s2.0-85045207122
- PMID: 29267858
- WOS: WOS:000427171400001
Supplementary
- Citations:
- Appears in Collections:
Article: Hybrid-denovo: A de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags
Title | Hybrid-denovo: A de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags |
---|---|
Authors | |
Keywords | 16S rRNA Microbiome OTU picking |
Issue Date | 2018 |
Citation | GigaScience, 2018, v. 7, n. 3, p. 1-7 How to Cite? |
Abstract | Background: Illumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significant low base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large amount of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result in a single-end approach. Results: To have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach. Conclusions: Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data. |
Persistent Identifier | http://hdl.handle.net/10722/324500 |
PubMed Central ID | |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chen, Xianfeng | - |
dc.contributor.author | Johnson, Stephen | - |
dc.contributor.author | Jeraldo, Patricio | - |
dc.contributor.author | Wang, Junwen | - |
dc.contributor.author | Chia, Nicholas | - |
dc.contributor.author | Kocher, Jean Pierre A. | - |
dc.contributor.author | Chen, Jun | - |
dc.date.accessioned | 2023-02-03T07:03:30Z | - |
dc.date.available | 2023-02-03T07:03:30Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | GigaScience, 2018, v. 7, n. 3, p. 1-7 | - |
dc.identifier.uri | http://hdl.handle.net/10722/324500 | - |
dc.description.abstract | Background: Illumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significant low base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large amount of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result in a single-end approach. Results: To have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach. Conclusions: Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data. | - |
dc.language | eng | - |
dc.relation.ispartof | GigaScience | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | 16S rRNA | - |
dc.subject | Microbiome | - |
dc.subject | OTU picking | - |
dc.title | Hybrid-denovo: A de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags | - |
dc.type | Article | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1093/gigascience/gix129 | - |
dc.identifier.pmid | 29267858 | - |
dc.identifier.pmcid | PMC5841375 | - |
dc.identifier.scopus | eid_2-s2.0-85045207122 | - |
dc.identifier.volume | 7 | - |
dc.identifier.issue | 3 | - |
dc.identifier.spage | 1 | - |
dc.identifier.epage | 7 | - |
dc.identifier.eissn | 2047-217X | - |
dc.identifier.isi | WOS:000427171400001 | - |