File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1186/1471-2105-16-S7-S10
- Scopus: eid_2-s2.0-84977534075
- PMID: 25952019
- WOS: WOS:000353976600010
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)
Title | MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC) |
---|---|
Authors | |
Issue Date | 2015 |
Citation | BMC Bioinformatics, 2015, v. 16 n. suppl. 7, article no. S10 How to Cite? |
Abstract | Background: Short-read aligners have recently gained a lot of speed by exploiting the massive parallelism of GPU. An uprising alterative to GPU is Intel MIC; supercomputers like Tianhe-2, currently top of TOP500, is built with 48,000 MIC boards to offer ~55 PFLOPS. The CPU-like architecture of MIC allows CPU-based software to be parallelized easily; however, the performance is often inferior to GPU counterparts as an MIC card contains only ~60 cores (while a GPU card typically has over a thousand cores). Results: To better utilize MIC-enabled computers for NGS data analysis, we developed a new short-read aligner MICA that is optimized in view of MIC's limitation and the extra parallelism inside each MIC core. By utilizing the 512-bit vector units in the MIC and implementing a new seeding strategy, experiments on aligning 150 bp paired-end reads show that MICA using one MIC card is 4.9 times faster than BWA-MEM (using 6 cores of a top-end CPU), and slightly faster than SOAP3-dp (using a GPU). Furthermore, MICA's simplicity allows very efficient scale-up when multiple MIC cards are used in a node (3 cards give a 14.1-fold speedup over BWA-MEM). Summary: MICA can be readily used by MIC-enabled supercomputers for production purpose. We have tested MICA on Tianhe-2 with 90 WGS samples (17.47 Tera-bases), which can be aligned in an hour using 400 nodes. MICA has impressive performance even though MIC is only in its initial stage of development. Availability and implementation: MICA's source code is freely available at http://sourceforge.net/projects/mica-aligner under GPL v3. Supplementary information: Supplementary information is available as "Additional File 1". Datasets are available at www.bio8.cs.hku.hk/dataset/mica. |
Persistent Identifier | http://hdl.handle.net/10722/215514 |
ISSN | 2023 Impact Factor: 2.9 2023 SCImago Journal Rankings: 1.005 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | LUO, R | - |
dc.contributor.author | CHAN, SH | - |
dc.contributor.author | LAW, WC | - |
dc.contributor.author | He, G | - |
dc.contributor.author | Yu, C | - |
dc.contributor.author | LIU, CM | - |
dc.contributor.author | Zhou, D | - |
dc.contributor.author | Li, Y | - |
dc.contributor.author | Li, R | - |
dc.contributor.author | Wang, J | - |
dc.contributor.author | Zhu, X | - |
dc.contributor.author | Peng, S | - |
dc.contributor.author | Lam, TW | - |
dc.date.accessioned | 2015-08-21T13:28:36Z | - |
dc.date.available | 2015-08-21T13:28:36Z | - |
dc.date.issued | 2015 | - |
dc.identifier.citation | BMC Bioinformatics, 2015, v. 16 n. suppl. 7, article no. S10 | - |
dc.identifier.issn | 1471-2105 | - |
dc.identifier.uri | http://hdl.handle.net/10722/215514 | - |
dc.description.abstract | Background: Short-read aligners have recently gained a lot of speed by exploiting the massive parallelism of GPU. An uprising alterative to GPU is Intel MIC; supercomputers like Tianhe-2, currently top of TOP500, is built with 48,000 MIC boards to offer ~55 PFLOPS. The CPU-like architecture of MIC allows CPU-based software to be parallelized easily; however, the performance is often inferior to GPU counterparts as an MIC card contains only ~60 cores (while a GPU card typically has over a thousand cores). Results: To better utilize MIC-enabled computers for NGS data analysis, we developed a new short-read aligner MICA that is optimized in view of MIC's limitation and the extra parallelism inside each MIC core. By utilizing the 512-bit vector units in the MIC and implementing a new seeding strategy, experiments on aligning 150 bp paired-end reads show that MICA using one MIC card is 4.9 times faster than BWA-MEM (using 6 cores of a top-end CPU), and slightly faster than SOAP3-dp (using a GPU). Furthermore, MICA's simplicity allows very efficient scale-up when multiple MIC cards are used in a node (3 cards give a 14.1-fold speedup over BWA-MEM). Summary: MICA can be readily used by MIC-enabled supercomputers for production purpose. We have tested MICA on Tianhe-2 with 90 WGS samples (17.47 Tera-bases), which can be aligned in an hour using 400 nodes. MICA has impressive performance even though MIC is only in its initial stage of development. Availability and implementation: MICA's source code is freely available at http://sourceforge.net/projects/mica-aligner under GPL v3. Supplementary information: Supplementary information is available as "Additional File 1". Datasets are available at www.bio8.cs.hku.hk/dataset/mica. | - |
dc.language | eng | - |
dc.relation.ispartof | BMC Bioinformatics | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.title | MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC) | - |
dc.type | Article | - |
dc.identifier.email | He, G: gzhe2@HKUCC-COM.hku.hk | - |
dc.identifier.email | Lam, TW: twlam@cs.hku.hk | - |
dc.identifier.authority | Lam, TW=rp00135 | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1186/1471-2105-16-S7-S10 | - |
dc.identifier.pmid | 25952019 | - |
dc.identifier.scopus | eid_2-s2.0-84977534075 | - |
dc.identifier.hkuros | 248453 | - |
dc.identifier.volume | 16 | - |
dc.identifier.issue | suppl. 7 | - |
dc.identifier.spage | article no. S10 | - |
dc.identifier.epage | article no. S10 | - |
dc.identifier.isi | WOS:000353976600010 | - |
dc.identifier.issnl | 1471-2105 | - |