File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System

TitleE-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System
Authors
Issue Date2019
PublisherACM Press. The Proceedings' web site is located at https://dl.acm.org/citation.cfm?id=3316781&picked=prox
Citation
Proceedings of the 56th Annual Design Automation Conference 2019 (DAC '19), Las Vegas, NV, USA, 2-6 June 2019, article no. 182:1--182:6 How to Cite?
AbstractVarious models with Long Short-Term Memory (LSTM) network have demonstrated prior art performances in sequential information processing. Previous LSTM-specific architectures set large on-chip memory for weight storage to alleviate the memory-bound issue and facilitate the LSTM inference in cloud computing. In this paper, E-LSTM is proposed for embedded scenarios with the consideration of the chip-area and limited data-access bandwidth. The heterogeneous hardware in E-LSTM tightly couples an LSTM co-processor with an embedded RISC-V CPU. The eSELL format is developed to represent the sparse weight matrix. With the proposed cell fusion optimization based on the inherent sparsity in computation, E-LSTM achieves up to 2.2× speedup of processing throughput.
Persistent Identifierhttp://hdl.handle.net/10722/275274
ISBN
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorShi, R-
dc.contributor.authorLiu, J-
dc.contributor.authorSo, HKH-
dc.contributor.authorWang, S-
dc.contributor.authorLiang, Y-
dc.date.accessioned2019-09-10T02:39:11Z-
dc.date.available2019-09-10T02:39:11Z-
dc.date.issued2019-
dc.identifier.citationProceedings of the 56th Annual Design Automation Conference 2019 (DAC '19), Las Vegas, NV, USA, 2-6 June 2019, article no. 182:1--182:6-
dc.identifier.isbn9781450367257-
dc.identifier.urihttp://hdl.handle.net/10722/275274-
dc.description.abstractVarious models with Long Short-Term Memory (LSTM) network have demonstrated prior art performances in sequential information processing. Previous LSTM-specific architectures set large on-chip memory for weight storage to alleviate the memory-bound issue and facilitate the LSTM inference in cloud computing. In this paper, E-LSTM is proposed for embedded scenarios with the consideration of the chip-area and limited data-access bandwidth. The heterogeneous hardware in E-LSTM tightly couples an LSTM co-processor with an embedded RISC-V CPU. The eSELL format is developed to represent the sparse weight matrix. With the proposed cell fusion optimization based on the inherent sparsity in computation, E-LSTM achieves up to 2.2× speedup of processing throughput.-
dc.languageeng-
dc.publisherACM Press. The Proceedings' web site is located at https://dl.acm.org/citation.cfm?id=3316781&picked=prox-
dc.relation.ispartofProceedings of the 56th Annual Design Automation Conference 2019 (DAC '19)-
dc.titleE-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System-
dc.typeConference_Paper-
dc.identifier.emailSo, HKH: hso@eee.hku.hk-
dc.identifier.authoritySo, HKH=rp00169-
dc.identifier.doi10.1145/3316781.3317813-
dc.identifier.scopuseid_2-s2.0-85067788769-
dc.identifier.hkuros304176-
dc.identifier.spage182:1-
dc.identifier.epage182:6-
dc.identifier.isiWOS:000482058200182-
dc.publisher.placeNew York, NY-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats