File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TCSII.2022.3233396
- Scopus: eid_2-s2.0-85147219775
- WOS: WOS:001000301200013
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: An ADC-Less RRAM-Based Computing-in-Memory Macro With Binary CNN for Efficient Edge AI
Title | An ADC-Less RRAM-Based Computing-in-Memory Macro With Binary CNN for Efficient Edge AI |
---|---|
Authors | |
Keywords | 3T2R ADC free binary CNN computing-in-memory energy-efficient hardware-implementation RRAM |
Issue Date | 2-Jan-2023 |
Publisher | Institute of Electrical and Electronics Engineers |
Citation | IEEE Transactions on Circuits and Systems II: Express Briefs, 2023, v. 70, n. 6, p. 1871-1875 How to Cite? |
Abstract | Resistive random-access memory (RRAM) based non-volatile computing-in-memory (nvCIM) has been regarded as a promising solution to enable efficient data-intensive artificial intelligence (AI) applications on resource-limited edge systems. However, existing weighted-current summation-based nvCIM suffers from device non-idealities and significant time, storage, and energy overheads when processing high-precision analog signals. To address these issues, we propose a 3T2R digital nvCIM macro for a fully hardware-implemented binary convolutional neural network (HBCNN), focusing on accelerating edge AI applications at low weight precision. By quantizing the voltage-division results of RRAMs through inverters, the 3T2R macro provides a stable rail-to-rail output without analog-to-digital converters or sensing amplifiers. Moreover, both batch normalization and sign activation are integrated on-chip. The hybrid simulation results show that the proposed 3T2R digital macro achieves an 86.2% (95.6%) accuracy on the CIFAR-10 (MNIST) dataset, corresponding to a 4.7% (1.9%) accuracy loss compared to the software baselines, which also feature a peak energy efficiency of 51.3 TOPS/W and a minimum latency of 8 ns, realizing an energy-efficient, low-latency, and robust AI processor. |
Persistent Identifier | http://hdl.handle.net/10722/340962 |
ISSN | 2023 Impact Factor: 4.0 2023 SCImago Journal Rankings: 1.523 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Li, Yi | - |
dc.contributor.author | Chen, Jia | - |
dc.contributor.author | Wang, Linfang | - |
dc.contributor.author | Zhang, Woyu | - |
dc.contributor.author | Guo, Zeyu | - |
dc.contributor.author | Wang, Jun | - |
dc.contributor.author | Han, Yongkang | - |
dc.contributor.author | Li, Zhi | - |
dc.contributor.author | Wang, Fei | - |
dc.contributor.author | Dou, Chunmeng | - |
dc.contributor.author | Xu, Xiaoxin | - |
dc.contributor.author | Yang, Jianguo | - |
dc.contributor.author | Wang, Zhongrui | - |
dc.contributor.author | Shang, Dashan | - |
dc.date.accessioned | 2024-03-11T10:48:37Z | - |
dc.date.available | 2024-03-11T10:48:37Z | - |
dc.date.issued | 2023-01-02 | - |
dc.identifier.citation | IEEE Transactions on Circuits and Systems II: Express Briefs, 2023, v. 70, n. 6, p. 1871-1875 | - |
dc.identifier.issn | 1549-7747 | - |
dc.identifier.uri | http://hdl.handle.net/10722/340962 | - |
dc.description.abstract | <p>Resistive random-access memory (RRAM) based non-volatile computing-in-memory (nvCIM) has been regarded as a promising solution to enable efficient data-intensive artificial intelligence (AI) applications on resource-limited edge systems. However, existing weighted-current summation-based nvCIM suffers from device non-idealities and significant time, storage, and energy overheads when processing high-precision analog signals. To address these issues, we propose a 3T2R digital nvCIM macro for a fully hardware-implemented binary convolutional neural network (HBCNN), focusing on accelerating edge AI applications at low weight precision. By quantizing the voltage-division results of RRAMs through inverters, the 3T2R macro provides a stable rail-to-rail output without analog-to-digital converters or sensing amplifiers. Moreover, both batch normalization and sign activation are integrated on-chip. The hybrid simulation results show that the proposed 3T2R digital macro achieves an 86.2% (95.6%) accuracy on the CIFAR-10 (MNIST) dataset, corresponding to a 4.7% (1.9%) accuracy loss compared to the software baselines, which also feature a peak energy efficiency of 51.3 TOPS/W and a minimum latency of 8 ns, realizing an energy-efficient, low-latency, and robust AI processor.<br></p> | - |
dc.language | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers | - |
dc.relation.ispartof | IEEE Transactions on Circuits and Systems II: Express Briefs | - |
dc.subject | 3T2R | - |
dc.subject | ADC free | - |
dc.subject | binary CNN | - |
dc.subject | computing-in-memory | - |
dc.subject | energy-efficient | - |
dc.subject | hardware-implementation | - |
dc.subject | RRAM | - |
dc.title | An ADC-Less RRAM-Based Computing-in-Memory Macro With Binary CNN for Efficient Edge AI | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/TCSII.2022.3233396 | - |
dc.identifier.scopus | eid_2-s2.0-85147219775 | - |
dc.identifier.volume | 70 | - |
dc.identifier.issue | 6 | - |
dc.identifier.spage | 1871 | - |
dc.identifier.epage | 1875 | - |
dc.identifier.eissn | 1558-3791 | - |
dc.identifier.isi | WOS:001000301200013 | - |
dc.identifier.issnl | 1549-7747 | - |