File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)

Article: WireLightning: Harnessing Capacitances for In-Transit Massively Parallel Matrix Multiplication

TitleWireLightning: Harnessing Capacitances for In-Transit Massively Parallel Matrix Multiplication
Authors
KeywordsAI accelerator
capacitive computing
energy-efficient hardware
matrix multiplication
Mixed signal integrated circuits
multiply accumulate circuits
Issue Date1-Sep-2025
PublisherInstitute of Electrical and Electronics Engineers
Citation
IEEE Transactions on Circuits and Systems I: Regular Papers, 2025 How to Cite?
Abstract

Analog computing-in-memory accelerators promise ultra-low-power, on-device AI by reducing data transfer and energy usage. Yet inherent device variations and high energy consumption for analog-digital conversion continue to hinder their wide-scale adoption in mainstream systems. To address these issues, this paper introduces WireLightning, a novel capacitive-computing accelerator featuring a mixed-signal architecture that rethinks analog AI acceleration. Unlike conventional analog crossbars that encode weights in programmable devices, WireLightning exploits intrinsic charge dynamics in passive capacitors, encoding matrix multiplication through spike amplitude and timing. This design addresses critical limitations such as weight drift, stochasticity, and power-intensive ADC bottlenecks. Key innovations include: amplitude-temporal dual encoding that enables constant-time analog dot-products; time-based decoding scheme that significantly reduces reliance on power-intensive ADCs; row-wise parallel architecture for concurrent dot-product calculations across multiple rows to enhance throughput; and value repetition exploitation in low-bit quantized vectors to reduce multiplications to constant time complexity. A PCB pro totype achieved a range-normalized RMSE of 1.80% - 69.2% of the error in RRAM crossbar implementations - and a normalized error of 9.18%, corresponding to 77.1% of that in leading PCM crossbars. Implemented in a 40-nm CMOS technology, WireLightning macro delivers 465.47 TOPS/W at 4-bit precision, outperforming state-of-the-art analog accelerators while maintaining 0.23% range-normalized RMSE. By integrating algorithm-circuit co-design with physical computing, this work establishes capacitive computing as a promising path toward combining digital precision and analog efficiency in next-generation edge AI. 


Persistent Identifierhttp://hdl.handle.net/10722/368339
ISSN
2023 Impact Factor: 5.2
2023 SCImago Journal Rankings: 1.836

 

DC FieldValueLanguage
dc.contributor.authorWang, Song-
dc.contributor.authorWang, Zhu-
dc.contributor.authorLi, Can-
dc.contributor.authorSo, Hayden Kwok Hay-
dc.date.accessioned2025-12-24T00:37:44Z-
dc.date.available2025-12-24T00:37:44Z-
dc.date.issued2025-09-01-
dc.identifier.citationIEEE Transactions on Circuits and Systems I: Regular Papers, 2025-
dc.identifier.issn1549-8328-
dc.identifier.urihttp://hdl.handle.net/10722/368339-
dc.description.abstract<p>Analog computing-in-memory accelerators promise ultra-low-power, on-device AI by reducing data transfer and energy usage. Yet inherent device variations and high energy consumption for analog-digital conversion continue to hinder their wide-scale adoption in mainstream systems. To address these issues, this paper introduces WireLightning, a novel capacitive-computing accelerator featuring a mixed-signal architecture that rethinks analog AI acceleration. Unlike conventional analog crossbars that encode weights in programmable devices, WireLightning exploits intrinsic charge dynamics in passive capacitors, encoding matrix multiplication through spike amplitude and timing. This design addresses critical limitations such as weight drift, stochasticity, and power-intensive ADC bottlenecks. Key innovations include: amplitude-temporal dual encoding that enables constant-time analog dot-products; time-based decoding scheme that significantly reduces reliance on power-intensive ADCs; row-wise parallel architecture for concurrent dot-product calculations across multiple rows to enhance throughput; and value repetition exploitation in low-bit quantized vectors to reduce multiplications to constant time complexity. A PCB pro totype achieved a range-normalized RMSE of 1.80% - 69.2% of the error in RRAM crossbar implementations - and a normalized error of 9.18%, corresponding to 77.1% of that in leading PCM crossbars. Implemented in a 40-nm CMOS technology, WireLightning macro delivers 465.47 TOPS/W at 4-bit precision, outperforming state-of-the-art analog accelerators while maintaining 0.23% range-normalized RMSE. By integrating algorithm-circuit co-design with physical computing, this work establishes capacitive computing as a promising path toward combining digital precision and analog efficiency in next-generation edge AI. <br></p>-
dc.languageeng-
dc.publisherInstitute of Electrical and Electronics Engineers-
dc.relation.ispartofIEEE Transactions on Circuits and Systems I: Regular Papers-
dc.subjectAI accelerator-
dc.subjectcapacitive computing-
dc.subjectenergy-efficient hardware-
dc.subjectmatrix multiplication-
dc.subjectMixed signal integrated circuits-
dc.subjectmultiply accumulate circuits-
dc.titleWireLightning: Harnessing Capacitances for In-Transit Massively Parallel Matrix Multiplication -
dc.typeArticle-
dc.identifier.doi10.1109/TCSI.2025.3603908-
dc.identifier.scopuseid_2-s2.0-105015163625-
dc.identifier.eissn1558-0806-
dc.identifier.issnl1549-8328-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats