File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TCSI.2025.3603908
- Scopus: eid_2-s2.0-105015163625
- Find via

Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: WireLightning: Harnessing Capacitances for In-Transit Massively Parallel Matrix Multiplication
| Title | WireLightning: Harnessing Capacitances for In-Transit Massively Parallel Matrix Multiplication |
|---|---|
| Authors | |
| Keywords | AI accelerator capacitive computing energy-efficient hardware matrix multiplication Mixed signal integrated circuits multiply accumulate circuits |
| Issue Date | 1-Sep-2025 |
| Publisher | Institute of Electrical and Electronics Engineers |
| Citation | IEEE Transactions on Circuits and Systems I: Regular Papers, 2025 How to Cite? |
| Abstract | Analog computing-in-memory accelerators promise ultra-low-power, on-device AI by reducing data transfer and energy usage. Yet inherent device variations and high energy consumption for analog-digital conversion continue to hinder their wide-scale adoption in mainstream systems. To address these issues, this paper introduces WireLightning, a novel capacitive-computing accelerator featuring a mixed-signal architecture that rethinks analog AI acceleration. Unlike conventional analog crossbars that encode weights in programmable devices, WireLightning exploits intrinsic charge dynamics in passive capacitors, encoding matrix multiplication through spike amplitude and timing. This design addresses critical limitations such as weight drift, stochasticity, and power-intensive ADC bottlenecks. Key innovations include: amplitude-temporal dual encoding that enables constant-time analog dot-products; time-based decoding scheme that significantly reduces reliance on power-intensive ADCs; row-wise parallel architecture for concurrent dot-product calculations across multiple rows to enhance throughput; and value repetition exploitation in low-bit quantized vectors to reduce multiplications to constant time complexity. A PCB pro totype achieved a range-normalized RMSE of 1.80% - 69.2% of the error in RRAM crossbar implementations - and a normalized error of 9.18%, corresponding to 77.1% of that in leading PCM crossbars. Implemented in a 40-nm CMOS technology, WireLightning macro delivers 465.47 TOPS/W at 4-bit precision, outperforming state-of-the-art analog accelerators while maintaining 0.23% range-normalized RMSE. By integrating algorithm-circuit co-design with physical computing, this work establishes capacitive computing as a promising path toward combining digital precision and analog efficiency in next-generation edge AI. |
| Persistent Identifier | http://hdl.handle.net/10722/368339 |
| ISSN | 2023 Impact Factor: 5.2 2023 SCImago Journal Rankings: 1.836 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Wang, Song | - |
| dc.contributor.author | Wang, Zhu | - |
| dc.contributor.author | Li, Can | - |
| dc.contributor.author | So, Hayden Kwok Hay | - |
| dc.date.accessioned | 2025-12-24T00:37:44Z | - |
| dc.date.available | 2025-12-24T00:37:44Z | - |
| dc.date.issued | 2025-09-01 | - |
| dc.identifier.citation | IEEE Transactions on Circuits and Systems I: Regular Papers, 2025 | - |
| dc.identifier.issn | 1549-8328 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/368339 | - |
| dc.description.abstract | <p>Analog computing-in-memory accelerators promise ultra-low-power, on-device AI by reducing data transfer and energy usage. Yet inherent device variations and high energy consumption for analog-digital conversion continue to hinder their wide-scale adoption in mainstream systems. To address these issues, this paper introduces WireLightning, a novel capacitive-computing accelerator featuring a mixed-signal architecture that rethinks analog AI acceleration. Unlike conventional analog crossbars that encode weights in programmable devices, WireLightning exploits intrinsic charge dynamics in passive capacitors, encoding matrix multiplication through spike amplitude and timing. This design addresses critical limitations such as weight drift, stochasticity, and power-intensive ADC bottlenecks. Key innovations include: amplitude-temporal dual encoding that enables constant-time analog dot-products; time-based decoding scheme that significantly reduces reliance on power-intensive ADCs; row-wise parallel architecture for concurrent dot-product calculations across multiple rows to enhance throughput; and value repetition exploitation in low-bit quantized vectors to reduce multiplications to constant time complexity. A PCB pro totype achieved a range-normalized RMSE of 1.80% - 69.2% of the error in RRAM crossbar implementations - and a normalized error of 9.18%, corresponding to 77.1% of that in leading PCM crossbars. Implemented in a 40-nm CMOS technology, WireLightning macro delivers 465.47 TOPS/W at 4-bit precision, outperforming state-of-the-art analog accelerators while maintaining 0.23% range-normalized RMSE. By integrating algorithm-circuit co-design with physical computing, this work establishes capacitive computing as a promising path toward combining digital precision and analog efficiency in next-generation edge AI. <br></p> | - |
| dc.language | eng | - |
| dc.publisher | Institute of Electrical and Electronics Engineers | - |
| dc.relation.ispartof | IEEE Transactions on Circuits and Systems I: Regular Papers | - |
| dc.subject | AI accelerator | - |
| dc.subject | capacitive computing | - |
| dc.subject | energy-efficient hardware | - |
| dc.subject | matrix multiplication | - |
| dc.subject | Mixed signal integrated circuits | - |
| dc.subject | multiply accumulate circuits | - |
| dc.title | WireLightning: Harnessing Capacitances for In-Transit Massively Parallel Matrix Multiplication | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/TCSI.2025.3603908 | - |
| dc.identifier.scopus | eid_2-s2.0-105015163625 | - |
| dc.identifier.eissn | 1558-0806 | - |
| dc.identifier.issnl | 1549-8328 | - |
