
News & Views

Neuromorphic computing

Overcoming computational bottlenecks in large language models through analog in-memory computing

A recent study demonstrates the potential of an analog in-memory computing architecture for implementing large language models, improving computational efficiency in both time and energy while maintaining high accuracy.
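
As background, during autoregressive generation with a transformer, each newly generated token requires, for every attention head, two matrix–vector products against the cached key and value matrices: attention(q, K, V) = softmax(q Kᵀ / √d) V, where q is the query of the current token, K and V hold the keys and values of all previous tokens, and d is the head dimension. Because this key–value cache grows with sequence length, the cost of these products is dominated by moving K and V between memory and processor rather than by the arithmetic itself (refs 3,4). Analog in-memory computing instead carries out matrix–vector products inside the memory array that stores the operands (refs 5,6,8), which is how the new study reduces both latency and energy.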


Fig. 1: Accelerating the attention mechanism with analog in-memory computing.
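
To make this concrete, the minimal sketch below (Python/NumPy) is illustrative only and is not the implementation of Leroux et al.: the toy dimensions, the helper analog_mvm and its Gaussian noise model are all assumptions. It computes single-token attention as two matrix–vector products over the cached keys and values, the operation an analog crossbar evaluates directly where the data are stored, and compares the noisy result with the ideal digital one; the softmax is assumed to remain in the digital periphery.

import numpy as np

rng = np.random.default_rng(0)

def analog_mvm(matrix, vector, noise_std=0.02):
    # Matrix-vector product with additive Gaussian noise standing in for
    # analog crossbar read noise (a placeholder, not a device model).
    ideal = matrix @ vector
    return ideal + noise_std * np.abs(ideal).mean() * rng.standard_normal(ideal.shape)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Toy sizes: 16 cached tokens, head dimension 8.
seq_len, d_head = 16, 8
K = rng.standard_normal((seq_len, d_head))   # cached keys, resident in the array
V = rng.standard_normal((seq_len, d_head))   # cached values, resident in the array
q = rng.standard_normal(d_head)              # query of the token being generated

scores = analog_mvm(K, q) / np.sqrt(d_head)  # first in-memory product: q against all keys
weights = softmax(scores)                    # normalization, assumed digital
out = analog_mvm(V.T, weights)               # second in-memory product: weighted sum of values

ref = softmax(K @ q / np.sqrt(d_head)) @ V   # ideal digital reference
print("max deviation from digital attention:", float(np.abs(out - ref).max()))

In this picture K and V never leave the array, so the data movement that dominates attention on digital hardware (ref. 3) disappears, at the price of the analog noise modelled crudely by analog_mvm; this trade-off is why the accuracy result highlighted in the standfirst matters.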

References

  1. de Vries, A. Joule 7, 2191–2194 (2023).

  2. Leroux, N. et al. Nat. Comput. Sci. https://doi.org/10.1038/s43588-025-00854-1 (2025).

  3. Horowitz, M. Computing’s energy problem (and what we can do about it). In Proc. 2014 IEEE International Solid-State Circuits Conference 10–14 (IEEE, 2014).

  4. Liu, Z. et al. Scissorhands: exploiting the persistence of importance hypothesis for LLM KV cache compression at test time. In Proc. Advances in Neural Information Processing Systems (eds Oh, A. et al.) 52342–52364 (NeurIPS, 2023).

  5. Wan, W. et al. Nature 608, 504–512 (2022).

  6. Yao, P. et al. Nature 577, 641–646 (2020).

  7. Lin, Y. et al. Nat. Comput. Sci. 5, 27–36 (2025).

  8. Yang, X., Yan, B., Li, H. & Chen, Y. ReTransformer: ReRAM-based processing-in-memory architecture for transformer acceleration. In Proc. 39th International Conference on Computer-Aided Design (Association for Computing Machinery, 2020).

  9. Yang, H. et al. Monolithic 3D integration of analog RRAM-based fully weight stationary and novel CFET 2T0C-based partially weight stationary for accelerating transformer. In Proc. 2024 IEEE Symposium on VLSI Technology and Circuits (IEEE, 2024).

  10. Sridharan, S., Stevens, J. R., Roy, K. & Raghunathan, A. IEEE Trans. Very Large Scale Integr. VLSI Syst. 31, 1223–1233 (2023).


Author information

Corresponding author

Correspondence to Jianshi Tang.

Ethics declarations

Competing interests

The authors declare no competing interests.

About this article

Cite this article

Lin, Y. & Tang, J. Overcoming computational bottlenecks in large language models through analog in-memory computing. Nat. Comput. Sci. 5, 711–712 (2025). https://doi.org/10.1038/s43588-025-00860-3

