A 3D SRAM-eDRAM hybrid CIM design in 22nm FDSOI enables general-purpose matrix computations beyond dot products with claimed balance of latency, energy, and density.
Neural cache: Bit-serial in-cache acceleration of deep neural networks
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
fields
cs.AR 3years
2026 3representative citing papers
AQPIM performs in-memory product quantization of activations for LLMs on PIM hardware, reducing GPU-CPU communication by 90-98.5% and delivering 3.4x speedup over prior PIM methods.