Hmt: Hierarchical memory transformer for efficient long context language processing

He, Z · 2025 · arXiv 2511.06174

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference

cs.AR · 2026-04-24 · unverdicted · novelty 6.0

HGQ-LUT delivers a practical LUT-aware training framework with new tensor-based layers, heterogeneous quantization, and a resource surrogate that automates accuracy-efficiency trade-offs for FPGA DNN inference.

Understand and Accelerate Memory Processing Pipeline for Large Language Model Inference

cs.DC · 2026-03-30

citing papers explorer

Showing 2 of 2 citing papers.

HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference cs.AR · 2026-04-24 · unverdicted · none · ref 9
HGQ-LUT delivers a practical LUT-aware training framework with new tensor-based layers, heterogeneous quantization, and a resource surrogate that automates accuracy-efficiency trade-offs for FPGA DNN inference.
Understand and Accelerate Memory Processing Pipeline for Large Language Model Inference cs.DC · 2026-03-30 · unreviewed · ref 10

Hmt: Hierarchical memory transformer for efficient long context language processing

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer