CoRR

Yijiong Yu, Ziyun Dai, Zekun Wang, Wei Wang, Ran Chen, Ji Pei · 2025 · arXiv 2501.08197

1 Pith paper cites this work. Polarity classification is still being indexed.


representative citing papers

ConSA: Controllable Sparsity in Hybrid Attention via Learnable Allocation

cs.CL · 2026-06-16 · unverdicted · novelty 6.0

ConSA learns FA/SWA allocation via L0 masks and augmented Lagrangian constraints, outperforming rule-based baselines on 0.6B and 1.7B models with consistent layer patterns.

citing papers explorer

Showing 1 of 1 citing paper.

ConSA: Controllable Sparsity in Hybrid Attention via Learnable Allocation · cs.CL · 2026-06-16 · unverdicted · none · ref 32
