Tomoe: Converting dense large language models to mixture-of-experts through dynamic structural pruning.arXiv preprint arXiv:2501.15316,

Shangqian Gao, Ting Hua, Reza Shirkavand, Chi-Heng Lin, Zheng Tang, Zhengao Li, Longge Yuan, Fangyi Li, Zeyu Zhang, Alireza Ganjdanesh, et al · arXiv 2501.15316

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling

cs.LG · 2026-05-26 · unverdicted · novelty 5.0

Dense2MoE unifies pruning of attention modules with upcycling of MLPs into MoE experts to produce on-device LLMs that improve the latency-accuracy Pareto frontier.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Dense2MoE: Pushing the Pareto Frontier of On-Device LLMs via Unified Pruning and Upcycling cs.LG · 2026-05-26 · unverdicted · none · ref 6
Dense2MoE unifies pruning of attention modules with upcycling of MLPs into MoE experts to produce on-device LLMs that improve the latency-accuracy Pareto frontier.

Tomoe: Converting dense large language models to mixture-of-experts through dynamic structural pruning.arXiv preprint arXiv:2501.15316,

fields

years

verdicts

representative citing papers

citing papers explorer