
Upcycling large language models into mixture of experts

5 Pith papers cite this work. Polarity classification is still indexing.


citation summary

fields: cs.LG 4, cs.CL 1

years: 2026 2, 2025 3

roles: background 1

polarities: background 1

representative citing papers

SpikingBrain: Spiking Brain-inspired Large Models

cs.LG · 2025-09-05 · unverdicted · novelty 6.0

SpikingBrain-7B and SpikingBrain-76B achieve Transformer-comparable performance after continual pre-training on 150B tokens, with a more than 100x speedup in time to first token (TTFT) on 4M-token sequences and 69.15% sparsity from event-driven spiking.
