PPC - GPT : Federated task-specific compression of large language models via pruning and chain-of-thought distillation

Tao Fan, Guoqiang Ma, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang · 2025 · arXiv 2502.15857

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction

cs.AI · 2025-09-15 · unverdicted · novelty 6.0

A pruning technique called Reasoning-Aware Compression (RAC) jointly reconstructs input and chain-of-thought activations to preserve reasoning performance better than standard methods when compressing models like DeepSeek-R1.

citing papers explorer

Showing 1 of 1 citing paper.

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction cs.AI · 2025-09-15 · unverdicted · none · ref 4
A pruning technique called Reasoning-Aware Compression (RAC) jointly reconstructs input and chain-of-thought activations to preserve reasoning performance better than standard methods when compressing models like DeepSeek-R1.

PPC - GPT : Federated task-specific compression of large language models via pruning and chain-of-thought distillation

fields

years

verdicts

representative citing papers

citing papers explorer