Gradpruner: Gradient-guided layer pruning enabling efficient fine-tuning and inference for llms

Wei Huang, Anda Cheng, Yinggui Wang · 2026 · arXiv 2601.19503

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Drop-Then-Recovery: How Redundant Are Vision-Language-Action Models?

cs.RO · 2026-06-26 · accept · novelty 7.0

VLA language backbones show high redundancy on manipulation benchmarks, with half the LLM blocks removable and even two blocks sufficient to recover baseline performance after fine-tuning, unlike vision and action pathways.

GHOST: Geometry-Hierarchical Online Streaming Token Eviction for Efficient 3D Reconstruction

cs.CV · 2026-05-15 · unverdicted · novelty 6.0 · 2 refs

GHOST is a geometry-hierarchical token eviction framework that halves the KV cache size in monocular video 3D reconstruction while maintaining quality and achieving 1.75x faster inference.

citing papers explorer

Showing 2 of 2 citing papers.

Drop-Then-Recovery: How Redundant Are Vision-Language-Action Models? cs.RO · 2026-06-26 · accept · none · ref 14
VLA language backbones show high redundancy on manipulation benchmarks, with half the LLM blocks removable and even two blocks sufficient to recover baseline performance after fine-tuning, unlike vision and action pathways.
GHOST: Geometry-Hierarchical Online Streaming Token Eviction for Efficient 3D Reconstruction cs.CV · 2026-05-15 · unverdicted · none · ref 8 · 2 links
GHOST is a geometry-hierarchical token eviction framework that halves the KV cache size in monocular video 3D reconstruction while maintaining quality and achieving 1.75x faster inference.

Gradpruner: Gradient-guided layer pruning enabling efficient fine-tuning and inference for llms

fields

years

verdicts

representative citing papers

citing papers explorer