arXiv preprint arXiv:2312.04455

Fortify the shortest stave in attention: Enhancing context awareness of large language models for effective tool use · arXiv 2312.04455

1 Pith paper cites this work. Polarity classification is still being indexed.


representative citing papers

Mitigating Position Bias in Transformers via Layer-Specific Positional Embedding Scaling

cs.CL · 2026-06-26 · unverdicted · novelty 6.0

LPES uses per-layer scaling factors optimized by a genetic algorithm with Bézier curves to balance attention and improve long-context LLM performance by up to 11.2% on key-value retrieval.

citing papers explorer


Mitigating Position Bias in Transformers via Layer-Specific Positional Embedding Scaling
cs.CL · 2026-06-26 · unverdicted · none · ref 19
