Title resolution pending

Verma, P · 2024 · arXiv 2406.10254

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

Energy-Gated Attention improves language model validation loss by gating attention according to spectral energy of key embeddings discovered by a learned projection, with consistent gains on TinyShakespeare and Penn Treebank using under 0.26% extra parameters.

Energy-Gated Attention and Wavelet Positional Encoding: Complementary Inductive Biases for Transformer Attention

cs.LG · 2026-05-25 · unverdicted · novelty 5.0

EGA and MoPE together yield a 0.119 validation loss improvement on TinyShakespeare that exceeds the sum of their individual effects, indicating complementary inductive biases for salience and locality.

citing papers explorer

Showing 2 of 2 citing papers.

Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention cs.LG · 2026-05-21 · unverdicted · none · ref 19
Energy-Gated Attention improves language model validation loss by gating attention according to spectral energy of key embeddings discovered by a learned projection, with consistent gains on TinyShakespeare and Penn Treebank using under 0.26% extra parameters.
Energy-Gated Attention and Wavelet Positional Encoding: Complementary Inductive Biases for Transformer Attention cs.LG · 2026-05-25 · unverdicted · none · ref 21
EGA and MoPE together yield a 0.119 validation loss improvement on TinyShakespeare that exceeds the sum of their individual effects, indicating complementary inductive biases for salience and locality.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer