Contextual position encoding: Learning to count what's important

2 Pith papers cite this work. Polarity classification is still indexing.
Representative citing papers

Dual Triangle Attention
Dual Triangle Attention achieves effective bidirectional attention with a built-in positional inductive bias via dual triangular masks. It outperforms standard bidirectional attention on position-sensitive tasks and shows strong masked language modeling results with or without positional embeddings.
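The summary above does not spell out the mechanism, so the following is a minimal sketch of one plausible reading: "dual triangular masks" taken to mean a lower-triangular (left-to-right) and an upper-triangular (right-to-left) causal mask, with the two directed attention streams averaged. The function name `dual_triangle_attention` and the equal-weight combination are illustrative assumptions, not the paper's confirmed formulation.

```python
import torch

def dual_triangle_attention(q, k, v):
    """Sketch: score once, then apply two triangular masks and
    combine the resulting attention outputs.
    Shapes: q, k, v are (batch, seq_len, dim)."""
    seq_len = q.size(1)
    scale = q.size(-1) ** -0.5
    scores = torch.einsum("bqd,bkd->bqk", q, k) * scale

    # Lower-triangular mask: token i attends to positions <= i.
    lower = torch.tril(torch.ones(seq_len, seq_len,
                                  dtype=torch.bool, device=q.device))
    # Upper-triangular mask: token i attends to positions >= i.
    upper = torch.triu(torch.ones(seq_len, seq_len,
                                  dtype=torch.bool, device=q.device))

    neg_inf = torch.finfo(scores.dtype).min
    fwd = torch.softmax(scores.masked_fill(~lower, neg_inf), dim=-1) @ v
    bwd = torch.softmax(scores.masked_fill(~upper, neg_inf), dim=-1) @ v

    # Averaging the two directed streams gives every token access to
    # both its left and right context (effectively bidirectional),
    # while the triangular structure itself encodes direction, i.e.
    # a positional inductive bias without positional embeddings.
    return 0.5 * (fwd + bwd)
```

For example, `dual_triangle_attention(torch.randn(2, 8, 16), torch.randn(2, 8, 16), torch.randn(2, 8, 16))` returns a `(2, 8, 16)` tensor; each output position mixes information from both directions even though neither mask alone is bidirectional.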
Hypothesis generation and updating in large language models
LLMs exhibit Bayesian-like hypothesis updating with a strong-sampling bias and an evaluation-generation gap, but they generalize poorly outside observed data.