Selection, Not Salience: The Shape and Limits of Personalization in Social Highlighting

2026 · cs.IR · arXiv 2606.10398

3 Pith papers cite this work. Polarity classification is still indexing.


abstract

Does personalizing what a reader sees pay off, and where does it stop? Using a social web highlighter and a co-readership identity control (the same document highlighted by many users, which holds document and topic fixed and asks whether a person's own history predicts their marks better than another reader's does), we map the shape and limits of personalization across reading altitudes. At the document altitude we give the clean, leakage-free, identity-controlled measurement that prior next-document evaluations could only upper-bound: a person's history identifies which documents in a co-reading neighborhood are theirs, with an own-versus-other gap of +0.169 against community negatives and +0.119 against topic-matched hard negatives (both highly significant); a content-based arm suggests the signal is not purely title-driven but is largely thematic. This is comparable to the span-level selection signal (+0.14) from our prior work: the selection signal is of comparable magnitude across altitudes (+0.12 to +0.17), most of it stable topic preference. At the sentence altitude, a two-stage personalized auto-highlight (an impersonal model proposes candidates, a personal model re-ranks them) does not improve on its impersonal baseline: two off-the-shelf zero-shot LLMs, including a frontier model, predict highlight locations worse than a lead baseline, and personal re-ranking is beaten by the salience order even on the highest-recall candidate pool, so the null is not merely a Stage-1 ceiling artifact. Measurable personalization appears primarily at the selection layer: modest (~+0.13), topic-dominated, with no reliable gain at the salience layer. We also surface a control-in-negatives bias that inflated our document gap to a spurious +0.227 until audited. Going beyond the shared salience layer may be better approached by aggregating individuals than by personalizing them harder.
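The co-readership identity control described in the abstract can be illustrated with a small sketch. This is not the authors' code. The abstract does not say which scoring function underlies the reported gaps, so cosine similarity between a reader's history profile and their marks is an assumption here, and the names `own_vs_other_gap`, `marks`, and `profiles` are hypothetical. The sketch also assumes each profile was built without the target document, so that no leakage occurs.

```python
import math
from itertools import permutations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def own_vs_other_gap(marks, profiles):
    """Mean paired difference between own-history and other-history scores.

    marks:    {doc_id: {reader_id: vector describing that reader's marks on the doc}}
    profiles: {reader_id: vector built from that reader's history}
              (assumed to exclude the target document)

    Only documents with at least two readers contribute, which holds
    document and topic fixed while the reader identity varies.
    Returns None when no co-read document exists.
    """
    diffs = []
    for by_reader in marks.values():
        # permutations yields nothing for fewer than two readers
        for reader, other in permutations(by_reader, 2):
            own_score = cosine(profiles[reader], by_reader[reader])
            other_score = cosine(profiles[other], by_reader[reader])
            diffs.append(own_score - other_score)
    return sum(diffs) / len(diffs) if diffs else None


# Toy data: two readers of one document with orthogonal interests.
toy_marks = {"d1": {"a": [1.0, 0.0], "b": [0.0, 1.0]}}
toy_profiles = {"a": [1.0, 0.0], "b": [0.0, 1.0]}
gap = own_vs_other_gap(toy_marks, toy_profiles)
```

A positive gap means a reader's own history scores their marks higher than a co-reader's history does. If all readers shared the same profile the gap would be zero, which is the case the control is designed to separate from genuine personal signal.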

representative citing papers

Trait, Not State: The Durability of Reading Identity in Social Highlighting

cs.IR · 2026-06-11 · unverdicted · novelty 6.0

Readers' highlighting patterns on a social web platform remain stable over 24 months as a durable trait, with personal profiles from early documents predicting future selections at roughly 3x the average precision of non-personal baselines.

Factions Within, Uncertain Across: Within-Document Reader Sub-Groups in Social Highlighting

cs.IR · 2026-06-10 · unverdicted · novelty 6.0

Within-document highlighting shows strong reader sub-groups beyond null expectations from salience and popularity, but cross-document reproducibility of pair agreement is near zero and unresolved due to insufficient overlap.

The Long Tail, Not the Front Page: Cold-Start Prediction of Crowd Highlight Salience

cs.IR · 2026-06-10 · unverdicted · novelty 4.0

A supervised logistic ranker on embeddings and features beats the lead baseline by 0.044 average precision in retrospective cold-start prediction of crowd highlights.

citing papers explorer

Showing 3 of 3 citing papers.

Trait, Not State: The Durability of Reading Identity in Social Highlighting
cs.IR · 2026-06-11 · unverdicted · none · ref 2 · internal anchor

Factions Within, Uncertain Across: Within-Document Reader Sub-Groups in Social Highlighting
cs.IR · 2026-06-10 · unverdicted · none · ref 2 · internal anchor

The Long Tail, Not the Front Page: Cold-Start Prediction of Crowd Highlight Salience
cs.IR · 2026-06-10 · unverdicted · none · ref 2 · internal anchor
