hub

Wukong: Towards a scaling law for large-scale recommendation

Wukong: Towards a scaling law for large-scale recommendation , author= · 2024 · arXiv 2403.02545

18 Pith papers cite this work. Polarity classification is still indexing.

18 Pith papers citing it

read on arXiv browse 18 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

cs.IR · 2026-04-21 · unverdicted · novelty 7.0

LoopCTR trains CTR models with recursive layer reuse and process supervision so that zero-loop inference outperforms baselines on public and industrial datasets.

TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds

cs.IR · 2026-04-15 · unverdicted · novelty 7.0

TokenFormer unifies multi-field and sequential recommendation modeling via bottom-full-top-sliding attention and non-linear interaction representations to avoid sequential collapse and deliver state-of-the-art performance.

IAT: Instance-As-Token Compression for Historical User Sequence Modeling in Industrial Recommender Systems

cs.IR · 2026-04-10 · unverdicted · novelty 7.0

IAT compresses each historical interaction instance into a unified embedding token via temporal-order or user-order schemes, allowing standard sequence models to learn long-range preferences with better performance and transferability.

Compute Only Once: UG-Separation for Efficient Large Recommendation Models

cs.IR · 2026-02-11 · unverdicted · novelty 7.0

UG-Separation framework disentangles user-side and item-side flows in TokenMixer dense-interaction models to enable reusable user computations, cutting inference latency up to 20% in ByteDance production scenarios.

FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation

cs.AI · 2026-05-20 · unverdicted · novelty 6.0

FLUID introduces LUCID semantic codes from a multimodal encoder to retire item IDs in livestreaming rankers, with staged warmup yielding online gains of +0.55% watch duration and +2.05% cold-start views.

LoKA: Low-precision Kernel Applications for Recommendation Models At Scale

cs.LG · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

LoKA enables practical FP8 use in numerically sensitive large recommendation models via online profiling of activations, reusable model modifications for stability, and dynamic kernel dispatching.

Understanding DNNs in Feature Interaction Models: A Dimensional Collapse Perspective

cs.LG · 2026-04-29 · unverdicted · novelty 6.0

DNNs mitigate dimensional collapse of embeddings in feature interaction models, shown via parallel and stacked experiments plus gradient analysis.

When Less is More: The LLM Scaling Paradox in Context Compression

cs.LG · 2026-02-10 · unverdicted · novelty 6.0

Larger LLM compressors in lossy setups often yield less faithful context reconstructions due to knowledge overwriting and semantic drift, with mid-sized models outperforming larger ones across 27 tested configurations.

SilverTorch: A Unified Model-based System to Democratize Large-Scale Recommendation on GPUs

cs.IR · 2025-11-18 · unverdicted · novelty 6.0

SilverTorch replaces standalone ANN indexing and filtering with a unified GPU model using a model-based Bloom index and fused Int8 ANN kernel, delivering up to 23.7x throughput and 13.35x cost efficiency gains on industry data.

PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

PACEvolve++ uses a phase-adaptive reinforcement learning advisor to decouple hypothesis selection from execution in LLM-driven evolutionary search, delivering faster convergence than prior frameworks on load balancing, recommendation, and protein tasks.

Harmonizing Generative Retrieval and Ranking in Chain-of-Recommendation

cs.IR · 2026-04-28 · unverdicted · novelty 5.0

RecoChain unifies generative candidate generation via hierarchical semantic IDs and SIM-based ranking in a single Transformer to improve top-K recommendation performance.

Sample Is Feature: Beyond Item-Level, Toward Sample-Level Tokens for Unified Large Recommender Models

cs.IR · 2026-04-17 · unverdicted · novelty 5.0 · 2 refs

SIF encodes entire historical raw samples as tokens via hierarchical group-adaptive quantization and token/sample-level mixing to overcome partial encoding and feature heterogeneity limits in scaled recommender models.

Beyond Dense Connectivity: Explicit Sparsity for Scalable Recommendation

cs.IR · 2026-04-09 · unverdicted · novelty 5.0

SSR uses static random filters and iterative competitive sparse mechanisms to explicitly enforce sparsity in recommendation models, outperforming dense baselines on public and billion-scale industrial datasets.

Revisiting Content-Based Music Recommendation: Efficient Feature Aggregation from Large-Scale Music Models

cs.IR · 2026-02-10 · unverdicted · novelty 5.0

TASTE dataset and MuQ-token aggregation enable effective use of audio features from large music models to improve content-based music recommendations over collaborative filtering alone.

CMSL: Constructive Multi-Sequence Learning for Recommendation Systems

cs.IR · 2026-06-26 · unverdicted · novelty 4.0

CMSL uses a learnable module to disentangle user history into multiple pure sequences modeled with linear attention to improve recommendation performance over single-sequence approaches.

FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost

cs.LG · 2026-04-27 · unverdicted · novelty 4.0

FreeScale reduces computational bubbles by up to 90.3% in distributed training of sequence recommendation models on 256 H100 GPUs via load balancing, prioritized embedding overlap, and SM-Free communication.

Joint Model Parameter Scaling and Universal-Domain Data Integration for E-commerce Search Ranking

cs.IR · 2026-03-25 · unverdicted · novelty 4.0

UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.

SOLARIS: Speculative Offloading of Latent-bAsed Representation for Inference Scaling

cs.LG · 2026-04-13

citing papers explorer

Showing 18 of 18 citing papers.

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction cs.IR · 2026-04-21 · unverdicted · none · ref 25
LoopCTR trains CTR models with recursive layer reuse and process supervision so that zero-loop inference outperforms baselines on public and industrial datasets.
TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds cs.IR · 2026-04-15 · unverdicted · none · ref 52
TokenFormer unifies multi-field and sequential recommendation modeling via bottom-full-top-sliding attention and non-linear interaction representations to avoid sequential collapse and deliver state-of-the-art performance.
IAT: Instance-As-Token Compression for Historical User Sequence Modeling in Industrial Recommender Systems cs.IR · 2026-04-10 · unverdicted · none · ref 47
IAT compresses each historical interaction instance into a unified embedding token via temporal-order or user-order schemes, allowing standard sequence models to learn long-range preferences with better performance and transferability.
Compute Only Once: UG-Separation for Efficient Large Recommendation Models cs.IR · 2026-02-11 · unverdicted · none · ref 32
UG-Separation framework disentangles user-side and item-side flows in TokenMixer dense-interaction models to enable reusable user computations, cutting inference latency up to 20% in ByteDance production scenarios.
FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation cs.AI · 2026-05-20 · unverdicted · none · ref 44
FLUID introduces LUCID semantic codes from a multimodal encoder to retire item IDs in livestreaming rankers, with staged warmup yielding online gains of +0.55% watch duration and +2.05% cold-start views.
LoKA: Low-precision Kernel Applications for Recommendation Models At Scale cs.LG · 2026-05-11 · unverdicted · none · ref 83 · 2 links
LoKA enables practical FP8 use in numerically sensitive large recommendation models via online profiling of activations, reusable model modifications for stability, and dynamic kernel dispatching.
Understanding DNNs in Feature Interaction Models: A Dimensional Collapse Perspective cs.LG · 2026-04-29 · unverdicted · none · ref 33
DNNs mitigate dimensional collapse of embeddings in feature interaction models, shown via parallel and stacked experiments plus gradient analysis.
When Less is More: The LLM Scaling Paradox in Context Compression cs.LG · 2026-02-10 · unverdicted · none · ref 18
Larger LLM compressors in lossy setups often yield less faithful context reconstructions due to knowledge overwriting and semantic drift, with mid-sized models outperforming larger ones across 27 tested configurations.
SilverTorch: A Unified Model-based System to Democratize Large-Scale Recommendation on GPUs cs.IR · 2025-11-18 · unverdicted · none · ref 39
SilverTorch replaces standalone ANN indexing and filtering with a unified GPU model using a model-based Bloom index and fused Int8 ANN kernel, delivering up to 23.7x throughput and 13.35x cost efficiency gains on industry data.
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents cs.LG · 2026-05-07 · unverdicted · none · ref 53
PACEvolve++ uses a phase-adaptive reinforcement learning advisor to decouple hypothesis selection from execution in LLM-driven evolutionary search, delivering faster convergence than prior frameworks on load balancing, recommendation, and protein tasks.
Harmonizing Generative Retrieval and Ranking in Chain-of-Recommendation cs.IR · 2026-04-28 · unverdicted · none · ref 22
RecoChain unifies generative candidate generation via hierarchical semantic IDs and SIM-based ranking in a single Transformer to improve top-K recommendation performance.
Sample Is Feature: Beyond Item-Level, Toward Sample-Level Tokens for Unified Large Recommender Models cs.IR · 2026-04-17 · unverdicted · none · ref 25 · 2 links
SIF encodes entire historical raw samples as tokens via hierarchical group-adaptive quantization and token/sample-level mixing to overcome partial encoding and feature heterogeneity limits in scaled recommender models.
Beyond Dense Connectivity: Explicit Sparsity for Scalable Recommendation cs.IR · 2026-04-09 · unverdicted · none · ref 36
SSR uses static random filters and iterative competitive sparse mechanisms to explicitly enforce sparsity in recommendation models, outperforming dense baselines on public and billion-scale industrial datasets.
Revisiting Content-Based Music Recommendation: Efficient Feature Aggregation from Large-Scale Music Models cs.IR · 2026-02-10 · unverdicted · none · ref 48
TASTE dataset and MuQ-token aggregation enable effective use of audio features from large music models to improve content-based music recommendations over collaborative filtering alone.
CMSL: Constructive Multi-Sequence Learning for Recommendation Systems cs.IR · 2026-06-26 · unverdicted · none · ref 107
CMSL uses a learnable module to disentangle user history into multiple pure sequences modeled with linear attention to improve recommendation performance over single-sequence approaches.
FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost cs.LG · 2026-04-27 · unverdicted · none · ref 5
FreeScale reduces computational bubbles by up to 90.3% in distributed training of sequence recommendation models on 256 H100 GPUs via load balancing, prioritized embedding overlap, and SM-Free communication.
Joint Model Parameter Scaling and Universal-Domain Data Integration for E-commerce Search Ranking cs.IR · 2026-03-25 · unverdicted · none · ref 35
UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.
SOLARIS: Speculative Offloading of Latent-bAsed Representation for Inference Scaling cs.LG · 2026-04-13 · unreviewed · ref 42

Wukong: Towards a scaling law for large-scale recommendation

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer