archive

Every paper Pith has read. Search by title, abstract, or pith.

7878 papers in cs.LG · page 1

cs.CV 2026-05-14 reviewed

The paper proposes RefDecoder
RefDecoder: Enhancing Visual Generation with Conditional Video Decoding

Bohan Fang +4
cs.LG 2026-05-14 reviewed

FutureSim shows top AI agents predict events at 25% accuracy
FutureSim: Replaying World Events to Evaluate Adaptive Agents

Ameya Prabhu +7
cs.LG 2026-05-14 reviewed

Tensor similarity algebraically checks when two networks compute the same function
When Are Two Networks the Same? Tensor Similarity for Mechanistic Interpretability

Jacob Meyer Cohen +5
cs.LG 2026-05-14 reviewed

This paper introduces Shodh-MoE
Eradicating Negative Transfer in Multi-Physics Foundation Models via Sparse Mixture-of-Experts Routing

Arastu Sharma +1
cs.CL 2026-05-14 reviewed

EHR tables sharpen timing in text-based clinical timelines
Text Knows What, Tables Know When: Clinical Timeline Reconstruction via Retrieval-Augmented Multimodal Alignment

Jeremy C. Weiss +3
cs.RO 2026-05-14 reviewed

Seamless blending cuts robot-hand takeover jitter by 99.8%
Hand-in-the-Loop: Improving Dexterous VLA via Seamless Interventional Correction

Liqun Huang +7
cs.CL 2026-05-14 reviewed

Memory model lets LLMs add knowledge without retraining
MeMo: Memory as a Model

Alfred Wei Lun Leong +8
stat.ML 2026-05-14 reviewed

RoSHAP gives stable rankings by summarizing SHAP distributions
RoSHAP: A Distributional Framework and Robust Metric for Stable Feature Attribution

Boyu Jiang +5
stat.ML 2026-05-14 reviewed

Optimal logging policies minimize OPE error via reward-coverage balance
Logging Policy Design for Off-Policy Evaluation

Connor Douglas +2
stat.ML 2026-05-14 reviewed

Anomaly detection uncovers refinery LP errors and opportunities
From Data to Action: Accelerating Refinery Optimization with AI

\'Abrah\'am Papp +6
stat.ML 2026-05-14 reviewed

AGOP from kernel regression recovers central subspace with fewer samples
Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models

Damek Davis +3
cs.SD 2026-05-14 reviewed

SpeakerLLM turns speaker verification into natural-language reasoning
SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning

Ha-Jin Yu +4
eess.SP 2026-05-14 reviewed

Attention network cuts IRS pilots by 87 percent
Multi-Block Attention for Efficient Channel Estimation in IRS-Assisted mmWave MIMO

Maryam Sabbaghian +2
cs.LG 2026-05-14 reviewed

128 random demos suffice for strong RLVR results
Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance

Alexander G. Schwing +2
cs.LG 2026-05-14 reviewed

DeepTokenEEG is a compact deep learning model that processes EEG brain signals using…
DeepTokenEEG Enhancing Mild Cognitive Impairment and Alzheimers Classification via Tokenized EEG Features

Bui Thanh Tung +9
physics.plasm-ph 2026-05-14 reviewed

Neural emulators deliver real-time virtual circuits for tokamak control
Real-time virtual circuits for plasma shape control via neural network emulators

Adriano Agnello +9
cs.LG 2026-05-14 reviewed

The paper introduces ICGPS, which uses meta-trained generative models for in-context…
In-Context Learning for Data-Driven Censored Inventory Control

Anh-Duy Pham +3
cs.LG 2026-05-14 reviewed

Reinforcement learning estimates hidden states for multivariate HMM forecasts
DRL-STAF: A Deep Reinforcement Learning Framework for State-Aware Forecasting of Complex Multivariate Hidden Markov Processes

Chen Zhang +3
cs.LG 2026-05-14 reviewed

Learned potential reweights bridges to improve generative fidelity
Action-Inspired Generative Models

Debnath Pal +1
cs.LG 2026-05-14 reviewed

Neural solvers reach energy parity after 158000 deployments
An Amortized Efficiency Threshold for Comparing Neural and Heuristic Solvers in Combinatorial Optimization

Sohaib Afifi
cs.CV 2026-05-14 reviewed

Discriminant loss sharpens segmentation boundaries
Deep Image Segmentation via Discriminant Feature Learning

Adam Dawid Sztamborski +2
cs.LG 2026-05-14 reviewed

Min-Max-IRL reaches fast O(n^{-1}) rates without exploration
Fast Rates for Inverse Reinforcement Learning

Andreas Schlaginhaufen +1
cs.LG 2026-05-14 reviewed

Recursive models collapse internally before metrics detect it
Silent Collapse in Recursive Learning Systems

Zhipeng Zhang
cs.LG 2026-05-14 reviewed

SAM worsens DRL backdoors while other fixes reduce them
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning

Chunyi Zhou +6
physics.chem-ph 2026-05-14 reviewed

Neural corrections boost accuracy of implicit solvent models for proteins
All-atomistic Transferable Neural Potentials for Protein Solvation

Konstantin Popov +2
cs.LG 2026-05-14 reviewed

Algorithm reduces Any-Order-PDIV runtime from millions of years to minutes
Woodelf++: A Fast and Unified Partial Dependence Plot Algorithm for Decision Tree Ensembles

Alexander Nadel +2
cs.RO 2026-05-14 reviewed

Alignment lets robots predict tactile sensations from sight
Let Robots Feel Your Touch: Visuo-Tactile Cortical Alignment for Embodied Mirror Resonance

Anan Li +6
cs.SE 2026-05-14 reviewed

ML classifier beats rules at spotting BDD refactoring chances
Mining Subscenario Refactoring Opportunities in Behaviour-Driven Software Test Suites: ML Classifiers and LLM-Judge Baselines

Ali Hassaan Mughal +2
stat.ML 2026-05-14 reviewed

Sequential feature recovery produces power-law scaling
Scaling Laws from Sequential Feature Recovery: A Solvable Hierarchical Model

Arie Wortsman-Zurich +4
cs.LG 2026-05-14 reviewed

Action tokens carry the training signal in agentic RL
Resolving Action Bottleneck: Agentic Reinforcement Learning Informed by Token-Level Energy

David Wipf +9
cs.LG 2026-05-14 reviewed

Bandits recover multi-objective prompts more efficiently
Efficient Multi-objective Prompt Optimization via Pure-exploration Bandits

Chengshuai Shi +4
cs.LG 2026-05-14 reviewed

SeesawNet fuses normalized and raw sequences for better time series forecasts
SeesawNet: Towards Non-stationary Time Series Forecasting with Balanced Modeling of Common and Specific Dependencies

Hao Li +5
cs.LG 2026-05-14 reviewed

Single score reveals accuracy does not equal responsibility
Multi-Dimensional Model Integrity and Responsibility Assessment Index and Scoring Framework

Hung Cao +3
cs.LG 2026-05-14 reviewed

Fine-tuning exposes physical direction in neural PDE weights
Discovering Physical Directions in Weight Space: Composing Neural PDE Experts

Dong Ni +9
cs.LG 2026-05-14 reviewed

LLMs top out at 46 percent exact match on medication choices
RxEval: A Prescription-Level Benchmark for Evaluating LLM Medication Recommendation

Changmiao Wang +6
cs.LG 2026-05-14 reviewed

Activation patching isolates how LLMs represent relative geography
Exploring Geographic Relative Space in Large Language Models through Activation Patching

Kevin Roitero +3
cs.LG 2026-05-14 reviewed

LLM agents autonomously develop ML interatomic potentials
Lang2MLIP: End-to-End Language-to-Machine Learning Interatomic Potential Development with Autonomous Agentic Workflows

Nontawat Charoenphakdee +2
stat.ML 2026-05-14 reviewed

Product kernels recover saturation and multiple descent in high-dim KRR
Large Dimensional Kernel Ridge Regression: Extending to Product Kernels

Qian Lin +3
cs.LG 2026-05-14 reviewed

Foldable layer norms convert exactly to faster RMSNorm
Enjoy Your Layer Normalization with the Computational Efficiency of RMSNorm

Jie Luo +6
cs.CV 2026-05-14 reviewed

ArcGate activation adapts shape to raise remote sensing accuracy
ArcGate: Adaptive Arctangent Gated Activation

Alejandro C. Frery +4
eess.SY 2026-05-14 reviewed

DRL agent cuts bike availability failures with one truck
Fully Dynamic Rebalancing in Dockless Bike-Sharing Systems via Deep Reinforcement Learning

Alberto Pettena +5
cs.LG 2026-05-14 reviewed

Bi-level optimization automates data mixing in offline-to-online RL
ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization

(2) Ant Group +9
cs.AI 2026-05-14 reviewed

A GNN-Transformer trained to imitate a lookahead heuristic selects reduced scenarios for…
Learning Scenario Reduction for Two-Stage Robust Optimization with Discrete Uncertainty

Jianan Zhou +6
cs.LG 2026-05-14 reviewed

Schur projection keeps state-space neural nets stable
A Novel Schur-Decomposition-Based Weight Projection Method for Stable State-Space Neural-Network Architectures

Fredy Ruiz +2
cs.LG 2026-05-14 reviewed

Models evolve reusable skills library at test time
Test-Time Learning with an Evolving Library

Alessandro Sordoni +6
cs.LG 2026-05-14 reviewed

Focused estimator improves PU learning on imbalanced data
Focused PU learning from imbalanced data

Elias Zavitsanos +1
cs.AI 2026-05-14 reviewed

IIQ metric scores AI integration from 0 to 1000
Intelligence Impact Quotient (IIQ): A Framework for Measuring Organizational AI Impact

Amit Bahree +6
cs.LG 2026-05-14 reviewed

Guardrails adapt from sparse noisy failures via conservative induction
LiSA: Lifelong Safety Adaptation via Conservative Policy Induction

Bharath Chandrasekhar +8
cs.LG 2026-05-14 reviewed

Orthogonal projection isolates hallucination signals in LLM answers
When Answers Stray from Questions: Hallucination Detection via Question-Answer Orthogonal Decomposition

Erhu Feng +2
cs.LG 2026-05-14 reviewed

Synthesized open-ended problems raise LLM coding scores by 8-12 points
FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

Alex Dimakis +16