Mean-field perturbation theory of dropout at the edge of chaos yields distinct universality classes for smooth versus kinked activations, critical scaling laws for correlation decay, and front-loaded dropout schedules that reduce test loss.
hub
Advances in Neural Information Processing Systems , volume =
12 Pith papers cite this work. Polarity classification is still indexing.
hub tools
years
2026 12representative citing papers
IsalProgram is a regular assembly-like language where all instruction strings are valid programs executed on a circular doubly linked list VM without addresses or variable names.
Proves that RoPE attention loses locality bias and token distinction in long contexts, approaching random behavior independent of content.
A new VLA model called SI uses a four-step chain-of-thought to derive driving intent and applies it via classifier-free guidance to a flow-matching trajectory generator, showing competitive Waymo scores and intent-controllable plans.
NeuroMAS reframes multi-agent language systems as neural architectures where LLM agents learn coordination via reinforcement learning rather than predefined roles.
ReflectDrive-2 combines masked discrete diffusion with RL-aligned self-editing to generate and refine driving trajectories, reaching 91.0 PDMS on NAVSIM camera-only and 94.8 in best-of-6.
UTOPYA fuses eight modalities via FiLM-conditioned attention and physics-informed regularization to reach AUROC 0.874 for anomaly detection in batch distillation, outperforming baselines by 0.147.
CKT-WAM transfers teacher WAM knowledge to students via compressed text-embedding contexts using LQCA and adapters, reaching 86.1% success on LIBERO-Plus with 1.17% trainable parameters and 83.3% in real-world tasks.
IConFace performs unified reference-aware and no-reference blind face restoration by asymmetrically conditioning identity from references and structure from the degraded image.
Benchmarking in pediatric ICU antimicrobial stewardship shows performance depends mainly on target prevalence and dataset traits rather than model complexity, with sequence models improving precision-recall at 24-hour resolution but showing poorer calibration than tabular models.
Authors call for contamination-resistant LLM benchmarks that exploit Transformer training-inference asymmetry and require new mathematical methods for cross-architecture interoperability.
A context-aware synthetic augmentation framework with a hybrid clinical-language model improves psychological defense mechanism classification to 58.26% accuracy and 24.62% macro-F1 in low-resource conditions, outperforming the DMRS Co-Pilot baseline.
citing papers explorer
-
Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos
Mean-field perturbation theory of dropout at the edge of chaos yields distinct universality classes for smooth versus kinked activations, critical scaling laws for correlation decay, and front-loaded dropout schedules that reduce test loss.
-
The IsalProgram Programming Language
IsalProgram is a regular assembly-like language where all instruction strings are valid programs executed on a circular doubly linked list VM without addresses or variable names.
-
RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably
Proves that RoPE attention loses locality bias and token distinction in long contexts, approaching random behavior independent of content.
-
Action Emergence from Streaming Intent
A new VLA model called SI uses a four-step chain-of-thought to derive driving intent and applies it via classifier-free guidance to a flow-matching trajectory generator, showing competitive Waymo scores and intent-controllable plans.
-
NeuroMAS: Multi-Agent Systems as Neural Networks with Joint Reinforcement Learning
NeuroMAS reframes multi-agent language systems as neural architectures where LLM agents learn coordination via reinforcement learning rather than predefined roles.
-
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving
ReflectDrive-2 combines masked discrete diffusion with RL-aligned self-editing to generate and refine driving trajectories, reaching 91.0 PDMS on NAVSIM camera-only and 94.8 in best-of-6.
-
UTOPYA: A Multimodal Deep Learning Framework for Physics-Informed Anomaly Detection and Time-Series Prediction
UTOPYA fuses eight modalities via FiLM-conditioned attention and physics-informed regularization to reach AUROC 0.874 for anomaly detection in batch distillation, outperforming baselines by 0.147.
-
CKT-WAM: Parameter-Efficient Context Knowledge Transfer Between World Action Models
CKT-WAM transfers teacher WAM knowledge to students via compressed text-embedding contexts using LQCA and adapters, reaching 86.1% success on LIBERO-Plus with 1.17% trainable parameters and 83.3% in real-world tasks.
-
IConFace: Identity-Structure Asymmetric Conditioning for Unified Reference-Aware Face Restoration
IConFace performs unified reference-aware and no-reference blind face restoration by asymmetrically conditioning identity from references and structure from the degraded image.
-
Benchmarking Machine Learning Architectures for Antimicrobial Stewardship in Pediatric ICUs
Benchmarking in pediatric ICU antimicrobial stewardship shows performance depends mainly on target prevalence and dataset traits rather than model complexity, with sequence models improving precision-recall at 24-hour resolution but showing poorer calibration than tabular models.
-
LLM Benchmark Datasets Should Be Contamination-Resistant
Authors call for contamination-resistant LLM benchmarks that exploit Transformer training-inference asymmetry and require new mathematical methods for cross-architecture interoperability.
-
Mitigating Data Scarcity in Psychological Defense Classification with Context-Aware Synthetic Augmentation
A context-aware synthetic augmentation framework with a hybrid clinical-language model improves psychological defense mechanism classification to 58.26% accuracy and 24.62% macro-F1 in low-resource conditions, outperforming the DMRS Co-Pilot baseline.