archive
Every paper Pith has read.
46077 papers indexed · page 4
-
The paper generalizes bond percolation to tube percolation
Sharp Phase Transition for the Formation of Infinite Tubes
-
Interaction ramp exposes hidden CDW correlations in Hubbard atoms
Revealing Hidden Correlations in a Fermi-Hubbard system via Interaction Ramps
-
Survey ties LLM agent collaboration to failure detection and self-fix
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems
-
BiFedKD raises ECG accuracy 3.5 percent with 40 percent less communication
BiFedKD: Bidirectional Federated Knowledge Distillation Framework for Non-IID and Long-Tailed ECG Monitoring
-
Head ranking doubles KV cache compression in image generators
HeatKV: Head-tuned KV-cache Compression for Visual Autoregressive Modeling
-
The paper presents the Closed-Loop Visual Reasoning (CLVR) framework that integrates…
Unlocking Complex Visual Generation via Closed-Loop Verified Reasoning
-
Transducers extend stabilization for faster relational string solving
String Solving with Stabilization and Transducers (Technical Report)
-
Decomposing traces boosts AI agent diagnosis accuracy up to 12x
Holistic Evaluation and Failure Diagnosis of AI Agents
-
Twists around restriction functors yield autoequivalences for Gorenstein orders
Spherical Twists for Gorenstein Orders and $G$-Hilb
-
The paper presents a fixed six-stage deterministic workflow that confines language model…
A Deterministic Agentic Workflow for HS Tariff Classification: Multi-Dimensional Rule Reasoning with Interpretable Decisions
-
Nested reset counters hit exact F_Ωk levels
The Complexity of Nested Reset Counter Systems
-
The paper introduces ICGPS, which uses meta-trained generative models for in-context…
In-Context Learning for Data-Driven Censored Inventory Control
-
Min-1-planarity testing is NP-hard
Min-1-Planarity is NP-Hard
-
The paper theoretically identifies which post-transition metal oxides form transient…
Transient superionic state in ultrafast-irradiated post-transition metal oxides
-
Mat2Boundary turns boundary conditions into SpMV for PDE solvers
Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids
-
Shared channel basis across frequencies boosts spectral mixers
CHASM: Cross-frequency Harmonized Axis-Separable Mixing for Spectral Token Operators
-
Model reads cell types and protein levels from label-free images
Towards Label-Free Single-Cell Phenotyping Using Multi-Task Learning
-
Vision features align LLM text with clinical data for stroke prognosis
Vision-Core Guided Contrastive Learning for Balanced Multi-modal Prognosis Prediction of Stroke
-
Adaptive mode switching raises fidelity on complex image prompts
Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners
-
Dual-branch model copies text styles across languages in scenes
StyleTextGen: Style-Conditioned Multilingual Scene Text Generation
-
Model generates sign language replies from signing context alone
Towards Continuous Sign Language Conversation from Isolated Signs
-
VLMs fail to locate hidden functional objects from task instructions
SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
-
Generative model turns SDR video into HDR by predicting bracketed exposures
Generating HDR Video from SDR Video
-
Driving model gains planning edge by forecasting 3D futures
EponaV2: Driving World Model with Comprehensive Future Reasoning
-
Randomly initialized nets match active learning without candidate models
Are Candidate Models Really Needed for Active Learning?
-
Multiscale VLM features raise video edit quality
MiVE: Multiscale Vision-language features for reference-guided video Editing
-
Anatomy topology across patients boosts medical scan pre-training
Beyond Instance-Level Self-Supervision in 3D Multi-Modal Medical Imaging
-
New dataset tracks urban land and vegetation shifts with 5221 Sentinel-2 pairs
TERRA-CD: Multi-Temporal Framework for Multi-class and Semantic Change Detection
-
Vision framework with physical priors lifts water level accuracy
Vision-Based Water Level and Flow Estimation
-
RefineCAM improves high-resolution CAMs for CNN explanations
How to Evaluate and Refine your CAM
-
Multi-label benchmark shows MLLMs still miss full emotion mixes
MultiEmo-Bench: Multi-label Visual Emotion Analysis for Multi-modal Large Language Models
-
Reinforcement learning estimates hidden states for multivariate HMM forecasts
DRL-STAF: A Deep Reinforcement Learning Framework for State-Aware Forecasting of Complex Multivariate Hidden Markov Processes
-
Learned potential reweights bridges to improve generative fidelity
Action-Inspired Generative Models
-
Unified diffusion generates aligned VIS-IR-Label triplets from few pairs
UniTriGen: Unified Triplet Generation of Aligned Visible-Infrared-Label for Few-Shot RGB-T Semantic Segmentation
-
Neural solvers reach energy parity after 158000 deployments
An Amortized Efficiency Threshold for Comparing Neural and Heuristic Solvers in Combinatorial Optimization
-
Internal masking cuts hallucinations in vision-language models
Do We Really Need External Tools to Mitigate Hallucinations? SIRA: Shared-Prefix Internal Reconstruction of Attribution
-
Calibration works from any number of real-world views
CalibAnyView: Beyond Single-View Camera Calibration in the Wild
-
Discriminant loss sharpens segmentation boundaries
Deep Image Segmentation via Discriminant Feature Learning
-
ViMU benchmark tests video AI on hidden meanings
ViMU: Benchmarking Video Metaphorical Understanding
-
Hybrid Mamba-attention model extends rainfall forecasts to three hours
MambaRain: Multi-Scale Mamba-Attention Framework for 0-3 Hour Precipitation Nowcasting
-
Gaussians replace grids to lift panoramic images into 3D detections
Towards Accurate Single Panoramic 3D Detection: A Semantic Gaussian Centric Approach
-
Min-Max-IRL reaches fast O(n^{-1}) rates without exploration
Fast Rates for Inverse Reinforcement Learning
-
Two-stage model fuses radar and satellite for sharper rain forecasts
VMU-Diff: A Coarse-to-fine Multi-source Data Fusion Framework for Precipitation Nowcasting
-
TOPOS locks single-image 3D heads to fixed studio topology
TOPOS: High-Fidelity and Efficient Industry-Grade 3D Head Generation
-
Quandle presentations now work for surface knots in any 4-manifold
Quandle presentations of surface knots in 4-manifolds and bridge numbers
-
Twisted crystals yield telecom entangled photons with 95% fidelity
Entangled Telecom Photon Generation using Twisted Van der Waals Crystals
-
Privacy audits need no retraining runs
Privacy Auditing with Zero (0) Training Run
-
Higher-order stain stats raise federated pathology accuracy 3.9%
FedStain: Modeling Higher-Order Stain Statistics for Federated Domain Generalization in Computational Pathology
-
Terminal anchors extend LLM context to 64K from short sequences
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring
-
Recursive models collapse internally before metrics detect it
Silent Collapse in Recursive Learning Systems