archive
Every paper Pith has read. Search by title, abstract, or pith.
595 papers in cs.DC · page 1
-
APWA scales agent workflows by parallelizing non-communicating subproblems
APWA: A Distributed Architecture for Parallelizable Agentic Workflows
-
Cache reorganization lifts GPU speedups for 28-qubit simulations on laptops
Accelerating State-Vector Quantum Simulation on Integrated GPUs via Cache Locality Optimization: A Cross-Architecture Evaluation
-
Wi-Fi logs build hierarchical mobility models with lower complexity
Analysis of wireless network access logs for a hierarchical characterization of user mobility
-
Unified GPU solver gives exact gradients for stiff heterogeneous soft bodies
DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration
-
Exploration fails above ceil(k/(n-2))-1 deactivations per round
Semi-Synchronous Exploration in Dynamic Graphs
-
Distributed Sumcheck gives statistical zero-knowledge for graph problems
Distributed Statistical Zero-Knowledge Proofs via Sumcheck
-
EMA cuts model adaptation costs 15-42% in shifting environments
EMA: Efficient Model Adaptation for Learning-based Systems
-
MinT manages millions of LoRA policies over shared 1T models
MinT: Managed Infrastructure for Training and Serving Millions of LLMs
-
Federated fine-tuning matches centralized LLM training on private data
Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning
-
Adaptive KV compression speeds disaggregated LLM serving up to 9x
KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving
-
Client committee speeds secure aggregation 4.6x
DisAgg: Distributed Aggregators for Efficient Secure Aggregation in Federated Learning
-
Multi-agent RL cuts LLM carbon by 33% and water by 43%
MARLIN: Multi-Agent Game-Theoretic Reinforcement Learning for Sustainable LLM Inference in Cloud Datacenters
-
Hybrid method cuts graph scheduling violations 45%
Sustainable Graph Analytics Workload Scheduling with Evolutionary Reinforcement Learning in Edge-Cloud Systems
-
Rescaled stepsizes remove bias in async SGD
Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity
-
TurboGR trains 0.2B-param generative recommenders at 54.71% MFU
TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation
-
FPGA lock agents boost OLTP throughput 51x over CPUs
FPGA-Accelerated Lock Management and Transaction Processing: Architecture, Optimization, and Design Space Exploration
-
One rule unifies voting, proposals and constitutional amendment in metric spaces
Constitutional Governance in Metric Spaces
-
Hierarchical transformer preconditioner reaches 21 fps on stiff Poisson systems
Hierarchical Transformer Preconditioning for Interactive Physics Simulation
-
Drone swarms adapt composition to deliver lower latency connectivity
Swarm Network-as-a-Service (SNaaS)
-
Pipeline overlap speeds cloud-edge LLM inference 1.16-2.16x
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding
-
Heterogeneous solvers up to 32% faster than GPU-only for big matrices
Comparing the Performance of Heterogeneous Conjugate Gradient and Cholesky Solvers on Various Hardware Using SYCL
-
Dynamic pricing stabilizes mempool volume at target capacity
Dynamic Transaction Scheduling and Pricing in the Ethereum Mempool
-
LCL complexity on trees shifts without exact n knowledge
The Distributed Complexity Landscape on Trees Depends on the Knowledge About the Network Size
-
Overdecomposition supported efficiently on mixed GPGPU clusters
Efficient and Portable Support for Overdecomposition on Distributed Memory GPGPU Platforms
-
Parallel training lets RNNs learn from sequences over 10,000 steps
Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction
-
Decoupled compression speeds GPU collectives up to 9.65x
NCCLZ: Compression-Enabled GPU Collectives with Decoupled Quantization and Entropy Coding
-
Link failures cap LEO capacity scalability at O(1/n)
Capacity Scalability of LEO Constellations With Dynamic Link Failures
-
Per-head adaptive blocks improve sparse attention accuracy by 5.43%
AB-Sparse: Sparse Attention with Adaptive Block Size for Accurate and Efficient Long-Context Inference
-
Node failures scale wireless capacity and delay with sqrt of reliable nodes
On Capacity and Delay of Wireless Networks with Node Failures
-
Power capping leaves LLM decode energy untouched
The Illusion of Power Capping in LLM Decode: A Phase-Aware Energy Characterisation Across Attention Architectures
-
Overlays trade reliability against overhead for AI agent discovery
Trade-offs in Decentralized Agentic AI Discovery Across the Compute Continuum
-
LLM inference should be measured in joules per token at scale
Position: LLM Inference Should Be Evaluated as Energy-to-Token Production
-
GraphFlash hits 127x speedup in serverless graph processing
GraphFlash: Enabling Fast and Elastic Graph Processing on Serverless Infrastructure
-
NAVIS speeds on-SSD vector inserts up to 2.74x
NAVIS: Concurrent Search and Update with Low Position-Seeking Overhead in On-SSD Graph-Based Vector Search
-
Off-chain twins let DeFi agents simulate trades without waiting for blocks
State Twins: An Off-Chain Substrate for Agentic Reasoning over Decentralized Finance Protocols
-
Storage offloading breaks memory wall for full-graph GNN training
GriNNder: Breaking the Memory Capacity Wall in Full-Graph GNN Training with Storage Offloading
-
Task runtime dispatches QIR programs to multiple quantum processors
Classic and Quantum Task-Based Intelligent Runtime for QIRs Running on Multiple QPUs
-
Kairos cuts physical AI task latency by 32-66%
Kairos: A Scalable Serving System for Physical AI
-
Chunked prefetching speeds DiT steps up to 1.28x with 49% less GPU memory
ChunkFlow: Communication-Aware Chunked Prefetching for Layerwise Offloading in Distributed Diffusion Transformer Inference
-
Chakra standardizes graph traces for AI workload benchmarking
MLCommons Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces
-
Directed graphs support Byzantine consensus only under specific connectivity
Byzantine Consensus in Directed Graphs with Message Authentication
-
ReCoVer preserves exact training trajectory after GPU losses
ReCoVer: Resilient LLM Pre-Training System via Fault-Tolerant Collective and Versatile Workload
-
ShardTensor scales SciML to arbitrary spatial resolutions
ShardTensor: Domain Parallelism for Scientific Machine Learning
-
GCC 15 outperforms LLVM 21 in four of six RISC-V vector apps
Closer in the Gap: Towards Portable Performance on RISC-V Vector Processors
-
Edge micro-agent fixes failures safely with no destructive actions
An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum
-
Mutable membership lets MoE survive rank faults without restarts
Surviving Partial Rank Failures in Wide Expert-Parallel MoE Inference
-
Bidirectional review systematizes peer-reviewed studies on AI and DLT convergence
SoK: A Systematic Bidirectional Literature Review of AI & DLT Convergence