pith. sign in

A vision-language-action-critic model for robotic real-world reinforcement learning.CoRR, abs/2509.15937, 2025

22 Pith papers cite this work. Polarity classification is still indexing.

22 Pith papers citing it

citation-role summary

background 4

citation-polarity summary

years

2026 21 2025 1

roles

background 4

polarities

background 3 support 1

clear filters

representative citing papers

Rank-Then-Act: Reward-Free Control from Frame-Order Progress

cs.LG · 2026-07-02 · unverdicted · novelty 6.0

RTA trains a VLM as a progress ordinal scorer via GRPO on shuffled expert frames and uses Spearman rank correlation with temporal indices as a bounded RL reward, matching or exceeding prior video reward methods on discrete and continuous control benchmarks.

Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

cs.DC · 2026-05-15 · unverdicted · novelty 6.0

Asteria is a runtime system that enables second-order optimization for LLMs by dynamically distributing optimizer state across GPU, CPU, and NVMe while using asynchronous inverse-root computations and bounded-staleness synchronization.

High Precision Hydraulic Excavator Control for Heavy-Duty Grading

cs.RO · 2026-05-10 · unverdicted · novelty 6.0

Autonomous excavator controller achieves 1.8 cm RMSE in heavy-duty grading across different hydraulic architectures, outperforming commercial solutions by a factor of 2.6 in precision while better utilizing machine pressure.

KAPPS: A knowledge-based CPPS Architecture for the Circular Factory

cs.AI · 2026-05-21 · unverdicted · novelty 5.0

KAPPS is a knowledge-based CPPS architecture that uses an ontology-grounded knowledge graph as the unifying data backbone and authoritative write-time state for handling uncertainty in circular manufacturing, demonstrated via anomaly detection and constraint enforcement use cases.

citing papers explorer

Showing 3 of 3 citing papers after filters.