Review history

arxiv: 2606.23686 · 2 revisions

LIBERO-Safety: A Comprehensive Benchmark for Physical and Semantic Safety in Vision-Language-Action Models

  1. 2026-06-29 UNVERDICTED LOW v0.9.1-grok novelty 7.0
    70867 ms 5735 in 1261 out 2026-06-29T04:38:24.333968+00:00
  2. 2026-06-26 UNVERDICTED LOW v0.9.1-grok novelty 6.0
    35561 ms 5735 in 1208 out 2026-06-26T07:53:37.595132+00:00