• Including the 24 minutes spent in the coffee shop, the total time is 3 + 0.4 = 3.4 hours, which is 3.4 × 60 = 204 minutes

Determine the time for the walk at s + 1/2 = 3 km/h:
• The time for the walk is 9/3 = 3 hours
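The arithmetic in this excerpt can be checked with a short sketch. It uses only the values quoted here (a distance of 9, taken to be kilometres, a speed of 3 km/h, and a 24-minute coffee shop stop). The function name `total_minutes` and its parameter names are illustrative assumptions, not part of the quoted solution.

```python
from fractions import Fraction

def total_minutes(distance_km, speed_kmh, stop_minutes):
    """Walking time plus a fixed stop, in minutes (hypothetical helper)."""
    # Exact rational arithmetic avoids floating-point rounding in hours.
    walk_hours = Fraction(distance_km) / Fraction(speed_kmh)
    return walk_hours * 60 + stop_minutes

# Values from the excerpt: 9 km at 3 km/h, plus 24 minutes in the coffee shop.
total = total_minutes(9, 3, 24)
print(total)  # 204
```

The result matches the excerpt: 3 hours of walking is 180 minutes, and adding the 24-minute stop gives 204 minutes.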

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Beyond Majority Voting: Towards Fine-grained and More Reliable Reward Signal for Test-Time Reinforcement Learning

cs.CL · 2025-12-17 · unverdicted · novelty 7.0

SCOPE uses step-wise confidence and dynamic subgroups to create finer pseudo-labels in test-time RL, delivering 13.1% relative gains on AIME 2025 over majority-voting baselines.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Majority Voting: Towards Fine-grained and More Reliable Reward Signal for Test-Time Reinforcement Learning

cs.CL · 2025-12-17 · unverdicted · none · ref 19
