Sequence discriminative distributed training of long short-term memory recurrent neural networks

Haşim Sak, Oriol Vinyals, Georg Heigold, Andrew Senior, Erik McDermott, Rajat Monga + 1 more · 2014 · Interspeech 2014 · DOI 10.21437/interspeech.2014-305

1 Pith paper cite this work, alongside 26 external citations. Polarity classification is still indexing.

1 Pith paper citing it

26 external citations · Crossref

open at publisher browse 1 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance

cs.LG · 2026-05-14 · unverdicted · novelty 5.0

FEST improves RLVR sample efficiency on math and coding benchmarks by combining supervised signals, on-policy signals, and decaying weights on just 128 randomly chosen demonstrations, matching full-dataset baselines.

citing papers explorer

Showing 1 of 1 citing paper.

Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance cs.LG · 2026-05-14 · unverdicted · none · ref 8
FEST improves RLVR sample efficiency on math and coding benchmarks by combining supervised signals, on-policy signals, and decaying weights on just 128 randomly chosen demonstrations, matching full-dataset baselines.

Sequence discriminative distributed training of long short-term memory recurrent neural networks

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer