CodeDPO: Aligning code models with self generated and verified source code. arXiv preprint arXiv:2410.05605, 2024a

7 Pith papers cite this work. Polarity classification is still indexing.

Citation roles: background (2)

Citation polarities: background (2)

Citations by year: 2026 (6), 2025 (1)

representative citing papers

Learn from Your Mistakes: Tree-like Self-Play for Secure Code LLMs

cs.CR · 2026-06-02 · unverdicted · novelty 6.0

TSP reframes secure code generation as a tree-structured self-play process that supplies dense on-policy signals at vulnerability-prone nodes, yielding higher security pass rates and cross-language generalization than SFT or unstructured self-play.

Visual-RFT: Visual Reinforcement Fine-Tuning

cs.CV · 2025-03-03 · conditional · novelty 6.0

Visual-RFT applies reinforcement learning with verifiable perception rewards to improve large vision-language models on fine-grained classification, few-shot detection, and grounding tasks.
