EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning
arXiv:2603.12698v1
Abstract: Reinforcement learning with verifiable rewards (RLVR) is a promising approach for improving code generation in large language models, but its …
Chi Ruan, Dongfu Jiang, Huaye Zeng, Ping Nie, Wenhu Chen