RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning
arXiv:2604.00790v1
Abstract: While large language models (LLMs) have demonstrated strong performance on complex reasoning tasks such as competitive programming (CP), existing methods …
Shaopeng Fu, Xingxing Zhang, Li Dong, Di Wang, Furu Wei