Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning
arXiv:2602.13218v1 Announce Type: new Abstract: Scaling verifiable training signals remains a key bottleneck for Reinforcement Learning from Verifiable Rewards (RLVR). Logical reasoning is a natural …
Bowen Liu, Zhi Wu, Runquan Xie, Zhanhui Kang, Jia Li
17 views