Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration Paper • 2604.11446 • Published 6 days ago • 4
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published Feb 3 • 41
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 39
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning Paper • 2602.00759 • Published Jan 31 • 5
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14, 2025 • 29
YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7, 2025 • 3
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30, 2025 • 59