Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments Paper • 2605.27209 • Published 3 days ago • 6
VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions Paper • 2605.27141 • Published 3 days ago • 11
AlphaAlign: Incentivizing Safety Alignment with Extremely Simplified Reinforcement Learning Paper • 2507.14987 • Published Jul 20, 2025
MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation Paper • 2510.24431 • Published Oct 28, 2025
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning Paper • 2601.21468 • Published Jan 29 • 25
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs Paper • 2602.03048 • Published Feb 3 • 32
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents Paper • 2509.23040 • Published Sep 27, 2025 • 12
Learning to Self-Verify Makes Language Models Better Reasoners Paper • 2602.07594 • Published Feb 7 • 3
Transport and Merge: Cross-Architecture Merging for Large Language Models Paper • 2602.05495 • Published Feb 5
Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer Paper • 2602.19058 • Published Feb 22
AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation Paper • 2604.18240 • Published Apr 20 • 16
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 22 days ago • 111
Self-ReSET: Learning to Self-Recover from Unsafe Reasoning Trajectories Paper • 2605.08936 • Published 20 days ago • 1