EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation • Paper • 2605.23271 • Published 10 days ago • 77
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards • Paper • 2605.21467 • Published 12 days ago • 204
HodgeCover: Higher-Order Topological Coverage Drives Compression of Sparse Mixture-of-Experts • Paper • 2605.13997 • Published 19 days ago • 5
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows • Paper • 2605.14678 • Published 13 days ago • 102
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration • Paper • 2605.20025 • Published 13 days ago • 185
PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset • Paper • 2605.20147 • Published 13 days ago • 11
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation • Paper • 2605.10912 • Published 21 days ago • 46
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model • Paper • 2604.19747 • Published Apr 21 • 39
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model • Paper • 2604.20796 • Published Apr 22 • 242