arxiv:2604.27776
Zhenran Xu
imryanxu
AI & ML interests
fishing in lab while working on language agents
Recent Activity
authored a paper 11 days ago
MSVBench: Towards Human-Level Evaluation of Multi-Shot Video Generation
authored a paper 11 days ago
WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments