InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published 13 days ago • 304
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 285
OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent Paper • 2601.07779 • Published Jan 12 • 28
Graph Out-of-Distribution Detection via Test-Time Calibration with Dual Dynamic Dictionaries Paper • 2511.13541 • Published Nov 17, 2025
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published Jul 24, 2025 • 86
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents Paper • 2512.12730 • Published Dec 14, 2025 • 51
Redundancy-Aware Test-Time Graph Out-of-Distribution Detection Paper • 2510.14562 • Published Oct 16, 2025 • 1
Structural Entropy Guided Unsupervised Graph Out-Of-Distribution Detection Paper • 2503.03241 • Published Mar 5, 2025 • 2
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents Paper • 2507.19478 • Published Jul 25, 2025 • 33
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data Paper • 2509.15221 • Published Sep 18, 2025 • 111
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 217
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics Paper • 2506.04308 • Published Jun 4, 2025 • 43
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments Paper • 2506.02387 • Published Jun 3, 2025 • 58
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Paper • 2403.13352 • Published Mar 20, 2024 • 1
Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models Paper • 2405.20775 • Published May 26, 2024
Dynamic Pyramid Network for Efficient Multimodal Large Language Model Paper • 2503.20322 • Published Mar 26, 2025 • 1