Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR Paper • 2605.20164 • Published 4 days ago • 5
VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap Paper • 2405.15683 • Published May 24, 2024
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Paper • 2406.11768 • Published Jun 17, 2024 • 24
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper • 2410.19168 • Published Oct 24, 2024 • 24
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models Paper • 2310.08753 • Published Oct 12, 2023
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning Paper • 2510.12712 • Published Oct 14, 2025
Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction Paper • 2512.14865 • Published Dec 16, 2025 • 2
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences? Paper • 2604.10718 • Published Apr 12 • 4
Audio Hallucination Attacks: Probing the Reliability of Large Audio Language Models Paper • 2603.29263 • Published Mar 31
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions Paper • 2406.04286 • Published Jun 6, 2024
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR Paper • 2605.20164 • Published 4 days ago • 5