Diversed Model Discovery via Structured Table Discovery Paper • 2605.22766 • Published 8 days ago • 6
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published Apr 27 • 71
FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published Apr 8 • 95
VecGlypher: Unified Vector Glyph Generation with Language Models Paper • 2602.21461 • Published Feb 25 • 12
SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment Paper • 2603.13669 • Published Mar 14 • 1
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published Dec 7, 2025 • 29
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published Dec 8, 2025 • 46
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published Dec 24, 2025 • 23
Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance Paper • 2601.01887 • Published Jan 5 • 1
Shape of Thought: When Distribution Matters More than Correctness in Reasoning Tasks Paper • 2512.22255 • Published Dec 24, 2025 • 6
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 78
VisCoder2: Building Multi-Language Visualization Coding Agents Paper • 2510.23642 • Published Oct 24, 2025 • 22
VideoScore2: Think before You Score in Generative Video Evaluation Paper • 2509.22799 • Published Sep 26, 2025 • 26
Locket: Robust Feature-Locking Technique for Language Models Paper • 2510.12117 • Published Oct 14, 2025 • 1
Bench-NPIN: Benchmarking Non-prehensile Interactive Navigation Paper • 2505.12084 • Published May 17, 2025 • 2
Real-Time Navigation for Autonomous Surface Vehicles In Ice-Covered Waters Paper • 2302.11601 • Published Feb 22, 2023
Hallucination Score: Towards Mitigating Hallucinations in Generative Image Super-Resolution Paper • 2507.14367 • Published Jul 18, 2025
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published May 21, 2025 • 53