LLM-OS-Models/gemma-4-E4B-Terminal-SFT-Native-Liquid-2Epoch • Text Generation • 8B • Updated 1 minute ago • 1.08k • 1
Agentic AI Systems Should Be Designed as Marginal Token Allocators • Paper • 2605.01214 • Published 10 days ago • 4
Action Images: End-to-End Policy Learning via Multiview Video Generation • Paper • 2604.06168 • Published Apr 7 • 14
Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing • Paper • 2604.02288 • Published Apr 2 • 33
WAFT-Stereo: Warping-Alone Field Transforms for Stereo Matching • Paper • 2603.24836 • Published Mar 25 • 6
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models • Paper • 2603.25716 • Published Mar 26 • 156
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning • Paper • 2603.04597 • Published Mar 4 • 210
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models • Paper • 2602.22859 • Published Feb 26 • 151