MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 489k • 355
Qwen3-VL Collection Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats. • 56 items • Updated 26 days ago • 33
Running on Zero Agents 15 Qwen3-VL Multimodal Search Engine 🔥 15 Cross-modal text-image search powered by Qwen3-VL
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 167
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 32 items • Updated 5 days ago • 83