Running on CPU Upgrade Agents Featured 1.32k Open ASR Leaderboard 🏆 1.32k Explore and compare speech‑recognition model benchmarks
Running 3.8k The Ultra-Scale Playbook 🌌 3.8k The ultimate guide to training LLM on large GPU Clusters
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7, 2025 • 154
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Paper • 2502.08690 • Published Feb 12, 2025 • 43
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published Feb 13, 2025 • 192
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 264
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade Agents 1.01k Open VLM Leaderboard 🌎 1.01k VLMEvalKit Evaluation Results Collection