Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step 8 days ago • 18
🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do Mar 10 • 38
DARWIN-Family 비드래프트 FINAL-Bench/Metacognitive Viewer • Updated Feb 27 • 100 • 899 • 90 Running Featured 50 Leaderboard - FINAL Bench 'Metacognitive' 🚀 50 Metacognitive Running 79 ALL Bench Leaderboard 🚀 79 ALL Bench Leaderboard FINAL-Bench/Darwin-4B-Genesis Text Generation • 8B • Updated 7 days ago • 463 • 33
DARWIN-Family 비드래프트 FINAL-Bench/Metacognitive Viewer • Updated Feb 27 • 100 • 899 • 90 Running Featured 50 Leaderboard - FINAL Bench 'Metacognitive' 🚀 50 Metacognitive Running 79 ALL Bench Leaderboard 🚀 79 ALL Bench Leaderboard FINAL-Bench/Darwin-4B-Genesis Text Generation • 8B • Updated 7 days ago • 463 • 33