deepseek-ai/DeepSeek-V4-Flash • Text Generation • 158B • Updated 8 days ago • 1.37M • 1.07k
OpenCulture • Collection • A multilingual dataset of public domain books and newspapers. • 25 items • Updated Mar 2 • 134
Running on CPU Upgrade • Featured • 3.17k • The Smol Training Playbook • The secrets to building world-class LLMs
Article • Releasing Common Corpus: the largest public domain dataset for training LLMs • Pclanglais • Mar 20, 2024 • 32
Runtime error • Featured • 142 • smolagents LLM leaderboard • A leaderboard for LLMs powering smolagents