Running 167 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 167 Building and scaling RL environments for LLM training
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook 📚 3.18k The secrets to building world-class LLMs
NetherlandsForensicInstitute/ARM64BERT-embedding Sentence Similarity • 87.8M • Updated 25 days ago • 451 • 8
Menlo/ReZero-v0.1-llama-3.2-3b-it-grpo-250404 Text Generation • 3B • Updated Apr 17, 2025 • 154 • • 63