SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 259
Pythia Scaling Suite Collection Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26, 2025 • 33
Encoders vs Decoders: the Ettin Suite Collection A collection of SOTA, open-data, paired encoder-only and decoder only models ranging from 17M params to 1B. See the paper at https://arxiv.org/abs/250 • 30 items • Updated Mar 2 • 29
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777
CIRCL/vulnerability-description-generation-gpt2-xl Text Generation • 2B • Updated Dec 6, 2025 • 31 • 3
VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification Paper • 2507.03607 • Published Jul 4, 2025 • 9
VLAI for CWE Guessing Collection A collection of models and datasets supporting the AI and NLP components of the Vulnerability-Lookup project, for CWE guessing. • 2 items • Updated Mar 23 • 3
VLAI for Severity Collection A collection of papers, models, and datasets supporting the AI and NLP components of the Vulnerability-Lookup project. • 9 items • Updated Apr 7 • 2
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 158
lmstudio-community/r1-1776-distill-llama-70b-GGUF Text Generation • 71B • Updated Feb 22, 2025 • 28 • 3