Models

19

Full-text search

Active filters: quantllm

codewithdark/Llama-3.2-3B-4bit

3B • Updated Dec 18, 2025 • 17

codewithdark/Llama-3.2-3B-GGUF-4bit

3B • Updated Dec 19, 2025 • 3

codewithdark/Llama-3.2-3B-4bit-mlx

Text Generation • 3B • Updated Dec 19, 2025 • 64

QuantLLM/Llama-3.2-3B-4bit-mlx

Text Generation • 3B • Updated Dec 20, 2025 • 15

QuantLLM/Llama-3.2-3B-2bit-mlx

Text Generation • 3B • Updated Dec 20, 2025 • 10

QuantLLM/Llama-3.2-3B-8bit-mlx

Text Generation • 3B • Updated Dec 20, 2025 • 61

QuantLLM/Llama-3.2-3B-5bit-mlx

Text Generation • 3B • Updated Dec 20, 2025 • 7

QuantLLM/Llama-3.2-3B-5bit-gguf

3B • Updated Dec 20, 2025 • 12

QuantLLM/Llama-3.2-3B-2bit-gguf

3B • Updated Dec 20, 2025 • 12

QuantLLM/functiongemma-270m-it-8bit-gguf

0.3B • Updated Dec 21, 2025 • 9 • 1

QuantLLM/functiongemma-270m-it-4bit-gguf

0.3B • Updated Dec 21, 2025 • 23

QuantLLM/functiongemma-270m-it-4bit-mlx

0.3B • Updated Dec 21, 2025 • 8

QuantLLM/Meta-Llama-3-70B-Instruct-4bit-gguf

Text Generation • 71B • Updated Apr 24 • 78 • 1

QuantLLM/Qwen3-0.6B-2bit-gguf

0.6B • Updated Apr 25 • 39

QuantLLM/Qwen3-0.6B-4bit-gguf

0.6B • Updated Apr 25 • 38

QuantLLM/Qwen3-0.6B-8bit-gguf

0.6B • Updated Apr 25 • 18

QuantLLM/TinyLlama-1.1B-Chat-GGUF

1B • Updated 28 days ago • 151

QuantLLM/SmolLM2-135M-QuantLLM

Text Generation • 0.1B • Updated 28 days ago • 369

QuantLLM/SmolLM2-135M-GGUF

0.1B • Updated 28 days ago • 449