Gemma 3 4B

Model Capabilities

  • Text generation: instruction-following chat model
  • Vision: understands and reasons about images
  • Multilinguality: supports over 140 languages

The full description can be found on the original model page.

Getting Started

Download the model and its vision projection file through the Hugging Face UI, or run this on the command line:

wget https://huggingface.co/NobodyWho/Google_Gemma3-4B-GGUF/resolve/main/gemma-3-4b-it-Q4_K_M.gguf
wget https://huggingface.co/NobodyWho/Google_Gemma3-4B-GGUF/resolve/main/mmproj-BF16.gguf
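If wget is not available, the same two files can be fetched from Python. This is a minimal sketch, assuming the huggingface_hub package is installed; the helper names resolve_url and download_all are made up for this example and are not part of NobodyWho.

```python
from pathlib import Path

REPO_ID = "NobodyWho/Google_Gemma3-4B-GGUF"
FILES = ["gemma-3-4b-it-Q4_K_M.gguf", "mmproj-BF16.gguf"]


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build the direct-download URL used by the wget commands above."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def download_all(target_dir: str = ".") -> list[Path]:
    """Download both GGUF files into target_dir and return their local paths.

    Assumption: huggingface_hub is installed. It is imported here so that
    the rest of this snippet works without it.
    """
    from huggingface_hub import hf_hub_download

    return [
        Path(hf_hub_download(repo_id=REPO_ID, filename=name, local_dir=target_dir))
        for name in FILES
    ]
```

Calling download_all(".") should leave both files in the current directory, which matches the paths used in the examples in this card.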

Run with NobodyWho:

from nobodywho import Chat

chat = Chat("./gemma-3-4b-it-Q4_K_M.gguf")
response = chat.ask("What is the capital of Denmark?").completed()
print(response) # Copenhagen!

Vision

from nobodywho import Model, Chat, Prompt, Image, Text

model = Model(
    "./gemma-3-4b-it-Q4_K_M.gguf",
    projection_model_path="./mmproj-BF16.gguf"
)
chat = Chat(model, system_prompt="You are a helpful assistant.")

prompt = Prompt([
    Text("What do you see in this image?"),
    Image("./photo.png"),
])
response = chat.ask(prompt).completed()
print(response)
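The vision example needs three files on disk: the model, the projection file, and the image. The sketch below wraps the same NobodyWho calls shown above in a function that checks for those files first. The helpers missing_files and describe_image are hypothetical names introduced for illustration, not part of the library.

```python
from pathlib import Path


def missing_files(*paths: str) -> list[str]:
    """Return the subset of paths that do not point at an existing file."""
    return [str(p) for p in paths if not Path(p).is_file()]


def describe_image(
    image_path: str,
    model_path: str = "./gemma-3-4b-it-Q4_K_M.gguf",
    projection_path: str = "./mmproj-BF16.gguf",
) -> str:
    """Ask the model what it sees in one image, failing early on missing files."""
    missing = missing_files(model_path, projection_path, image_path)
    if missing:
        raise FileNotFoundError("missing file(s): " + ", ".join(missing))

    # Imported late so the file check above works without nobodywho installed.
    from nobodywho import Model, Chat, Prompt, Image, Text

    model = Model(model_path, projection_model_path=projection_path)
    chat = Chat(model, system_prompt="You are a helpful assistant.")
    prompt = Prompt([
        Text("What do you see in this image?"),
        Image(image_path),
    ])
    return chat.ask(prompt).completed()
```

With both GGUF files downloaded, print(describe_image("./photo.png")) would run the same request as the example above.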

Benchmarks

Coming soon.

How are these GGUFs different?

These GGUF files are built primarily for the NobodyWho inference library. Packaging them ourselves lets us guarantee that each file includes everything needed to be truly portable (sampler configuration, token descriptions, etc.). This is our informal effort toward more standardized GGUF files that contain everything the runtime needs, so that users can run a model without extra fiddling or per-model patching.
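The extra information mentioned above lives in the key-value metadata section of a GGUF file. As a rough illustration, the sketch below reads only the fixed-size header and reports how many metadata entries a file carries. It assumes the little-endian GGUF version 2 or 3 header layout (4-byte magic, 32-bit version, 64-bit tensor count, 64-bit metadata count); read_gguf_header and GGUFHeader are names made up for this example, and it does not decode the individual keys.

```python
import struct
from typing import BinaryIO, NamedTuple


class GGUFHeader(NamedTuple):
    version: int
    tensor_count: int
    metadata_kv_count: int


def read_gguf_header(stream: BinaryIO) -> GGUFHeader:
    """Parse the first 24 bytes of a GGUF file (version 2 or later assumed)."""
    raw = stream.read(24)
    if len(raw) < 24 or raw[:4] != b"GGUF":
        raise ValueError("not a GGUF file: bad or truncated header")
    # "<IQQ": little-endian uint32 version, uint64 tensor count, uint64 KV count.
    version, tensor_count, kv_count = struct.unpack("<IQQ", raw[4:24])
    if version < 2:
        # Version 1 used 32-bit counts, so this layout would not apply.
        raise ValueError(f"unsupported GGUF version: {version}")
    return GGUFHeader(version, tensor_count, kv_count)
```

To inspect a downloaded file, open it in binary mode and pass the handle, for example read_gguf_header(open("./gemma-3-4b-it-Q4_K_M.gguf", "rb")).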

Model Details

Property          Value
Parameters        4B
Context length    131,072 tokens
Knowledge cutoff  August 2024
Languages         140+
License           Gemma Terms of Use
Base model        google/gemma-3-4b-it

Credits

GGUF quantizations provided by unsloth. Thanks!
