Instructions to use Nanbeige/Nanbeige4.1-3B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Nanbeige/Nanbeige4.1-3B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Nanbeige/Nanbeige4.1-3B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Nanbeige/Nanbeige4.1-3B")
model = AutoModelForCausalLM.from_pretrained("Nanbeige/Nanbeige4.1-3B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Inference
Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Nanbeige/Nanbeige4.1-3B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Nanbeige/Nanbeige4.1-3B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Nanbeige/Nanbeige4.1-3B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Nanbeige/Nanbeige4.1-3B

SGLang

How to use Nanbeige/Nanbeige4.1-3B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Nanbeige/Nanbeige4.1-3B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Nanbeige/Nanbeige4.1-3B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Nanbeige/Nanbeige4.1-3B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Nanbeige/Nanbeige4.1-3B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Nanbeige/Nanbeige4.1-3B with Docker Model Runner:
```
docker model run hf.co/Nanbeige/Nanbeige4.1-3B
```

LiveCodeBench-Pro Version

#43

by beyoung - opened Apr 10

Discussion

beyoung

Apr 10

may I ask which version of the livecodebench-pro do you use? 24? 25? 25Q2? all?

jymmmmm

Nanbeige LLM Lab org Apr 10

This comment has been hidden

beyoung

Apr 10

So far as I know, 25Q3 do not have official test samples? see https://huggingface.co/datasets/QAQAQAQAQ/LiveCodeBench-Pro-Testcase

beyoung

Apr 10

but 25Q2 do have

jymmmmm

Nanbeige LLM Lab org Apr 10

Sorry for the confusion. I was mistaken earlier — the correct version is 25Q2, not 25Q3.

beyoung

Apr 11

Do you run 4 or 8 repeats to get the average or pass@1?

beyoung

Apr 14

cannot reproduce the score of livebench pro 25Q2 Medium, can you provide any suggestion? tricks? of prompts?

beyoung

Apr 15

Hi!~

jymmmmm

Nanbeige LLM Lab org Apr 15

Sorry for the delayed response. For LiveCodeBench-Pro, we only run the evaluation once to obtain the final score. For reproducibility, could you share your configuration?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment