Use from the Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="mlx-community/deepseek-vl2-tiny-4bit")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("mlx-community/deepseek-vl2-tiny-4bit", dtype="auto")
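
When asking several questions about different images, the `messages` payload from the pipeline example above can be produced by a small helper. This is a minimal sketch that assumes the same message layout as that example; `build_messages` is a hypothetical name and not part of the Transformers library.

```python
def build_messages(image_url, question):
    """Build a single-turn chat payload with one image and one text prompt.

    Hypothetical helper: it only mirrors the structure used in the
    pipeline example and does not call any library code.
    """
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "url": image_url},
                {"type": "text", "text": question},
            ],
        }
    ]


messages = build_messages(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG",
    "What animal is on the candy?",
)
```

The resulting list has the same shape as the hand-written `messages` above, so it could be passed in the same way, as `pipe(text=messages)`.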

mlx-community/deepseek-vl2-tiny-4bit

This model was converted to MLX format from prince-canuma/deepseek-vl2-tiny using mlx-vlm version 0.1.5. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/deepseek-vl2-tiny-4bit --max-tokens 100 --temp 0.0
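
The command above can also be assembled from Python, which may help when the prompt or image changes between runs. The sketch below is illustrative only: `build_generate_command` and `run_generate` are hypothetical helpers, and the `--prompt` and `--image` flags are assumed to be accepted by the installed `mlx_vlm.generate` version. Only `--model`, `--max-tokens`, and `--temp` are taken from the command shown here.

```python
import subprocess
import sys

MODEL_ID = "mlx-community/deepseek-vl2-tiny-4bit"


def build_generate_command(prompt=None, image=None, max_tokens=100, temp=0.0, model=MODEL_ID):
    """Return the argument list for `python -m mlx_vlm.generate` (hypothetical helper)."""
    cmd = [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", model,
        "--max-tokens", str(max_tokens),
        "--temp", str(temp),
    ]
    # Assumption: --prompt and --image are supported by the installed mlx-vlm.
    if prompt is not None:
        cmd += ["--prompt", prompt]
    if image is not None:
        cmd += ["--image", image]
    return cmd


def run_generate(**kwargs):
    """Run the generation command; raises CalledProcessError on a non-zero exit."""
    return subprocess.run(build_generate_command(**kwargs), check=True)
```

Calling `run_generate(prompt="What animal is on the candy?", image="candy.JPG")` would then launch the same module as the shell command, provided `mlx-vlm` is installed in the current interpreter's environment. Here `candy.JPG` stands for a local copy of the image.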