Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Alibaba-NLP
/
gme-Qwen2-VL-2B-Instruct

Sentence Similarity
sentence-transformers
Safetensors
Transformers
English
Chinese
qwen2_vl
image-text-to-text
mteb
Qwen2-VL
vidore
custom_code
Eval Results (legacy)
Model card Files Files and versions
xet
Community
26

Instructions to use Alibaba-NLP/gme-Qwen2-VL-2B-Instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use Alibaba-NLP/gme-Qwen2-VL-2B-Instruct with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("Alibaba-NLP/gme-Qwen2-VL-2B-Instruct", trust_remote_code=True)
    
    sentences = [
        "That is a happy person",
        "That is a happy dog",
        "That is a very happy person",
        "Today is a sunny day"
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Transformers

    How to use Alibaba-NLP/gme-Qwen2-VL-2B-Instruct with Transformers:

    # Load model directly
    from transformers import AutoProcessor, AutoModelForImageTextToText
    
    processor = AutoProcessor.from_pretrained("Alibaba-NLP/gme-Qwen2-VL-2B-Instruct", trust_remote_code=True)
    model = AutoModelForImageTextToText.from_pretrained("Alibaba-NLP/gme-Qwen2-VL-2B-Instruct", trust_remote_code=True)
  • Notebooks
  • Google Colab
  • Kaggle
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Training strategy

#26 opened 8 days ago by
Mashiro-tomes

Do you have any plans to update the embedding model using Qwen3-VL as the base model ?

2
#25 opened 5 months ago by
xmulk

How to calculate video embedding?

#24 opened 8 months ago by
leaf-potato

modelscope和huggingface下载的config文件不一样

#23 opened 10 months ago by
andrewdeeplearning

About fine-tuned

1
#22 opened 10 months ago by
sherlhw

How to enable bidirectional attention?

#21 opened 11 months ago by
Adenialzz

finetuning

#20 opened 11 months ago by
Hadbeen123

error with AutoModel GmeQwen2VLConfig after upgrade

7
#16 opened 12 months ago by
findpather

Does batch_size=128 during training refer to the global or single-GPU batch size, and is it trained using DeepSpeed Zero3?

1
#13 opened about 1 year ago by
Hipanda

training code supported

#8 opened about 1 year ago by
tastelikefeet

Results on M-BEIR

2
#7 opened about 1 year ago by
wongyukim

LoRA weights

#6 opened over 1 year ago by
NohTow

Fused-Modal Data

#5 opened over 1 year ago by
paralym

training batch size

1
#3 opened over 1 year ago by
yyy111

Training code release

6
#2 opened over 1 year ago by
listli
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs