Qwen or not?

by clover-supply - opened Jan 11

Jan 11

The model page says it's based on Qwen, but when I try to convert the model to GGUF it says the architecture is not supported. Is it too different now?

Geralt-Targaryen

CodeFuse AI org Jan 12

Yes, it's based on Qwen2.5. However, as described in the technical report, we apply a PMA layer on top of the model, so you will need to load with trust_remote_code=True.
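For anyone following along, loading with that flag might look roughly like the sketch below. It assumes the Hugging Face transformers library; the repo id `codefuse-ai/C2LLM-0.5B` and the helper name `load_c2llm` are placeholders for illustration, not confirmed in this thread, so check the model page for the exact path and recommended usage. The extra PMA layer lives in the repo's custom code, which is probably why converters that only know the stock Qwen2.5 architecture reject the model.

```python
# Assumed repo id: the thread names the model C2LLM-0.5B but not its full Hub path.
MODEL_ID = "codefuse-ai/C2LLM-0.5B"


def load_c2llm(model_id: str = MODEL_ID):
    """Load tokenizer and model, allowing the repo's custom modeling code to run."""
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    # trust_remote_code=True lets transformers execute the Python files shipped
    # in the model repo (here, the code that adds the PMA layer on top of Qwen2.5).
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModel.from_pretrained(model_id, trust_remote_code=True)
    model.eval()  # inference mode: disables dropout
    return tokenizer, model
```

Calling `tokenizer, model = load_c2llm()` downloads the weights on first use. Since `trust_remote_code=True` runs code from the repo on your machine, it is worth reviewing that code first.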

rayzinnz

Feb 8

How many input tokens does the model support, and how many embedding dimensions does it output, please?

Geralt-Targaryen

CodeFuse AI org Feb 9

C2LLM-0.5B has an embedding dimension of 896, and C2LLM-7B has an embedding dimension of 3584. Both models support 8192 input tokens.
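To put those numbers in context, here is a small sketch that estimates how much storage a vector index would need and checks whether an input fits the token limit. The dimensions and the 8192-token limit come from the reply above; the helper names and the float32 (4 bytes per value) default are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    embedding_dim: int
    max_input_tokens: int


# Figures as stated in this thread.
SPECS = {
    "C2LLM-0.5B": ModelSpec(embedding_dim=896, max_input_tokens=8192),
    "C2LLM-7B": ModelSpec(embedding_dim=3584, max_input_tokens=8192),
}


def index_size_bytes(model_name: str, num_vectors: int, bytes_per_value: int = 4) -> int:
    """Raw storage for num_vectors embeddings (float32 assumed by default)."""
    return num_vectors * SPECS[model_name].embedding_dim * bytes_per_value


def fits_in_context(model_name: str, num_tokens: int) -> bool:
    """True if an input of num_tokens tokens is within the model's limit."""
    return num_tokens <= SPECS[model_name].max_input_tokens
```

For example, `index_size_bytes("C2LLM-0.5B", 1_000_000)` gives 3,584,000,000 bytes (about 3.6 GB) for a million float32 vectors, and the 7B model needs four times that.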
