Performance of quantized models
#6
by darvec - opened
Is there any comparison between the quantized versions of Kimi K2.5 and other strong open-source models like Qwen3? For example, Qwen3-235B-A22B-Instruct-2507 is about 470 GB, which is roughly the same size as the UD-Q3_K_XL quant, but I'm not sure whether UD-Q3_K_XL actually performs better than Qwen.