AI & ML interests
causality
Organizations
None yet
zzhang1987/Qwen3-LLMOPT-SFT-14B
Text Generation • 15B • Updated • 2
zzhang1987/Qwen2.5-LLMOPT-SFT-7B
Text Generation • 8B • Updated • 5
zzhang1987/Qwen2.5-7B-Instruct-GRPO
8B • Updated • 7
zzhang1987/Qwen2.5-3B-Open-R1-SFT
Text Generation • 3B • Updated • 4
zzhang1987/Qwen2.5-3B-Instruct-GRPO
3B • Updated • 1
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill
Image-Text-to-Text • 4B • Updated • 3
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill-select
Image-Text-to-Text • 4B • Updated • 3
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill_max_len1k
Updated
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-DistillLORA
Updated
zzhang1987/Qwen2.5-VL-7B-Instruct-Open-R1-Distill
zzhang1987/Qwen2.5-1.5B-Open-R1-Distill
Text Generation • 2B • Updated • 8