Lorenzo

lsteno

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

Qwen 3 4B RLM RLVR

published a model 2 days ago

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr5e-6-depth1-v1

published a model 2 days ago

lsteno/qwen3-rlm-depth1-r64-a128-lr1e-5-s150-bal35f40v1-lora

Organizations

datasets 1

lsteno/BEEG-agents

Viewer • Updated 7 days ago • 3.02k • 133

Collections 1

lsteno/Qwen3-4B-Instruct-2507-RLM-SFT-v3-per-root-turn

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr5e-7-s150-lora

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr1e-5-s150-lora

lsteno/qwen3-rlm-depth1-r4-a8-lr1e-4-s150-bal35f40v1-lora

models 12

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr1e-5-depth1-v1

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr5e-6-depth1-v1

lsteno/qwen3-rlm-depth1-r64-a128-lr1e-5-s150-bal35f40v1-lora

lsteno/qwen3-rlm-depth1-r64-a128-lr5e-7-s150-bal35f40v1-lora

lsteno/qwen3-rlm-depth1-r16-a32-lr1e-4-s150-bal35f40v1-lora

lsteno/qwen3-rlm-depth1-r16-a32-lr1e-5-s150-bal35f40v1-lora

lsteno/qwen3-rlm-depth1-r16-a32-lr5e-7-s150-bal35f40v1-lora

lsteno/qwen3-rlm-depth1-r4-a8-lr1e-4-s150-bal35f40v1-lora

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr1e-5-s150-lora

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr5e-7-s150-lora
