
9

Lorenzo

lsteno


AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

NemoStation/Marlin-2B

updated a collection 4 days ago

Qwen 3 4B RLM RLVR

published a model 4 days ago

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr5e-6-depth1-v1


Organizations

liked a model 1 day ago

NemoStation/Marlin-2B

Video-Text-to-Text • 2B • Updated 8 days ago • 9.14k • 416

updated a collection 4 days ago

Qwen 3 4B RLM RLVR

LoRA adapters, full fine-tuned checkpoints, and SFT warmup models trained with RLVR in the recursive language model depth-1 harness. • 12 items • Updated 4 days ago

published 9 models 4 days ago

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr5e-6-depth1-v1

Text Generation • 4B • Updated 5 days ago • 71

lsteno/qwen3-rlm-depth1-r64-a128-lr1e-5-s150-bal35f40v1-lora

Updated 7 days ago • 8

lsteno/qwen3-rlm-depth1-r64-a128-lr5e-7-s150-bal35f40v1-lora

Updated 7 days ago • 10

lsteno/qwen3-rlm-depth1-r16-a32-lr1e-4-s150-bal35f40v1-lora

Updated 7 days ago • 9

lsteno/qwen3-rlm-depth1-r16-a32-lr1e-5-s150-bal35f40v1-lora

Updated 8 days ago • 10

lsteno/qwen3-rlm-depth1-r16-a32-lr5e-7-s150-bal35f40v1-lora

Updated 8 days ago • 10

lsteno/qwen3-rlm-depth1-r4-a8-lr1e-4-s150-bal35f40v1-lora

Updated 9 days ago • 3

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr1e-5-s150-lora

Updated 10 days ago • 3

lsteno/Qwen3-4B-Instruct-2507-RLM-SFT-v3-per-root-turn

4B • Updated 11 days ago • 65
