CL-From-Nothing/opd_polaris_hard_polaris_ROSE_warmup_qwen3-1.7b_epoch_1_mask_resp16384-T1.0-n8-topk16-step40 2B • Updated 1 day ago • 12
CL-From-Nothing/grpo_polaris_hard_polaris_POPE_warmup_40K-parquet_qwen3-4b_epoch_1_mask_resp16384-T1.0-n8 4B • Updated 3 days ago • 16
CL-From-Nothing/grpo_polaris_hard_polaris_ROSE_warmup_40K-parquet_qwen3-1.7b_epoch_1_mask_resp16384-T1.0-n8 2B • Updated 3 days ago • 13
CL-From-Nothing/opd_polaris_hard_polaris_warmup_polaris_offline_40K-qwen3-4b_resp16384-T1.0-n8-topk16_step40 4B • Updated 5 days ago • 14
CL-From-Nothing/grpo_polaris_hard_polaris_offline_40K-qwen3-1.7b-mask_resp16384-T1.0-n8_step40 2B • Updated 5 days ago • 11
CL-From-Nothing/grpo_polaris_hard_polaris_offline_40K-qwen3-4b-mask-step312_resp16384-T1.0-n8_step40 4B • Updated 6 days ago • 12
CL-From-Nothing/grpo_polaris_hard_polaris_offline_40K-qwen3-4b-mask-k4096-step312_resp16384-T1.0-n8_step40 4B • Updated 6 days ago • 14
CL-From-Nothing/polaris_warmup_polaris_offline_40K-parquet_qwen3-4b_epoch_1_mask_k4096_step312 4B • Updated 8 days ago • 20
CL-From-Nothing/polaris_warmup_polaris_offline_40K-parquet_qwen3-4b_epoch_1_mask_step312 4B • Updated 8 days ago • 19
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-MixedSFT-Thinking-epoch3 2B • Updated 13 days ago • 19
CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500 Text Generation • 4B • Updated 29 days ago • 359
CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500 Text Generation • 2B • Updated about 1 month ago • 200
CL-From-Nothing/teacher_prefix_sudoku_10K_qwen3_4b_thinking_continual_qwen3-1-7b-parquet_qwen3-1.7b_epoch_3 2B • Updated Apr 16 • 1