ishikaa/acquisition_student_gpt_llama8bins_numina_diversity Text Generation • 8B • Updated 7 days ago • 40
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_gradient Text Generation • 3B • Updated 7 days ago • 23
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_answer_variance Text Generation • 3B • Updated 7 days ago • 27
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_proximity Text Generation • 3B • Updated 7 days ago • 24
ishikaa/acquisition_student_gpt_qwen3bins_medmcqa_diversity Text Generation • 3B • Updated 7 days ago • 27
ishikaa/acquisition_student_RL_random_numina_llama8bins Text Generation • 8B • Updated 7 days ago • 20
ishikaa/acquisition_student_RL_random_medmcqa_llama8bins Text Generation • 8B • Updated 7 days ago • 15
ishikaa/acquisition_student_RL_base_llama8bins_medmcqa Text Generation • 8B • Updated 7 days ago • 28
ishikaa/acquisition_student_RL_llama8bins_medmcqa_format Text Generation • 8B • Updated 7 days ago • 27
ishikaa/acquisition_student_RL_llama8bins_medmcqa_proximity Text Generation • 8B • Updated 7 days ago • 25
ishikaa/acquisition_student_RL_llama8bins_numina_diversity Text Generation • 8B • Updated 7 days ago • 27
ishikaa/acquisition_student_RL_llama8bins_medmcqa_confidence Text Generation • 8B • Updated 7 days ago • 26
ishikaa/acquisition_student_RL_llama8bins_medmcqa_diversity Text Generation • 8B • Updated 7 days ago • 26
ishikaa/acquisition_student_RL_llama8bins_numina_gradient Text Generation • 8B • Updated 7 days ago • 26
ishikaa/acquisition_student_RL_DataEnvGym_numina_llama8bins Text Generation • 8B • Updated 7 days ago • 27
ishikaa/acquisition_student_RL_llama8bins_medmcqa_gradient Text Generation • 8B • Updated 7 days ago • 20
ishikaa/acquisition_student_RL_llama8bins_numina_confidence Text Generation • 8B • Updated 7 days ago • 27
ishikaa/acquisition_student_RL_llama8bins_numina_proximity Text Generation • 8B • Updated 7 days ago • 27
ishikaa/acquisition_student_RL_llama8bins_numina_format Text Generation • 8B • Updated 7 days ago • 26
ishikaa/acquisition_student_RL_llama8bins_numina_answer_variance Text Generation • 8B • Updated 7 days ago • 30
ishikaa/acquisition_student_RL_DataEnvGym_medmcqa_llama8bins Text Generation • 8B • Updated 7 days ago • 24
ishikaa/acquisition_student_RL_filtered_llama8bins_medmcqa Text Generation • 8B • Updated 7 days ago • 25
ishikaa/acquisition_student_RL_filtered_llama8bins_numina Text Generation • 8B • Updated 8 days ago • 43