Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated a model 3 days ago
mehuldamani/countdown_arl-sft-no-combine-v2
published a model 3 days ago
mehuldamani/countdown_arl-sft-no-combine-v2
updated a dataset 3 days ago
mehuldamani/neurips-story-main-story-features-sample-v1
Organizations
None yet