This collection includes KnowRL-Nemotron-1.5B, train data, test data from the KnowRL project.
Linhao Yu
HasuerYu
AI & ML interests
None yet
Recent Activity
commentedon a paper 8 days ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance upvoted a paper 16 days ago
Co-Evolving Policy Distillation liked a model 27 days ago
HasuerYu/KnowRL-Nemotron-1.5BOrganizations
None yet