SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published 22 days ago • 13
SOD Collection SOD (Step-wise On-policy Distillation) model family for small language model agents. • 3 items • Updated 17 days ago • 1