唐紫怡
lucas-hill
AI & ML interests
None yet
Recent Activity
upvoted a paper about 20 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards liked a model 1 day ago
tencent/Hy-MT2-1.8BOrganizations
None yet