Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
41
223
53
KABI
dongguanting
Follow
MengjieDeng's profile picture
zstanjj's profile picture
dark-pen's profile picture
68 followers
·
106 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
4 days ago
RAGEN-2: Reasoning Collapse in Agentic RL
upvoted
a
paper
10 days ago
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
commented
on
a paper
10 days ago
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
View all activity
Organizations
dongguanting
's datasets
11
Sort: Recently updated
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Oct 17, 2025
•
1.07k
•
59
•
6
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
Oct 17, 2025
•
10k
•
112
•
4
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
Oct 17, 2025
•
54.6k
•
148
•
15
dongguanting/RAG-Error-Critic-100K
Viewer
•
Updated
Jun 28, 2025
•
100k
•
33
•
3
dongguanting/Tool-Star-SFT-54K
Viewer
•
Updated
May 29, 2025
•
54k
•
233
•
10
dongguanting/Multi-Tool-RL-10K
Viewer
•
Updated
May 25, 2025
•
10k
•
86
•
5
dongguanting/RAG-QA-40K
Viewer
•
Updated
Dec 27, 2024
•
32.8k
•
35
•
2
dongguanting/ShareGPT-12K
Viewer
•
Updated
Dec 27, 2024
•
12.9k
•
67
•
1
dongguanting/VIF-RAG-QA-110K
Viewer
•
Updated
Dec 27, 2024
•
111k
•
50
•
7
dongguanting/DotamathQA
Viewer
•
Updated
Dec 26, 2024
•
574k
•
47
•
2
dongguanting/VIF-RAG-QA-20K
Viewer
•
Updated
Nov 1, 2024
•
20k
•
7
•
4