Hugging Face
mz.w
iiiiwis
2 followers · 3 following
AI & ML interests
None yet
Recent Activity
Authored a paper about 1 month ago: From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
Upvoted a paper about 1 month ago: From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
Upvoted a paper about 2 months ago: On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
Organizations
None yet
iiiiwis's models (1)
iiiiwis/DEMO_Agent • Text Generation • Updated Dec 10, 2024 • 2