🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement.
Shawn
csfufu
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Gen-Searcher: Reinforcing Agentic Search for Image Generation upvoted a paper 14 days ago
AI Can Learn Scientific Taste upvoted a paper about 2 months ago
Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments