A Clark

aclark63

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 21 hours ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Paper • 2605.06139 • Published 5 days ago • 61

liked a model 5 days ago

MinhPhuc0804/me5-256-kiem-tra-di-t1-v2.2-epoch-10

upvoted a paper 7 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 9 days ago • 152

liked a dataset 11 days ago

QingyiSi/Alpaca-CoT

Preview • Updated Sep 14, 2023 • 17.9k • 758

liked a model 23 days ago

tencent/HY-World-2.0

Image-to-3D • Updated 1 day ago • 3.7k • 642

liked a model 28 days ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated 28 days ago • 2.69k • 905

liked a dataset 28 days ago

PhillyMac/Authentic_Leadership_Practical

Viewer • Updated 28 days ago • 307 • 151 • 1

liked a dataset 30 days ago

mlfoundations/MINT-1T-PDF-CC-2023-23

Viewer • Updated Sep 19, 2024 • 2.82M • 14.6k • 10

upvoted 2 papers 30 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 324

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 289

liked a dataset about 1 month ago

yahma/alpaca-cleaned

Viewer • Updated Apr 10, 2023 • 51.8k • 33.4k • 821

upvoted 5 papers about 1 month ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 187

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published Apr 3 • 34

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 627

Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving

Paper • 2604.01483 • Published Apr 1 • 7

liked a dataset about 1 month ago

bigcode/the-stack

Viewer • Updated Apr 13, 2023 • 546M • 16.7k • 995

upvoted a paper about 1 month ago

Superintelligence and Law

Paper • 2603.28669 • Published Mar 30 • 7

liked a dataset about 1 month ago

camel-ai/seta-env-seed2synth-seed

Preview • Updated Apr 5 • 38 • 1

liked a model about 1 month ago

snoobvn20265/zW9Wpkg9pRCqEPPR

Updated Apr 6 • 1
