Junyao Yang

TberiusJunyao

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 7 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 9 days ago • 312

liked a dataset 2 months ago

AI45Research/ATBench

Viewer • Updated 8 days ago • 1.5k • 875 • 34

liked 6 models 3 months ago

AI45Research/AgentDoG-FG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 14 • 9

AI45Research/AgentDoG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 21 • 11

AI45Research/AgentDoG-FG-Qwen2.5-7B

Text Classification • 8B • Updated Feb 6 • 23 • 8

AI45Research/AgentDoG-Qwen2.5-7B

Text Classification • 8B • Updated 8 days ago • 36 • 10

AI45Research/AgentDoG-FG-Qwen3-4B

Text Classification • 4B • Updated 8 days ago • 46 • 9

AI45Research/AgentDoG-Qwen3-4B

Text Classification • 4B • Updated 8 days ago • 217 • 23

upvoted a collection 3 months ago

AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated about 8 hours ago • 107

upvoted a paper 5 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

published 3 models about 1 year ago

TberiusJunyao/Qwen2.5-7B-Instruct-Math-GRPO

Updated Mar 27, 2025

TberiusJunyao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Mar 8, 2025

TberiusJunyao/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 6, 2025