arxiv:2605.16679
Zeyu Tang
zeyutang
AI & ML interests
Trustworthy AI
Recent Activity
authored a paper 3 days ago
Fantastic Bugs and Where to Find Them in AI Benchmarks
authored a paper 3 days ago
CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?
liked a dataset 4 days ago
actava/chi-bench