9 32 157

Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

upvoted a paper about 1 month ago

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

liked a dataset about 1 month ago

nvidia/Nemotron-VLM-Dataset-v2

View all activity

Organizations

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 22 days ago • 5.02M • • 4.36k

upvoted a paper about 1 month ago

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Paper • 2604.15093 • Published Apr 16 • 30

liked a dataset about 1 month ago

nvidia/Nemotron-VLM-Dataset-v2

Viewer • Updated Dec 18, 2025 • 4.58M • 16k • 90

upvoted 2 papers about 1 month ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 66

Many-Tier Instruction Hierarchy in LLM Agents

Paper • 2604.09443 • Published Apr 10 • 16

upvoted a paper about 2 months ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 121

liked a dataset about 2 months ago

claw-eval/Claw-Eval

Benchmark • Updated 19 days ago • 4.66k • 26

upvoted a paper 2 months ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

liked a model 3 months ago

Qwen/Qwen3.5-35B-A3B-Base

Image-Text-to-Text • 36B • Updated Apr 23 • 74.4k • 131

liked a dataset 3 months ago

InternScience/SGI-Reasoning

Viewer • Updated Dec 30, 2025 • 291 • 696 • 6

upvoted a collection 3 months ago

SGI-Bench

Collection

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows • 12 items • Updated 22 days ago • 33

liked a dataset 4 months ago

ellisbrown/SIMS-VSI

Viewer • Updated Nov 7, 2025 • 242k • 495 • 7

liked a model 6 months ago

EssentialAI/rnj-1-instruct

Text Generation • 8B • Updated Dec 24, 2025 • 886 • • 318

liked a Space 6 months ago

CUA - Computer Use Agent 2.0

🤖

154

Launch an interactive web interface

liked a dataset 6 months ago

rl-research/dr-tulu-rl-data

Viewer • Updated Nov 25, 2025 • 4.88k • 422 • 13

liked a model 6 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 87.9k • 535

upvoted a paper 7 months ago

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 73

liked a dataset 7 months ago

zjunlp/DataMind-Data

Preview • Updated Oct 11, 2025 • 110 • 2

upvoted a paper 7 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 47

liked a dataset 7 months ago

neulab/agent-data-collection

Preview • Updated Mar 9 • 5.43k • 112

Xie

AI & ML interests

Recent Activity

Organizations

Zhihui's activity

CUA - Computer Use Agent 2.0