Xingshan Zeng
zxshamson
AI & ML interests
None yet
Recent Activity
authored a paper about 20 hours ago
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models
authored a paper about 20 hours ago
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
authored a paper about 20 hours ago
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models