The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents Paper • 2604.10577 • Published 5 days ago • 23
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 6 days ago • 72
Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers Paper • 2604.01128 • Published 15 days ago • 15
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 15 days ago • 31
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 144
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 185
Probing Cultural Signals in Large Language Models through Author Profiling Paper • 2603.16749 • Published about 1 month ago • 3
Exposing the Illusion of Fairness: Auditing Vulnerabilities to Distributional Manipulation Attacks Paper • 2507.20708 • Published Jul 28, 2025