OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Paper • 2605.17757 • Published 6 days ago • 61
SNLP: Layer-Parallel Inference via Structured Newton Corrections Paper • 2605.17842 • Published 6 days ago • 4
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 25 days ago • 108
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework Paper • 2604.06170 • Published Apr 7 • 31
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published Apr 6 • 13
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 265