EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs Paper β’ 2509.09174 β’ Published Sep 11, 2025 β’ 62
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper β’ 2508.14444 β’ Published Aug 20, 2025 β’ 47
Representing Speech Through Autoregressive Prediction of Cochlear Tokens Paper β’ 2508.11598 β’ Published Aug 15, 2025 β’ 17
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper β’ 2508.02193 β’ Published Aug 4, 2025 β’ 138
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper β’ 2507.22827 β’ Published Jul 30, 2025 β’ 101
OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder Paper β’ 2507.14129 β’ Published Jul 18, 2025 β’ 11
A Survey of Context Engineering for Large Language Models Paper β’ 2507.13334 β’ Published Jul 17, 2025 β’ 263
Test-Time Scaling with Reflective Generative Model Paper β’ 2507.01951 β’ Published Jul 2, 2025 β’ 108
SingLoRA: Low Rank Adaptation Using a Single Matrix Paper β’ 2507.05566 β’ Published Jul 8, 2025 β’ 116
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Paper β’ 2506.20639 β’ Published Jun 25, 2025 β’ 31
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper β’ 2506.20920 β’ Published Jun 26, 2025 β’ 78
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper β’ 2506.16406 β’ Published Jun 19, 2025 β’ 133