view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +5 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra • 3 days ago • 30
view article Article Dell Enterprise Hub at Dell Tech World 2026: new models, new platforms, faster to production balaatdell • about 19 hours ago • 6
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 4 days ago • 117
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • 18 days ago • 23
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • 21 days ago • 38
view article Article Adding Benchmaxxer Repellant to the Open ASR Leaderboard +9 bezzam, Steveeeeeeen, eustlb, SBruccoleriAppen, jmss-appen, c-e-ford-appen, wgb14, YukaiHuang, like2026, logicbean, ally-lxl • 24 days ago • 17
view article Article Introducing the agentic robotics appstore for 10,000 Reachy Minis clem • 23 days ago • 35
view article Article Hugging Face on AMD Instinct MI300 GPU +2 fxmarty, mohitsha, seungrokj, mfuntowicz • May 21, 2024 • 16
view article Article Hugging Face and AMD partner on accelerating state-of-the-art models for CPU and GPU platforms juliensimon • Jun 13, 2023 • 5
view article Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents nvidia • Apr 28 • 60
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 47