Open to Work

D B PRO

d-s-b

AI & ML interests

Exploring

Recent Activity

liked a model 3 days ago

google/translategemma-12b-it

Image-Text-to-Text • Updated Jan 28 • 14.7k • 300

upvoted an article 8 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 152

liked a Space 8 days ago

AdithyaSK/rl-environments-guide

The ultimate guide to RL environments: building and scaling them in the LLM era

📝 • 161

Building and scaling RL environments for LLM training

upvoted an article about 1 month ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27 • 75

liked a model about 2 months ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated Apr 6 • 219k • 2.84k

liked a Space 2 months ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝 • 234

Explore synthetic data experiments on a virtual bookshelf

upvoted an article 3 months ago

Article

Optimization story: Bloom inference

Narsil • Oct 12, 2022 • 8

liked a model 3 months ago

mistralai/Voxtral-Mini-4B-Realtime-2602

Automatic Speech Recognition • 4B • Updated Mar 11 • 1.4M • 848

upvoted 4 articles 6 months ago

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq • Jun 4, 2025 • 119

Article

Mastering Tensor Dimensions in Transformers

not-lain • Jan 12, 2025 • 173

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain • Jan 30, 2025 • 331

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato • Nov 25, 2025 • 388

updated a model 6 months ago

d-s-b/Qwen-3-0.6-medical

Updated Nov 25, 2025

published a model 6 months ago

d-s-b/Qwen-3-0.6-medical

Updated Nov 25, 2025

liked 2 Spaces 6 months ago

FineWeb: decanting the web for the finest text data at scale

🍷 • 1.34k

Explore and download the FineWeb web‑text dataset

The Ultra-Scale Playbook

🌌 • 3.85k

The ultimate guide to training LLM on large GPU Clusters

liked a Space 7 months ago

The Smol Training Playbook

📚 • 3.18k

The secrets to building world-class LLMs

updated a model 7 months ago

d-s-b/gemma-270m-gsm8k

Text Generation • 0.3B • Updated Oct 30, 2025 • 2

published a model 7 months ago

d-s-b/gemma-270m-gsm8k

Text Generation • 0.3B • Updated Oct 30, 2025 • 2

updated a model 9 months ago

d-s-b/meme

Updated Aug 30, 2025 • 1
