delewis (Derek Lewis)

upvoted an article 4 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq +2 • Jan 28 • 156

upvoted an article about 1 year ago

Article

The Transformers Library: standardizing model definitions

lysandre, ArthurZ, pcuenq, julien-c +2 • May 15, 2025 • 122

upvoted a paper over 1 year ago

Enhancing Training Efficiency Using Packing with Flash Attention

Paper • 2407.09105 • Published Jul 12, 2024 • 17

upvoted an article over 1 year ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

RQlee, ArthurZ, achikundu, lwtr, rganti, mayank-mishra +4 • Aug 21, 2024 • 41

upvoted a paper almost 2 years ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 69

upvoted 2 articles almost 2 years ago

Article

Introduction to ggml

ngxson, ggerganov, slaren +1 • Aug 13, 2024 • 284

Article

Welcome Falcon Mamba: The first strong attention-free 7B model

JingweiZuo, yellowvm, DhiyaEddine, IChahed, ybelkada, Gkunsch +4 • Aug 12, 2024 • 113

upvoted a paper over 2 years ago

Extending LLMs' Context Window with 100 Samples

Paper • 2401.07004 • Published Jan 13, 2024 • 16

Derek Lewis

AI & ML interests

Organizations
