In a Training Loop 🔄

20 15 50

ouasdg

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

upvoted a paper about 23 hours ago

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

upvoted a paper about 23 hours ago

TextLDM: Language Modeling with Continuous Latent Diffusion

View all activity

Organizations

upvoted 3 papers about 23 hours ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 3 days ago • 58

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Paper • 2605.12496 • Published 5 days ago • 26

TextLDM: Language Modeling with Continuous Latent Diffusion

Paper • 2605.07748 • Published 9 days ago • 25

liked a model 3 days ago

ResembleAI/Dramabox

Text-to-Speech • Updated 3 days ago • 869 • 110

liked a dataset 27 days ago

k2-fsa/OpenDialog

Viewer • Updated 29 days ago • 996k • 586 • 22

liked a Space 27 days ago

DialogueSidon Demo

🔥

Separate two speakers from an audio or video recording

updated a Space 29 days ago

Dirt TTS

💻

text-to-speech demo

liked a model about 1 month ago

Skywork/Matrix-Game-3.0

Image-Text-to-Video • Updated 19 days ago • 425 • 115

liked a dataset about 2 months ago

IVLLab/MultiDialog

Updated Aug 29, 2024 • 1.08k • 30

liked a model about 2 months ago

AGI-Eval/Auto-ATT

Audio Classification • Updated May 16, 2025 • 34 • 4

upvoted an article 3 months ago

Article

🕳️ Attention Sinks in LLMs for endless fluency

tomaarsen

•

Oct 9, 2023

• 37

liked a model 3 months ago

amphion/Metis

Text-to-Speech • Updated Apr 13, 2025 • 20 • 30

liked 2 datasets 3 months ago

espnet/yodas2

Updated May 15, 2025 • 23.7k • 49

EQ4You/podcastvideos

Updated May 24, 2024 • 227 • 3

liked a model 3 months ago

Qwen/Qwen2.5-0.5B

Text Generation • 0.5B • Updated Sep 25, 2024 • 2.37M • • 402

liked a Space 4 months ago

The Tokenizer Playground

📝

662

Experiment with and compare different tokenizers

liked a model 4 months ago

speechbrain/spkrec-ecapa-voxceleb

Updated Feb 18, 2025 • 2.73M • 325

upvoted a paper 4 months ago

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Paper • 2601.11141 • Published Jan 16 • 23

liked a model 4 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • Updated Mar 2 • 451k • 2.5k

updated a model 4 months ago

ouasdg/vilex

Updated Jan 21

ouasdg

AI & ML interests

Recent Activity

Organizations

ouasdg's activity

DialogueSidon Demo

Dirt TTS

🕳️ Attention Sinks in LLMs for endless fluency

The Tokenizer Playground