GigaAM v3 e2e RNNT โ€” MLX

MLX port of GigaAM-v3 RNNT variant for Apple Silicon. Higher quality than CTC, ~77x realtime on M2 Max.

Usage

pip install git+https://github.com/aystream/gigaam-mlx.git
from gigaam_mlx import load_model, transcribe

model, tokenizer = load_model("rnnt")  # downloads automatically
text = transcribe(model, tokenizer, "recording.wav")

Or via CLI:

gigaam-mlx recording.wav --model-type rnnt

CTC vs RNNT

Variant Speed (20s chunk) Quality Full 18-min video
CTC 0.06s (~330x) Good 21.5s
RNNT (this) 0.26s (~77x) Better 25.0s

Links

Downloads last month
82
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for aystream/GigaAM-v3-e2e-rnnt-mlx

Finetuned
(2)
this model

Paper for aystream/GigaAM-v3-e2e-rnnt-mlx