Automatic Speech Recognition
NeMo
Safetensors
PyTorch
parakeet_tdt
speech
audio
Transducer
TDT
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Transformers
Eval Results (legacy)
Eval Results
Instructions to use nvidia/parakeet-tdt-0.6b-v3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/parakeet-tdt-0.6b-v3 with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/parakeet-tdt-0.6b-v3") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
The model sometimes drops full sentences
#42 opened 12 days ago
by
nenad1002
CUDA out of memory with long audio
#41 opened 26 days ago
by
CarlsMM7
Model support per CrispASR β pure C++ inference with GGUF (no Python/NeMo needed)
#38 opened about 1 month ago
by
cstr
Add Open ASR Leaderboard evaluation results
#36 opened about 1 month ago
by
SaylorTwift
Environment setup
#35 opened about 2 months ago
by
pr-tet-usr
Inconsistent number transcription - often letters instead of digits (french language)
#34 opened about 2 months ago
by
poulpor
requirements.txt ? What Python version does this need?
2
#33 opened 2 months ago
by
Jhaut
eror timestamp
1
#32 opened 3 months ago
by
ghoza
Word boosting/Custom vocabulary
π₯ 4
1
#31 opened 4 months ago
by
buzzb0x
I want transcription not translation
π 5
1
#29 opened 5 months ago
by
smartire
Update Readme
#27 opened 6 months ago
by
jbalam-nv
How to specify the output language?
π 4
1
#26 opened 6 months ago
by
dragonhunterau
Will there be support for other languages?
ππ 9
2
#25 opened 6 months ago
by
altunenes
First 273 vocabulary tokens
#24 opened 6 months ago
by
comodoro
Does it support Realtime ?
1
#23 opened 6 months ago
by
ism0il
Separate languages into distinct models. How?
#22 opened 7 months ago
by
mv24
Running parakeet-tdt-0.6b-v3 on Jetson AGX Orin, Thor, or Spark!
4
#21 opened 7 months ago
by
raymondlo84
Seeking a Clear Tutorial for Fine-Tuning NVIDIA NeMo Models on New English Audio Domains
1
#19 opened 7 months ago
by
jacktol
Streaming question
1
#18 opened 7 months ago
by
koifish12
training script
1
#17 opened 7 months ago
by
sugintama
[EXAMPLE] Working streaming POC with Gradio MIC input.
#16 opened 8 months ago
by
WJ88
Can support for Irish Gaelic be added?
1
#15 opened 8 months ago
by
cgiwouter
Fine-tune on the other Language
3
#14 opened 8 months ago
by
Chonlasitk
Questions about streaming with Parakeet and TDT merging methods
π 1
2
#13 opened 8 months ago
by
alexandreacff
Missing sentences when transcribe some audios
β 8
#12 opened 8 months ago
by
josscii
Streaming?
1
#11 opened 9 months ago
by
dyqiang
Question about inference speed
1
#10 opened 9 months ago
by
cX1y
Async streaming container?
#9 opened 9 months ago
by
lukiggs
Is it possible to prompt or output language?
ππ 8
1
#8 opened 9 months ago
by
ndlc
Local Installation Video and Testing - Step by Step
β€οΈ 2
1
#6 opened 9 months ago
by
fahdmirzac
Japanese support plan?
π 5
6
#5 opened 9 months ago
by
sttt
Recognise separate voices
1
#4 opened 9 months ago
by
Jappie
Word boosting
βπ 2
2
#3 opened 9 months ago
by
stefanr123
training hyper-parameters
#2 opened 9 months ago
by
sugintama
Code switching
π 1
1
#1 opened 9 months ago
by
pscar