How to use nvidia/parakeet-tdt-0.6b-v3 with NeMo:
    import nemo.collections.asr as nemo_asr
    asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/parakeet-tdt-0.6b-v3")
    transcriptions = asr_model.transcribe(["file.wav"])
Does it support Realtime?
#23
by ism0il - opened
Hello! I'm wondering whether this model can be used for real-time streaming conversation, like the multilingual parakeet-1.1b-rnnt-multilingual-asr model.
ism0il changed discussion title from "Realtime?" to "Does it support Realtime?"
They have released https://huggingface.co/nvidia/nemotron-speech-streaming-en-0.6b. I guess its multilingual version will come later.
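For anyone who wants to experiment with an offline model in the meantime, one common workaround is to cut the incoming audio into short overlapping windows and transcribe each window as it arrives. The sketch below shows only the windowing logic. It is not true streaming and is not something the model card documents for this model: `iter_chunks`, `pseudo_stream`, and the `transcribe_fn` callback are hypothetical names introduced here, and the window and overlap lengths are arbitrary assumptions.

```python
from typing import Callable, Iterator, List, Sequence


def iter_chunks(samples: Sequence[float], sample_rate: int,
                chunk_s: float = 2.0, overlap_s: float = 0.5) -> Iterator[Sequence[float]]:
    """Yield fixed-length windows of audio, each overlapping the previous one."""
    chunk = int(chunk_s * sample_rate)          # window length in samples
    step = chunk - int(overlap_s * sample_rate)  # hop between window starts
    if chunk <= 0 or step <= 0:
        raise ValueError("chunk_s must be positive and larger than overlap_s")
    for start in range(0, len(samples), step):
        yield samples[start:start + chunk]
        if start + chunk >= len(samples):        # this window reached the end
            break


def pseudo_stream(samples: Sequence[float], sample_rate: int,
                  transcribe_fn: Callable[[Sequence[float]], str],
                  **window_kwargs) -> List[str]:
    """Run a caller-supplied transcriber (hypothetical hook) on each window."""
    return [transcribe_fn(chunk)
            for chunk in iter_chunks(samples, sample_rate, **window_kwargs)]


if __name__ == "__main__":
    # Stand-in transcriber so the sketch runs without NeMo installed.
    fake_audio = [0.0] * 16000 * 5  # 5 seconds of silence at 16 kHz
    print(pseudo_stream(fake_audio, 16000, lambda c: f"<{len(c)} samples>"))
```

In practice `transcribe_fn` would wrap a call such as `asr_model.transcribe(...)` on the window. How the window is handed to the model (for example, written to a temporary WAV file first) is left to the caller and is an assumption here. Overlapping windows will also repeat words at the boundaries, so the partial transcripts need to be merged, and accuracy near the cuts is usually worse than with a model trained for streaming.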