Automatic Speech Recognition
NeMo
PyTorch
speech-recognition
cache-aware ASR
streaming-asr
speech
audio
FastConformer
RNNT
Parakeet
ASR
NeMo
Eval Results (legacy)
Instructions to use nvidia/nemotron-speech-streaming-en-0.6b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/nemotron-speech-streaming-en-0.6b with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/nemotron-speech-streaming-en-0.6b") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
Mel-spectrogram boundary artifacts degrade quality in real-time mic streaming β fix inside
ππ₯ 2
1
#15 opened about 2 months ago
by
Twizz
Are there any more metrics or articles on how this compares to other models for streaming?
1
#13 opened 2 months ago
by
minimanatee
How about a TDT based version instead of RNN-T?
β 4
#12 opened 3 months ago
by
yash-levelai
missing punctuation marks
ππ€ 1
#11 opened 3 months ago
by
Kerwin11
Can we expect an ONNX quant?
β 3
3
#6 opened 4 months ago
by
SuperPauly
MLX version planned?
3
#3 opened 4 months ago
by
Amit-I
Multilingual version planned?
13
#2 opened 5 months ago
by
fosple