πŸŽ™οΈ Whisper Small Karakalpak ASR

This model is a fine-tuned version of openai/whisper-small for Automatic Speech Recognition (ASR) in the Karakalpak language.


πŸ‘€ Founder

Quyashbek Allanazarov


🀝 Acknowledgements

  • πŸŽ“ New Uzbekistan University β€” for knowledge, research environment, and academic direction
  • 🏦 Xalq Banki AI Lab β€” for providing GPU resources and supporting the technical direction of the project

πŸ“Š Model Performance

Evaluation was performed on a held-out test set.

Metric Score
WER (Word Error Rate) 31.42%
CER (Character Error Rate) 9.33%

Test Details

  • Total samples: 504
  • Sampling rate: 16 kHz
  • Model size: Small

πŸ§ͺ Inference Example

import torch
from transformers import WhisperForConditionalGeneration, WhisperProcessor

model_id = "your-username/whisper-small-karakalpak"
processor = WhisperProcessor.from_pretrained(model_id)
model = WhisperForConditionalGeneration.from_pretrained(model_id)

# inputs = processor(audio, sampling_rate=16000, return_tensors="pt")
# generated_ids = model.generate(inputs.input_features)
# transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
Downloads last month
20
Safetensors
Model size
0.2B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for Quyashbek/whisper-small-karakalpak

Finetuned
(3438)
this model

Collection including Quyashbek/whisper-small-karakalpak