🎙️ Whisper Small Karakalpak ASR

This model is a fine-tuned version of openai/whisper-small for Automatic Speech Recognition (ASR) in the Karakalpak language.

👤 Founder

Quyashbek Allanazarov

🤝 Acknowledgements

🎓 New Uzbekistan University — for knowledge, research environment, and academic direction
🏦 Xalq Banki AI Lab — for providing GPU resources and supporting the technical direction of the project

📊 Model Performance

Evaluation was performed on a held-out test set.

Metric	Score
WER (Word Error Rate)	31.42%
CER (Character Error Rate)	9.33%

Test Details

Total samples: 504
Sampling rate: 16 kHz
Model size: Small

🧪 Inference Example

import torch
from transformers import WhisperForConditionalGeneration, WhisperProcessor

model_id = "your-username/whisper-small-karakalpak"
processor = WhisperProcessor.from_pretrained(model_id)
model = WhisperForConditionalGeneration.from_pretrained(model_id)

# inputs = processor(audio, sampling_rate=16000, return_tensors="pt")
# generated_ids = model.generate(inputs.input_features)
# transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]

Downloads last month: 20

Safetensors

Model size

0.2B params

Tensor type

F32

Model tree for Quyashbek/whisper-small-karakalpak

Base model

openai/whisper-small

Finetuned

(3438)

this model

Collection including Quyashbek/whisper-small-karakalpak

Karakalpak ASR

Collection

The collection of the Fine tuned Karakalpak models • 6 items • Updated 5 days ago • 1