Speech-UK initiative

non-profit

https://github.com/egorsmkv/speech-recognition-uk

AI & ML interests

We are driving innovation in Ukrainian speech technology 🇺🇦

Recent Activity

Yehor updated a model about 6 hours ago

speech-uk/w2v-bert-test2-full

Yehor updated a model about 19 hours ago

speech-uk/w2v-bert-v3

Yehor published a model about 19 hours ago

speech-uk/w2v-bert-v3

updated a model about 6 hours ago

speech-uk/w2v-bert-test2-full

Updated about 6 hours ago

posted an update about 12 hours ago

I updated the demo for the new version of the W2V-BERT model for Ukrainian speech recognition.

This is a classic automatic speech recognition (ASR), or speech-to-text, task.

What's new in version three:

• more data: 1,200 hours
• a new SentencePiece tokenizer with a 512-token vocabulary
• feature extraction now runs in a Rust extension
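
To make the tokenizer item concrete, here is a minimal sketch of how a 512-token SentencePiece model could be trained with the `sentencepiece` Python package. Only the vocabulary size comes from the post. The model type, character coverage, and function names below are assumptions for illustration, not the settings actually used for version three.

```python
from pathlib import Path


def tokenizer_training_args(corpus_path: str, model_prefix: str) -> dict:
    """Build keyword arguments for SentencePieceTrainer.train."""
    return {
        "input": corpus_path,          # plain-text file, one transcript per line
        "model_prefix": model_prefix,  # trainer writes <prefix>.model and <prefix>.vocab
        "vocab_size": 512,             # vocabulary size stated in the post
        "model_type": "unigram",       # assumption: the post does not name the model type
        "character_coverage": 1.0,     # assumption: keep every character seen in the corpus
    }


def train_tokenizer(corpus_path: str, model_prefix: str) -> Path:
    """Train the tokenizer and return the path of the resulting model file."""
    import sentencepiece as spm  # third-party: pip install sentencepiece

    spm.SentencePieceTrainer.train(**tokenizer_training_args(corpus_path, model_prefix))
    return Path(model_prefix + ".model")
```

Calling `train_tokenizer("transcripts.txt", "uk_sp512")` (both names hypothetical) needs a reasonably large transcript file, since the corpus has to support 512 distinct pieces.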

Facts:

• Training started from the previous model to speed up learning.
• Training runs on two RTX 3090 GPUs with 24 GB of memory each.
• The model is well suited for fine-tuning because its training data is very diverse and mostly noisy.
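
As a rough check on the hardware item, the sketch below estimates how much of a 24 GB card the training state would occupy. The 0.6B parameter count is the one shown in the model listing. The optimizer and precision are not stated in the post, so the 16 bytes per parameter (fp32 weights, fp32 gradients, two Adam moments) and the assumption that each card holds a full copy of the model are guesses for illustration.

```python
PARAMS = 0.6e9        # parameter count from the model listing (0.6B)
GPU_MEMORY_GB = 24    # per-card memory stated in the post

# Assumption: fp32 weights (4 B) + fp32 gradients (4 B) + two Adam moments (8 B).
BYTES_PER_PARAM = 4 + 4 + 8


def training_state_gb(params: float, bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Memory for weights, gradients, and optimizer state, in GB (10**9 bytes)."""
    return params * bytes_per_param / 1e9


state_gb = training_state_gb(PARAMS)
headroom_gb = GPU_MEMORY_GB - state_gb  # what is left for activations and buffers
print(f"training state: {state_gb:.1f} GB, headroom per card: {headroom_gb:.1f} GB")
```

Under these assumptions the training state takes about 9.6 GB per card, and the remaining memory has to cover activations, which depend on batch size and audio length.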

You can try it here:

Yehor/w2v-bert-uk-v3

Download weights here:

speech-uk/w2v-bert-v3
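
For fetching the weights programmatically, a minimal sketch using `snapshot_download` from the `huggingface_hub` package is shown here. The repository ID is the one above. The function name and local directory are hypothetical, and the sketch only downloads the files, since the post does not say how the model should be loaded.

```python
REPO_ID = "speech-uk/w2v-bert-v3"  # repository named in the post


def download_weights(local_dir: str = "w2v-bert-v3") -> str:
    """Download every file in the model repository and return the local path."""
    from huggingface_hub import snapshot_download  # third-party: pip install huggingface_hub

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
```

Calling `download_weights()` requires network access and writes the files to the `w2v-bert-v3` directory.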

To support the Speech-UK initiative with a donation, use this Monobank link:

https://send.monobank.ua/jar/3Saxixsdua

updated a model about 19 hours ago

speech-uk/w2v-bert-v3

0.6B • Updated about 19 hours ago • 28

published a model about 19 hours ago

speech-uk/w2v-bert-v3

0.6B • Updated about 19 hours ago • 28

published a model 5 days ago

speech-uk/w2v-bert-test2-full

Updated about 6 hours ago

in speech-uk/voa-2-opus 9 days ago

[bot] Conversion to Parquet

#1 opened 9 days ago by parquet-converter

updated a dataset 10 days ago

speech-uk/voa-2-opus

Viewer • Updated 10 days ago • 177k • 230

published a dataset 10 days ago

speech-uk/voa-2-opus

Viewer • Updated 10 days ago • 177k • 230

in speech-uk/yodas2-opus 11 days ago

[bot] Conversion to Parquet

#1 opened 11 days ago by parquet-converter

updated a dataset 11 days ago

speech-uk/yodas2-opus

Viewer • Updated 11 days ago • 400k • 412

published a dataset 12 days ago

speech-uk/yodas2-opus

Viewer • Updated 11 days ago • 400k • 412

in speech-uk/cv22-opus 12 days ago

[bot] Conversion to Parquet

#1 opened 12 days ago by parquet-converter

in speech-uk/broadcast-opus 13 days ago

[bot] Conversion to Parquet

#1 opened 13 days ago by parquet-converter

in speech-uk/yodas2 13 days ago

[bot] Conversion to Parquet

#1 opened 6 months ago by parquet-converter

updated a dataset 13 days ago

speech-uk/voa-opus

Viewer • Updated 13 days ago • 326k • 251

published a dataset 13 days ago

speech-uk/voa-opus

Viewer • Updated 13 days ago • 326k • 251

updated a dataset 13 days ago

speech-uk/cv22-opus

Viewer • Updated 13 days ago • 89.2k • 272

published a dataset 13 days ago

speech-uk/cv22-opus

Viewer • Updated 13 days ago • 89.2k • 272

updated a dataset 13 days ago

speech-uk/broadcast-opus

Viewer • Updated 13 days ago • 137k • 49

published a dataset 13 days ago

speech-uk/broadcast-opus

Viewer • Updated 13 days ago • 137k • 49