mozilla-foundation/common_voice_17_0
Updated • 5.57k • 16
This is a VibeVoice 7B (Large) model LoRA finetune on a Hungarian audio dataset. For this particular test I used the CommonVoice 17.0 dataset's Hungarian config's train split.
To finetune the model I used the following code base.
Thank you for JPGallegoar for that amazing VibeVoice trainer!
This LoRA was trained on RunPod cloud GPUs.
To use the LoRA model you can use my modified fork until the following PR will be merged into the main branch of VibeVoice Community's repository.
Voice without LoRA
Voice WITH LoRA
Producing and sharing this kind of open-source work requires renting cloud GPUs, which gets expensive quickly. If you find it useful and would like me to keep contributing, your support is very much appreciated:
Base model
aoi-ot/VibeVoice-Large