VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

This is a VibeVoice 7B (Large) model LoRA finetune on a Hungarian audio dataset. For this particular test I used the CommonVoice 17.0 dataset's Hungarian config's train split.

To finetune the model I used the following code base.

Thank you for JPGallegoar for that amazing VibeVoice trainer!

Training

This LoRA was trained on RunPod cloud GPUs.

Inference

To use the LoRA model you can use my modified fork until the following PR will be merged into the main branch of VibeVoice Community's repository.

Examples

Voice without LoRA

Voice WITH LoRA

Support

Producing and sharing this kind of open-source work requires renting cloud GPUs, which gets expensive quickly. If you find it useful and would like me to keep contributing, your support is very much appreciated:

Ko-fi Liberapay

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

Adapter
(3)
this model

Dataset used to train Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

Collection including Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17