VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

This is a VibeVoice 7B (Large) model LoRA finetune on a Hungarian audio dataset. For this particular test I used the CommonVoice 17.0 dataset's Hungarian config's train split.

To finetune the model I used the following code base.

Thank you for JPGallegoar for that amazing VibeVoice trainer!

Training

This LoRA was trained on RunPod cloud GPUs.

Inference

To use the LoRA model you can use my modified fork until the following PR will be merged into the main branch of VibeVoice Community's repository.

Examples

Voice without LoRA

Voice WITH LoRA

Support

Producing and sharing this kind of open-source work requires renting cloud GPUs, which gets expensive quickly. If you find it useful and would like me to keep contributing, your support is very much appreciated:

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

Base model

aoi-ot/VibeVoice-Large

Adapter

(3)

this model

Dataset used to train Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

Collection including Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

Text to Speech Models

Collection

2 items • Updated Feb 14 • 1