Cseti's picture
Update README.md
80a2efc verified
metadata
base_model:
  - aoi-ot/VibeVoice-Large
tags:
  - text-to-speech
  - tts
  - lora
  - vibevice
datasets:
  - mozilla-foundation/common_voice_17_0
language:
  - hu

VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17

This is a VibeVoice 7B (Large) model LoRA finetune on a Hungarian audio dataset. For this particular test I used the CommonVoice 17.0 dataset's Hungarian config's train split.

To finetune the model I used the following code base.

Thank you for JPGallegoar for that amazing VibeVoice trainer!

Inference

To use the LoRA model you can use my modified fork until the following PR will be merged into the main branch of VibeVoice Community's repository.

Examples

Voice without LoRA

Voice WITH LoRA