metadata
base_model:
  - aoi-ot/VibeVoice-Large
tags:
  - text-to-speech
  - tts
  - lora
  - vibevoice
datasets:
  - mozilla-foundation/common_voice_17_0
language:
  - hu
VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17
This is a LoRA finetune of the VibeVoice 7B (Large) model on Hungarian audio. For this particular test I used the train split of the Hungarian config of the Common Voice 17.0 dataset.
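For reference, here is a minimal sketch of how that split could be loaded with the Hugging Face `datasets` library. The dataset ID and the `hu` config come from this card's metadata. The helper name `load_hungarian_train` is only illustrative and is not part of the trainer. The sketch assumes the dataset is still reachable on the Hub under this ID and that you have accepted its terms. Depending on your `datasets` version, extra arguments may be required.

```python
# Values taken from this model card's metadata.
DATASET_ID = "mozilla-foundation/common_voice_17_0"
LANGUAGE_CONFIG = "hu"  # Hungarian
SPLIT = "train"


def load_hungarian_train(token=None):
    """Return the Hungarian train split of Common Voice 17.0.

    Hypothetical helper for illustration only. Assumes the dataset is
    available under DATASET_ID and that `token` (or a cached login)
    grants access to it.
    """
    # Imported lazily so the constants above can be inspected
    # without the `datasets` package installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, LANGUAGE_CONFIG, split=SPLIT, token=token)


# Example usage (requires network access and Hub authentication):
#   ds = load_hungarian_train()
#   print(ds)
```

The resulting dataset would still need to be converted into whatever format the trainer expects. That step is not shown here.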
To finetune the model I used the following code base.
Thank you to JPGallegoar for the amazing VibeVoice trainer!
Inference
To use the LoRA model, you can use my modified fork until the following PR is merged into the main branch of the VibeVoice Community repository.
Examples
Voice without LoRA
Voice WITH LoRA