---
base_model:
- aoi-ot/VibeVoice-Large
tags:
- text-to-speech
- tts
- lora
- sft
- full-finetune
- vibevoice
language:
- hu
---

# VibeVoice_7B_Hun_v2

This is my latest fine-tuned VibeVoice 7B (Large) model, tailored to the Hungarian language.
I trained a LoRA for the LLM module, performed a full fine-tune of the diffusion head modules, and merged both into the base model.

To fine-tune the model I used [this code](https://github.com/voicepowered-ai/VibeVoice-finetuning).
Thanks to [JPGallegoar](https://github.com/jpgallegoar-vpai) for that amazing VibeVoice trainer!

## Inference

For inference, you can use:
- [this ComfyUI node](https://github.com/Enemyx-net/VibeVoice-ComfyUI)
- the demo code in the [VibeVoice Community repository](https://github.com/vibevoice-community/VibeVoice)

## Examples

These examples were made with 4-bit inference. Results can be even better without quantization.

#### Sample 1

```
"Az utcák lassan megteltek emberekkel, ahogy a város ébredezett. A kávézók teraszain gőzölgő csészék mellett beszélgettek az emberek, miközben a villamos csilingelve gördült el a sarkon. A levegőben friss péksütemény illata keveredett a tavaszi széllel. Minden arra utalt, hogy egy nyugodt, szép nap veszi kezdetét."
```