Spaces:
				
			
			
	
			
			
		Runtime error
		
	
	
	
			
			
	
	
	
	
		
		
		Runtime error
		
	trying to hack together a voice cloning demo....
#1
by
						
sherlock1199
	
							
						- opened
							
					
I've been trying to create my own custom embeddings using speechbrain/spkrec-xvect-voxceleb
signal, fs =torchaudio.load('morgan.wav')
embeddings = classifier.encode_batch(signal)
and generating audio using:
speech = model.generate_speech(inputs["input_ids"], embeddings[0], vocoder=vocoder)
but having the output garbled. is there an intermediary step i'm missing ?
so managed to get a non-garbled output. after resampling my wav file to 16k hz and converting it to mono. now to figure out how to improve the quality of voice reproduction.
Great work! Where can I find information about fine-tuning to other languages?
