Zero-shot voice cloning sounds great if you get a lucky generation.

by Gapeleon - opened Sep 22

Sep 22

Depending on the voice, zero-shot voice cloning works pretty well if you provide [reference_text + new_text] then pre-fill the response with encoded reference audio.

https://huggingface.co/spaces/Gapeleon/KaniTTS_Voice_Cloning
Sometimes it needs a couple of re-generations.

KaniTTS and SparkTTS are the only models I've tried that can get my accent right.

Simonlob

NineNineSix org Sep 22

Woow!

ylankgz

NineNineSix org Sep 22

Thats awesome! We will release a stable base model next week, I hope it will work smoothly

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment