nineninesix
/

kani-tts-450m-0.1-pt

text-generation

Model card Files Files and versions

ylankgz commited on Sep 19

Commit

51c2677

·

verified ·

1 Parent(s): d7af952

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -75,7 +75,7 @@ This model powers voice interactions in the modern agentic systems, enabling sea
 - Dataset: Curated from LibriTTS, Common Voice and Emilia (~50k hours).
 - Pretrained mostly on English speech for robust core capabilities, with multilingual fine-tuning for supported languages.
 - Metrics: MOS (Mean Opinion Score) 4.3/5 for naturalness; WER (Word Error Rate) < 5% on benchmark texts.
-- Hardware: Pretrained on 8x H200 over 25 hours.
 ## Inference on Nvidia RTX 5080:
 - **Latency**: ~ 1s to generate 15 seconds of audio

 - Dataset: Curated from LibriTTS, Common Voice and Emilia (~50k hours).
 - Pretrained mostly on English speech for robust core capabilities, with multilingual fine-tuning for supported languages.
 - Metrics: MOS (Mean Opinion Score) 4.3/5 for naturalness; WER (Word Error Rate) < 5% on benchmark texts.
+- Hardware: Pretrained on 8x H200 over 8 hours.
 ## Inference on Nvidia RTX 5080:
 - **Latency**: ~ 1s to generate 15 seconds of audio