Update README.md
README.md
CHANGED
@@ -75,7 +75,7 @@ This model powers voice interactions in the modern agentic systems, enabling sea
 - Dataset: Curated from LibriTTS, Common Voice and Emilia (~50k hours).
 - Pretrained mostly on English speech for robust core capabilities, with multilingual fine-tuning for supported languages.
 - Metrics: MOS (Mean Opinion Score) 4.3/5 for naturalness; WER (Word Error Rate) < 5% on benchmark texts.
-- Hardware: Pretrained on 8x H200 over
+- Hardware: Pretrained on 8x H200 over 8 hours.
 
 ## Inference on Nvidia RTX 5080:
 - **Latency**: ~ 1s to generate 15 seconds of audio