OuteAI/wavtokenizer-large-75token-interface

edwko committed on Dec 14, 2024

Commit 8f47998 · verified · 1 Parent(s): 771ff21

Update README.md

Files changed (1)

README.md +13 -3

@@ -1,3 +1,13 @@
----
-license: mit
----

+---
+license: mit
+---
+
+This is a streamlined interface version of [WavTokenizer-large-speech-75token](https://huggingface.co/novateur/WavTokenizer-large-speech-75token/tree/main) that exposes the model through separate encoder and decoder components.
+
+- Reduced model size from 1.75 GB to ~330 MB by keeping only the components needed for inference
+- Split interface (82 MB encoder, 248 MB decoder)
+- Simplified integration with just [one .py](https://github.com/edwko/OuteTTS) file
+
+The model is split into:
+- `encoder/`: Handles audio encoding
+- `decoder/`: Handles decoding and synthesis
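
Because the README describes separate `encoder/` and `decoder/` directories, a consumer that only needs one of them could in principle fetch just that directory. The following is a minimal sketch, not part of this commit: it assumes the two directories sit at the top level of the `OuteAI/wavtokenizer-large-75token-interface` repository, and the helper names `patterns_for` and `download_component` are hypothetical. It relies on `snapshot_download` from the third-party `huggingface_hub` package and makes no assumption about the file names inside each directory.

```python
from pathlib import Path

# Repository name taken from the commit page; directory names from the README.
REPO_ID = "OuteAI/wavtokenizer-large-75token-interface"
COMPONENTS = ("encoder", "decoder")


def patterns_for(component: str) -> list[str]:
    """Return the glob patterns that select one component directory."""
    if component not in COMPONENTS:
        raise ValueError(f"unknown component: {component!r}")
    return [f"{component}/*"]


def download_component(component: str) -> Path:
    """Download only one component directory and return its local path.

    Hypothetical helper: assumes `encoder/` and `decoder/` are top-level
    directories in the repository, as the README suggests.
    """
    # Imported lazily so the pattern helper works without the package installed.
    from huggingface_hub import snapshot_download

    local_root = snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=patterns_for(component),
    )
    return Path(local_root) / component
```

For example, `download_component("encoder")` would fetch only the files under `encoder/`, which is useful when a pipeline tokenizes audio but never synthesizes it.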