Welp, that was fast. Haven’t tried the local version yet, but this is based on the VALL-E paper by Microsoft researchers: https://github.com/enhuiz/vall-e. It's a trainable text-to-speech model and comes with a Colab implementation. Let me know how it works for you; I'll spin it up later.