Vall-E
Visit Toolvall-e is an open-source PyTorch implementation of VALL-E, a zero-shot text-to-speech model. It allows users to train the VALL-E model on a single GPU and reproduce the original VALL-E demo.
At a glance
Trending
Also listed in
vall-e is an open-source PyTorch implementation of VALL-E, a zero-shot text-to-speech model. It allows users to train the VALL-E model on a single GPU and reproduce the original VALL-E demo.
Trending
Also listed in
About
vall-e provides an unofficial PyTorch implementation of the VALL-E neural codec language model, enabling zero-shot text-to-speech synthesis. This open-source project allows users to train the VALL-E model efficiently on a single GPU, making advanced speech synthesis accessible for researchers and developers. It includes comprehensive instructions for installation, training, and inference, with examples for both English (LibriTTS) and Chinese (AISHELL-1) datasets. The implementation also supports various prefix modes for the NAR Decoder, offering flexibility in how acoustic prompt tokens are used. While the project emphasizes its potential risks regarding misuse due to its speaker identity preservation capabilities, it serves as a valuable resource for those looking to experiment with and advance text-to-speech technologies.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending