IMS-Toucan
Visit ToolIMS-Toucan is an open-source Text-to-Speech tool that supports over 7000 languages. It offers controllable and fast speech synthesis, allowing users to train and use state-of-the-art TTS models.
At a glance
Trending
IMS-Toucan is an open-source Text-to-Speech tool that supports over 7000 languages. It offers controllable and fast speech synthesis, allowing users to train and use state-of-the-art TTS models.
Trending
About
IMS-Toucan is a comprehensive, open-source toolkit developed at the Institute for Natural Language Processing (IMS), University of Stuttgart, designed for training, using, and teaching state-of-the-art Text-to-Speech Synthesis. This system is notable for its massive multilingual support, covering over 7000 languages, and its ability to generate speech quickly and controllably without requiring extensive computational resources. Users can fine-tune models on single or multiple datasets, including various languages, and leverage pretrained models for faster results. The toolkit provides inference interfaces for generating audio from text, with options to control duration, pitch, and energy curves, and supports both file output and immediate audio playback. It also includes features for managing storage, installing optional dependencies like eSpeak-NG, and a scorer to identify and remove outliers from datasets, making it a robust solution for advanced TTS development and research.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending