Orpheus-TTS
Visit ToolOrpheus-TTS is an open-source audio & music tool that provides human-sounding speech synthesis. It offers multilingual models and optimized inference capabilities for real-time applications.
At a glance
Trending
Orpheus-TTS is an open-source audio & music tool that provides human-sounding speech synthesis. It offers multilingual models and optimized inference capabilities for real-time applications.
Trending
About
Orpheus-TTS is a state-of-the-art open-source text-to-speech system built on the Llama-3b backbone, demonstrating emergent capabilities of using LLMs for speech synthesis. It delivers human-like speech with natural intonation, emotion, and rhythm, surpassing many closed-source models. Key features include zero-shot voice cloning, guided emotion and intonation control via simple tags, and low latency for real-time applications. The tool provides both English and multilingual models, along with data processing scripts and sample datasets to facilitate custom finetuning. Users can deploy models on platforms like Baseten for optimized inference at fp8 and fp16, or integrate with local setups. It also supports audio watermarking and offers various voice options and emotive tags for enhanced customization.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending