Moonshine
Moonshine is an open-source AI toolkit for very low latency speech-to-text, intent recognition, and text-to-speech. It enables developers to build real-time voice agents and interfaces that run on-device.
About
Moonshine Voice is an open-source AI toolkit for developers building real-time voice applications. It provides very low latency speech-to-text, intent recognition, and text-to-speech. The framework and models are optimized for live streaming: audio is processed while the user is still speaking, so responses arrive with little delay. Everything runs on-device, which keeps it fast and private and removes the need for accounts, credit cards, or API keys. Moonshine runs on iOS, Android, macOS, Linux, Windows, Raspberry Pi boards, IoT devices, and wearables, and is usable from Python, with high-level APIs for common tasks such as transcription, text-to-speech, speaker identification, and command recognition. It supports multiple languages for both speech-to-text (STT) and text-to-speech (TTS), and it reports higher accuracy than Whisper Large V3 with significantly smaller models.
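The streaming idea described above, processing audio while the user is still speaking, can be illustrated with a minimal Python sketch. This is not Moonshine's API: `StubRecognizer`, `frames`, `stream_transcribe`, the 16 kHz sample rate, and the 100 ms frame length are all hypothetical names and values chosen for illustration. The point is only the control flow: each short frame is handed to the recognizer as soon as it arrives, so a partial result exists before the utterance ends.

```python
from typing import Callable, Iterable, Iterator, List

SAMPLE_RATE = 16_000  # assumed sample rate in Hz (illustrative, not from Moonshine)
FRAME_MS = 100        # assumed frame length in milliseconds (illustrative)


def frames(samples: List[float],
           frame_ms: int = FRAME_MS,
           sample_rate: int = SAMPLE_RATE) -> Iterator[List[float]]:
    """Split audio into fixed-length frames, the way a microphone
    callback would deliver them one at a time."""
    step = sample_rate * frame_ms // 1000
    for start in range(0, len(samples), step):
        yield samples[start:start + step]


class StubRecognizer:
    """Hypothetical stand-in for an on-device streaming model.
    It only counts frames; a real model would update its transcript."""

    def __init__(self) -> None:
        self.frames_seen = 0

    def accept(self, frame: List[float]) -> str:
        self.frames_seen += 1
        return f"<partial after {self.frames_seen} frame(s)>"


def stream_transcribe(audio_frames: Iterable[List[float]],
                      recognizer: StubRecognizer,
                      on_partial: Callable[[str], None]) -> str:
    """Feed frames to the recognizer as they arrive and report each
    partial result immediately instead of waiting for the full clip."""
    last = ""
    for frame in audio_frames:
        last = recognizer.accept(frame)
        on_partial(last)
    return last


if __name__ == "__main__":
    one_second_of_silence = [0.0] * SAMPLE_RATE
    stream_transcribe(frames(one_second_of_silence), StubRecognizer(), print)
```

With this structure, the work left when the speaker stops is a single frame rather than the whole recording, which is where the latency benefit of streaming comes from.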
Pricing & Plans
Open Source
Free