Sherpa-Onnx

Visit Tool

sherpa-onnx is an open-source AI tool that provides speech-to-text, text-to-speech, and various audio processing functions. It operates offline and supports a wide range of platforms and programming languages.

Claim this tool

16Views

At a glance

Pricing

Open Source

Free tier

Yes

API

Yes

Skill level

Technical

About

What is sherpa-onnx?

sherpa-onnx is a comprehensive open-source AI toolkit designed for offline speech and audio processing. It offers a wide array of functionalities including speech-to-text (ASR), text-to-speech (TTS), speaker diarization, speaker identification, speaker verification, spoken language identification, audio tagging, voice activity detection (VAD), speech enhancement, keyword spotting, and source separation. The tool is highly versatile, supporting numerous platforms such as Android, iOS, Windows, macOS, Linux, and HarmonyOS, across various architectures including x64, x86, ARM, and RISC-V. It also integrates with several NPUs like Rockchip, Qualcomm, Ascend, and Axera, and provides APIs for 12 programming languages, including C++, Python, Java, and Swift, along with WebAssembly support. This makes it ideal for developers building AI-powered audio applications for embedded systems and diverse environments.

Best used for

Ideal for developers and engineers who need to implement offline speech recognition, text-to-speech, and advanced audio processing functions. Especially valuable for building applications on embedded systems, mobile devices, and various operating systems without requiring an internet connection.

Common actions

transcribe speech

synthesize speech

separate audio sources

enhance audio quality

detect voice activity

github copilotface swapping"AI Agents"open-sourceworkflowscollaborationlow-code/no-codeautomated workflowdeepfake

Capabilities

Key features

Speech-to-text
Text-to-speech
Speaker diarization
Speech enhancement
Source separation
Voice activity detection
Keyword spotting

Target Audience

developersmachine learning engineersembedded systems engineers

Integrations

Not yet documented

Pricing & Plans

Open Source

Free

FAQs

What programming languages does sherpa-onnx support for integration?

sherpa-onnx supports a wide range of programming languages, including C++, C, Python, JavaScript, Java, C#, Kotlin, Swift, Go, Dart, Rust, and Pascal. It also provides support for WebAssembly, making it highly versatile for various development environments.

Can sherpa-onnx run on embedded systems and mobile devices?

Yes, sherpa-onnx is designed to support embedded systems and mobile devices. It runs on platforms like Android, iOS, HarmonyOS, Raspberry Pi, and various NPUs (Rockchip, Axera, Ascend), making it suitable for resource-constrained environments.

Does sherpa-onnx require an internet connection to function?

No, a key feature of sherpa-onnx is its ability to operate without an internet connection. All speech-to-text, text-to-speech, and other audio processing functions are performed locally using ONNX Runtime.

Trending

Subcategories trending in Content & Design

Image Generation AI Writing Assistants Video Generation Photo Editing Graphic Design Video Editing

Trending

Also listed in

This tool also appears in

Coding & Development › Backend & APIs Coding & Development › Open Source & Models AI Agents & Automation › Voice Agents

Explore

Browse AI tools by category

Content & Design Productivity & Business Coding & Development AI Agents & Automation Research & Education Wellness & Lifestyle Career Development Marketing & Growth Data & Analytics Customer Support & CX Finance E-commerce