GLM-ASR
Visit ToolGLM-ASR is an open-source speech recognition model that offers robust performance with 1.5B parameters. It excels in dialect support and low-volume speech robustness, outperforming OpenAI Whisper V3.
At a glance
Trending
GLM-ASR is an open-source speech recognition model that offers robust performance with 1.5B parameters. It excels in dialect support and low-volume speech robustness, outperforming OpenAI Whisper V3.
Trending
About
GLM-ASR-Nano is a robust, open-source speech recognition model featuring 1.5 billion parameters, designed to handle real-world complexities. It surpasses OpenAI Whisper V3 in multiple benchmarks while maintaining a compact size. Key capabilities include exceptional dialect support, particularly for Cantonese and other dialects, effectively bridging gaps in dialectal speech recognition. The model is also specifically trained for "Whisper/Quiet Speech" scenarios, accurately transcribing extremely low-volume audio that traditional models often miss. GLM-ASR-Nano achieves a state-of-the-art average error rate of 4.10 among comparable open-source models, demonstrating significant advantages in Chinese benchmarks like Wenet Meeting and Aishell-1. It supports 17 languages with high usability, with specific optimizations for certain regions.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending