Masr
Visit Toolmasr is an open-source Mandarin automatic speech recognition (ASR) project. It uses an end-to-end deep neural network with a gated convolutional network architecture for converting spoken Mandarin to text.
At a glance
Trending
masr is an open-source Mandarin automatic speech recognition (ASR) project. It uses an end-to-end deep neural network with a gated convolutional network architecture for converting spoken Mandarin to text.
Trending
About
masr is an open-source project dedicated to Mandarin Automatic Speech Recognition (ASR). It leverages an end-to-end deep neural network, specifically a gated convolutional network similar to Facebook's Wav2letter, but utilizes GLU (Gated Linear Unit) as its activation function for faster convergence. The project is trained on the AISHELL-1 dataset, comprising 150 hours of recordings covering over 4000 Chinese characters. While not designed to compete with industrial-grade systems, masr serves as a valuable reference for researchers interested in convolutional networks for speech recognition. It also demonstrates how external language models can further improve recognition accuracy.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending