PPASR Chinese Speech Recognition (Beginner Level)
Thank you for your detailed introduction! To further help everyone understand and use this CTC-based end-to-end Chinese-English speech recognition model, I will supplement and improve it from several aspects: ### 1. Dataset and Its Processing #### AISHELL - **Data Volume**: Approximately 20 hours of Mandarin Chinese pronunciation. - **Characteristics**: Contains standard Mandarin Chinese pronunciation and some dialects. #### Free ST Chinese Mandarin Corpus - **Data Volume**: Approximately 65 hours of Mandarin Chinese pronunciation. -
Read MoreEnd-to-End Chinese Speech Recognition Model of DeepSpeech2 Implemented Based on PaddlePaddle
This tutorial provides a detailed introduction to using PaddlePaddle for speech recognition, along with a series of operational guidelines to assist developers from data preparation to model training and online deployment. Below is a brief summary of each step: 1. **Environment Configuration**: Ensure the development environment has installed necessary software and libraries, including PaddlePaddle. 2. **Data Preparation**: - Download and extract the speech recognition dataset. - Process audio files, such as denoising, downsampling, etc. - (Note: The original summary for "processing text" appears to be incomplete in the provided content.)
Read More