PPASR Chinese Speech Recognition (Beginner Level)

This article supplements the introduction to this CTC-based end-to-end Chinese-English speech recognition model in several respects:

### 1. Dataset and Its Processing

#### AISHELL

- **Data Volume**: Approximately 178 hours of Mandarin Chinese speech.
- **Characteristics**: Contains standard Mandarin Chinese pronunciation and some dialects.

#### Free ST Chinese Mandarin Corpus

- **Data Volume**: Approximately 109 hours of Mandarin Chinese speech.
-

DeepSpeech2 End-to-End Chinese Speech Recognition Model Implemented with PaddlePaddle

This tutorial walks through speech recognition with PaddlePaddle, guiding developers from data preparation through model training to online deployment. Below is a brief summary of each step:

1. **Environment Configuration**: Make sure the necessary software and libraries, including PaddlePaddle, are installed in the development environment.
2. **Data Preparation**:
   - Download and extract the speech recognition dataset.
   - Process the audio files, for example by denoising and downsampling.
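The downsampling mentioned under Data Preparation can be illustrated with a minimal sketch. It is not the tutorial's own preprocessing code: the function name `resample_linear` is hypothetical, the 16 kHz target rate is an assumption (a common choice for speech models), and plain linear interpolation is used without an anti-aliasing filter, so a real pipeline would more likely rely on an audio library's resampler.

```python
import math


def resample_linear(samples, src_rate, dst_rate):
    """Resample a mono signal by linear interpolation.

    Illustrative only: no low-pass (anti-aliasing) filter is applied,
    so high-frequency content may alias when downsampling.
    """
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    if not samples:
        return []

    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # input samples advanced per output sample
    last = len(samples) - 1

    out = []
    for i in range(n_out):
        pos = i * step              # fractional position in the input
        lo = min(int(pos), last)    # left neighbour
        hi = min(lo + 1, last)      # right neighbour (clamped at the end)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# One second of a 440 Hz tone at 48 kHz, reduced to the assumed 16 kHz.
tone_48k = [math.sin(2 * math.pi * 440 * t / 48000) for t in range(48000)]
tone_16k = resample_linear(tone_48k, 48000, 16000)
print(len(tone_16k))
```

Keeping the function free of third-party dependencies makes the idea easy to follow, at the cost of audio quality; the same interface could wrap a library resampler instead.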
