Articles in 2021-11 - Yeyupiaoling

PPASR Streaming and Non-Streaming Speech Recognition

2021-11-30 260 views PaddlePaddle 语音深度学习 Artificial Intelligence Deep Learning PaddlePaddle Speech Recognition DeepSpeech2

This document introduces how to deploy and test a speech recognition model implemented using PaddlePaddle, and provides various methods to execute and demonstrate the model's functionality. The following is a summary and interpretation of the document content: ### 1. Introduction - Provides an overview of PaddlePaddle-based speech recognition models, including recognition for short voice segments and long audio clips. ### 2. Deployment Methods #### 2.1 Command-line Deployment Two commands are provided to implement different deployment methods: - `python infer_server.

Processing and Usage of the WenetSpeech Dataset

2021-11-30 291 views 语音 PaddlePaddle 深度学习 Speech Recognition PaddlePaddle WenetSpeech Mandarin Speech Dataset Chinese Speech Dataset

The WenetSpeech dataset provides over 10,000 hours of Mandarin Chinese speech, categorized into strong-labeled (10,005 hours), weak-labeled (2,478 hours), and unlabeled (9,952 hours) subsets, suitable for supervised, semi-supervised, or unsupervised training. The data is grouped by domain and style, and datasets of different scales (S, M, L) as well as evaluation/test data are provided. The tutorial details how to download, prepare, and use this dataset for training speech recognition models, making it a valuable reference for ASR system developers.

Fast Face Recognition Model Implemented with PaddlePaddle

2021-11-03 235 views PaddlePaddle 深度学习 Deep Learning Computer Vision Artificial Intelligence

This project develops a small and efficient face recognition system based on the ArcFace and PP-OCRv2 models. The training dataset is emore (containing 85,742 individuals and 5,822,653 images), and the lfw-align-128 dataset is used for testing. The project provides complete code and preprocessing scripts. The `create_dataset.py` script is executed to organize raw data into binary file format, improving training efficiency. Model training and evaluation are controlled by `train.py` and `eval.py` respectively. The prediction function supports

A Fast Face Recognition Model Implemented Based on PyTorch

2021-11-03 220 views Pytorch 深度学习 Pytorch Deep Learning Artificial Intelligence

This project aims to develop a face recognition system with small models, high recognition accuracy, and fast inference speed. The training data is sourced from the emore dataset (5.82 million images), and the lfw-align-128 dataset is used for testing. The project combines the ArcFace loss function and MobileNet, implemented through Python scripts. The process of training the model includes data preparation, training, and evaluation, with all code available on GitHub. To start the training process, the `train.py` command is executed; for performance verification, run `ev`