MATLAB Implementation for Speech Recognition System
A comprehensive speech recognition program featuring a pre-trained fixed model architecture, capable of executing complete recognition workflows and producing accurate transcription results.
Explore MATLAB source code curated for "语音识别" with clean implementations, documentation, and examples.
A comprehensive speech recognition program featuring a pre-trained fixed model architecture, capable of executing complete recognition workflows and producing accurate transcription results.
Speech Recognition is a technology that enables machines to convert speech signals into corresponding text or commands through identification and comprehension processes. This project conducts preliminary exploration and research on isolated word recognition using the DTW (Dynamic Time Warping) algorithm. Implementation involves MATLAB-based development of isolated word speech recognition, with analysis of DTW's key characteristics and limitations, including code-level insights on pattern matching and temporal alignment techniques.
A robust pitch period feature extraction program suitable for speech recognition applications, implementing advanced signal processing algorithms for accurate fundamental frequency detection.
Implementation of speech recognition system combining Dynamic Time Warping and Mel-Frequency Cepstral Coefficients
A self-developed speech recognition program utilizing Hidden Markov Model (HMM) framework for audio pattern recognition
MATLAB implementation of the DTW algorithm for speech recognition with corresponding test programs, including feature extraction and template matching capabilities.
Speech recognition system based on artificial neural networks (ANN) that combines LPC parameters extracted from speech signals with neural network classification, implemented and optimized using MATLAB
MATLAB implementation of DTW (Dynamic Time Warping) algorithm for speech recognition with enhanced code documentation and technical explanations
This MATLAB program focuses on digital speech recognition training and identification. Due to the large size of the complete dataset, only a small sample is uploaded here. Users can create additional data using software like COOLEDIT by following these specifications: WAV files must have 8000 Hz sampling rate, mono channel, 16-bit sampling precision with Motorola PCM format. Corresponding LAB files should contain speech segment boundaries (start/end points) and phonetic content labels for training data annotation.
MATLAB source code for speech recognition system including preprocessing, feature extraction, and training/recognition modules with algorithm implementations