Useful Links
Computer Science
Signal Processing
Speech Synthesis and Processing
1. Fundamentals of Sound and Speech
2. Digital Signal Processing for Speech
3. Speech Analysis and Feature Extraction
4. Speech Synthesis (Text-to-Speech)
5. Automatic Speech Recognition (ASR)
6. Advanced Topics and Applications
7. Evaluation and Quality Assessment
Automatic Speech Recognition (ASR)
ASR System Architecture
Feature Extraction Pipeline
Preprocessing Steps
Feature Computation
Feature Normalization
Feature Transformation
Acoustic Modeling
Model Architecture Design
Training Procedures
Model Adaptation Techniques
Language Modeling
Vocabulary Design
Text Preprocessing
Model Training and Evaluation
Decoding Process
Search Space Definition
Pruning Strategies
Hypothesis Scoring
Traditional ASR Approaches
Hidden Markov Models (HMMs)
HMM Topology Design
State Emission Modeling
Transition Probability Modeling
Training Algorithms
Baum-Welch Algorithm
Viterbi Training
Gaussian Mixture Models (GMMs)
GMM Parameter Estimation
Component Selection
Adaptation Techniques
GMM-HMM Systems
System Architecture
Training Pipeline
Decoding Process
Deep Learning ASR
Deep Neural Network Acoustic Models
Feedforward Networks
Convolutional Neural Networks
Recurrent Neural Networks
Hybrid DNN-HMM Systems
End-to-End ASR Models
Connectionist Temporal Classification (CTC)
CTC Loss Function
Alignment-free Training
CTC Decoding Algorithms
Attention-based Models
Encoder-Decoder Architecture
Attention Mechanisms
Listen, Attend and Spell (LAS)
RNN Transducer (RNN-T)
Transducer Architecture
Streaming Recognition
Training and Inference
Transformer-based ASR
Self-attention Mechanisms
Conformer Architecture
Wav2Vec 2.0
Whisper Architecture
Language Modeling
N-gram Language Models
Maximum Likelihood Estimation
Smoothing Techniques
Add-one Smoothing
Good-Turing Smoothing
Kneser-Ney Smoothing
Backoff and Interpolation
Neural Language Models
Feedforward Neural LMs
Recurrent Neural LMs
LSTM Language Models
Transformer Language Models
Domain Adaptation
Model Interpolation
Model Adaptation Techniques
Transfer Learning
Decoding and Search
Viterbi Algorithm
Dynamic Programming Formulation
Path Backtracking
Computational Complexity
Beam Search Decoding
Beam Width Selection
Pruning Strategies
Language Model Integration
Weighted Finite-State Transducers (WFSTs)
WFST Construction
Composition Operations
Optimization Techniques
Advanced Decoding Techniques
Word-level Beam Search
Prefix Beam Search
Attention-based Decoding
Previous
4. Speech Synthesis (Text-to-Speech)
Go to top
Next
6. Advanced Topics and Applications