Useful Links
Computer Science
Signal Processing
Speech Synthesis and Processing
1. Fundamentals of Sound and Speech
2. Digital Signal Processing for Speech
3. Speech Analysis and Feature Extraction
4. Speech Synthesis (Text-to-Speech)
5. Automatic Speech Recognition (ASR)
6. Advanced Topics and Applications
7. Evaluation and Quality Assessment
Speech Analysis and Feature Extraction
Fundamental Frequency Estimation
Time-Domain Methods
Autocorrelation Function
Average Magnitude Difference Function (AMDF)
Normalized Cross-Correlation
Frequency-Domain Methods
Harmonic Product Spectrum
Cepstral Peak Picking
Spectral Autocorrelation
Advanced Pitch Estimation
YIN Algorithm
Difference Function
Cumulative Mean Normalized Difference
Absolute Threshold
PYIN Algorithm
Probabilistic Extensions
Voicing Probability
RAPT Algorithm
Dynamic Programming Approach
Cross-correlation Analysis
Pitch Tracking and Smoothing
Temporal Continuity Constraints
Median Filtering
Kalman Filtering
Formant Analysis
Linear Predictive Coding (LPC)
Autocorrelation Method
Covariance Method
Levinson-Durbin Algorithm
Prediction Error and Gain
Formant Extraction from LPC
Root Finding Methods
Bandwidth Estimation
Formant Tracking
Spectral Peak Picking
Peak Detection Algorithms
Spectral Smoothing
False Peak Rejection
Formant Synthesis
Cascade Formant Synthesis
Parallel Formant Synthesis
Formant Parameter Control
Voice Activity Detection (VAD)
Energy-based Methods
Short-term Energy Thresholding
Adaptive Thresholding
Energy Contour Analysis
Spectral-based Methods
Spectral Entropy
Spectral Centroid
Spectral Rolloff
Statistical Model-based VAD
Gaussian Mixture Models
Hidden Markov Models
Likelihood Ratio Tests
Machine Learning-based VAD
Support Vector Machines
Neural Network Approaches
Deep Learning Models
Feature Engineering
Spectral Features
Linear Prediction Cepstral Coefficients (LPCCs)
Perceptual Linear Prediction (PLP)
Relative Spectral Transform (RASTA)
Gammatone Frequency Cepstral Coefficients (GFCCs)
Prosodic Features
Fundamental Frequency Contours
Energy Contours
Duration Modeling
Speaking Rate Features
Voice Quality Features
Jitter Measurements
Shimmer Measurements
Harmonics-to-Noise Ratio (HNR)
Spectral Tilt
Cepstral Peak Prominence
Feature Normalization
Cepstral Mean Normalization (CMN)
Cepstral Variance Normalization (CVN)
Feature Warping
Histogram Equalization
Previous
2. Digital Signal Processing for Speech
Go to top
Next
4. Speech Synthesis (Text-to-Speech)