Speech Synthesis and Processing

  1. Speech Analysis and Feature Extraction
    1. Fundamental Frequency Estimation
      1. Time-Domain Methods
        1. Autocorrelation Function
          1. Average Magnitude Difference Function (AMDF)
            1. Normalized Cross-Correlation
            2. Frequency-Domain Methods
              1. Harmonic Product Spectrum
                1. Cepstral Peak Picking
                  1. Spectral Autocorrelation
                  2. Advanced Pitch Estimation
                    1. YIN Algorithm
                      1. Difference Function
                        1. Cumulative Mean Normalized Difference
                          1. Absolute Threshold
                          2. PYIN Algorithm
                            1. Probabilistic Extensions
                              1. Voicing Probability
                              2. RAPT Algorithm
                                1. Dynamic Programming Approach
                                  1. Cross-correlation Analysis
                                2. Pitch Tracking and Smoothing
                                  1. Temporal Continuity Constraints
                                    1. Median Filtering
                                      1. Kalman Filtering
                                    2. Formant Analysis
                                      1. Linear Predictive Coding (LPC)
                                        1. Autocorrelation Method
                                          1. Covariance Method
                                            1. Levinson-Durbin Algorithm
                                              1. Prediction Error and Gain
                                              2. Formant Extraction from LPC
                                                1. Root Finding Methods
                                                  1. Bandwidth Estimation
                                                    1. Formant Tracking
                                                    2. Spectral Peak Picking
                                                      1. Peak Detection Algorithms
                                                        1. Spectral Smoothing
                                                          1. False Peak Rejection
                                                          2. Formant Synthesis
                                                            1. Cascade Formant Synthesis
                                                              1. Parallel Formant Synthesis
                                                                1. Formant Parameter Control
                                                              2. Voice Activity Detection (VAD)
                                                                1. Energy-based Methods
                                                                  1. Short-term Energy Thresholding
                                                                    1. Adaptive Thresholding
                                                                      1. Energy Contour Analysis
                                                                      2. Spectral-based Methods
                                                                        1. Spectral Entropy
                                                                          1. Spectral Centroid
                                                                            1. Spectral Rolloff
                                                                            2. Statistical Model-based VAD
                                                                              1. Gaussian Mixture Models
                                                                                1. Hidden Markov Models
                                                                                  1. Likelihood Ratio Tests
                                                                                  2. Machine Learning-based VAD
                                                                                    1. Support Vector Machines
                                                                                      1. Neural Network Approaches
                                                                                        1. Deep Learning Models
                                                                                      2. Feature Engineering
                                                                                        1. Spectral Features
                                                                                          1. Linear Prediction Cepstral Coefficients (LPCCs)
                                                                                            1. Perceptual Linear Prediction (PLP)
                                                                                              1. Relative Spectral Transform (RASTA)
                                                                                                1. Gammatone Frequency Cepstral Coefficients (GFCCs)
                                                                                                2. Prosodic Features
                                                                                                  1. Fundamental Frequency Contours
                                                                                                    1. Energy Contours
                                                                                                      1. Duration Modeling
                                                                                                        1. Speaking Rate Features
                                                                                                        2. Voice Quality Features
                                                                                                          1. Jitter Measurements
                                                                                                            1. Shimmer Measurements
                                                                                                              1. Harmonics-to-Noise Ratio (HNR)
                                                                                                                1. Spectral Tilt
                                                                                                                  1. Cepstral Peak Prominence
                                                                                                                  2. Feature Normalization
                                                                                                                    1. Cepstral Mean Normalization (CMN)
                                                                                                                      1. Cepstral Variance Normalization (CVN)
                                                                                                                        1. Feature Warping
                                                                                                                          1. Histogram Equalization