Feature Engineering for Machine Learning

  1. Exploratory Data Analysis for Features
    1. Univariate Analysis
      1. Descriptive Statistics
        1. Central Tendency Measures
          1. Mean
            1. Median
              1. Mode
                1. Trimmed Mean
                2. Dispersion Measures
                  1. Variance
                    1. Standard Deviation
                      1. Range
                        1. Interquartile Range
                          1. Mean Absolute Deviation
                          2. Distribution Shape
                            1. Skewness
                              1. Kurtosis
                                1. Modality
                              2. Distribution Visualization
                                1. Histograms
                                  1. Density Plots
                                    1. Box Plots
                                      1. Violin Plots
                                        1. Q-Q Plots
                                          1. Count Plots
                                          2. Data Quality Assessment
                                            1. Missing Value Patterns
                                              1. Outlier Detection
                                                1. Data Type Consistency
                                                  1. Value Range Validation
                                                2. Bivariate Analysis
                                                  1. Relationship Identification
                                                    1. Scatter Plots
                                                      1. Line Plots
                                                        1. Heatmaps
                                                          1. Joint Plots
                                                          2. Correlation Analysis
                                                            1. Pearson Correlation
                                                              1. Spearman Correlation
                                                                1. Kendall Tau
                                                                  1. Point-Biserial Correlation
                                                                  2. Categorical Relationships
                                                                    1. Contingency Tables
                                                                      1. Chi-Square Tests
                                                                        1. Cramér's V
                                                                          1. Phi Coefficient
                                                                        2. Multivariate Analysis
                                                                          1. Multiple Feature Relationships
                                                                            1. Pair Plots
                                                                              1. Correlation Matrices
                                                                                1. Covariance Analysis
                                                                                2. Target Variable Analysis
                                                                                  1. Feature-Target Relationships
                                                                                    1. Grouped Statistics
                                                                                      1. Stratified Analysis
                                                                                        1. Class Distribution Analysis
                                                                                        2. Multicollinearity Detection
                                                                                          1. Variance Inflation Factor
                                                                                            1. Condition Index
                                                                                              1. Eigenvalue Analysis