Data Science

  1. Feature Engineering and Selection
    1. Feature Creation
      1. Domain-Specific Features
        1. Business Logic Features
        2. Industry-Specific Metrics
        3. Expert Knowledge Integration
      2. Mathematical Transformations
        1. Polynomial Features
        2. Logarithmic Transformations
        3. Square Root Transformations
        4. Reciprocal Transformations
      3. Interaction Features
        1. Two-way Interactions
        2. Higher-order Interactions
        3. Cross Products
      4. Aggregation Features
        1. Statistical Aggregations
        2. Time-based Aggregations
        3. Group-based Aggregations
      5. Date and Time Features
        1. Temporal Decomposition
          1. Year
          2. Month
          3. Day of Week
          4. Hour
          5. Season
        2. Time Differences
        3. Lag Features
        4. Rolling Window Statistics
      6. Text Features
        1. Bag of Words
        2. TF-IDF
        3. N-grams
        4. Word Embeddings
        5. Sentiment Scores
        6. Text Statistics
      7. Geospatial Features
        1. Distance Calculations
        2. Coordinate Transformations
        3. Spatial Clustering
        4. Geographic Aggregations
    2. Feature Transformation
      1. Scaling Techniques
        1. Min-Max Scaling
        2. Standardization
        3. Robust Scaling
        4. Unit Vector Scaling
      2. Distribution Transformations
        1. Log Transformation
        2. Box-Cox Transformation
        3. Yeo-Johnson Transformation
        4. Quantile Transformation
      3. Discretization
        1. Equal-width Binning
        2. Equal-frequency Binning
        3. Custom Binning
        4. Optimal Binning
      4. Encoding Categorical Variables
        1. Nominal Encoding
          1. One-Hot Encoding
          2. Binary Encoding
          3. Hash Encoding
        2. Ordinal Encoding
          1. Label Encoding
          2. Custom Ordinal Mapping
        3. Target-based Encoding
          1. Target Encoding
          2. Leave-One-Out Encoding
          3. Weight of Evidence
        4. High Cardinality Handling
          1. Frequency Encoding
          2. Rare Category Grouping
          3. Embedding Techniques
    3. Feature Selection Methods
      1. Filter Methods
        1. Univariate Statistical Tests
          1. Chi-square Test
          2. ANOVA F-test
          3. Mutual Information
        2. Correlation-based Selection
          1. Pearson Correlation
          2. Spearman Correlation
          3. Kendall's Tau
        3. Variance-based Selection
          1. Low Variance Filter
          2. Quasi-constant Features
      2. Wrapper Methods
        1. Forward Selection
        2. Backward Elimination
        3. Recursive Feature Elimination
        4. Genetic Algorithms
      3. Embedded Methods
        1. L1 Regularization
        2. Tree-based Feature Importance
        3. Elastic Net
      4. Hybrid Methods
        1. Sequential Feature Selection
        2. Stability Selection
    4. Feature Validation
      1. Feature Importance Analysis
        1. Permutation Importance
        2. SHAP Values
        3. Feature Ablation Studies
      2. Feature Stability
        1. Cross-validation Consistency
        2. Bootstrap Stability
      3. Multicollinearity Detection
        1. Variance Inflation Factor
        2. Condition Index
        3. Correlation Matrix Analysis
    5. Automated Feature Engineering
      1. Feature Tools
      2. AutoML Feature Generation
      3. Deep Feature Synthesis