Machine Learning with Python

  1. Machine Learning Workflow with Scikit-Learn
    1. Scikit-Learn Overview
      1. Library Architecture
        1. Estimator Interface
          1. Transformer Interface
            1. Predictor Interface
            2. API Consistency
              1. Fit Method
                1. Transform Method
                  1. Predict Method
                  2. Data Representation
                    1. Feature Matrices
                      1. Target Vectors
                        1. Sparse Matrices
                      2. Data Preprocessing
                        1. Handling Missing Data
                          1. Missing Data Detection
                            1. Imputation Strategies
                              1. Simple Imputation
                                1. Iterative Imputation
                                  1. KNN Imputation
                                2. Categorical Data Encoding
                                  1. One-Hot Encoding
                                    1. Label Encoding
                                      1. Ordinal Encoding
                                        1. Target Encoding
                                        2. Feature Scaling
                                          1. Standardization
                                            1. StandardScaler
                                              1. RobustScaler
                                              2. Normalization
                                                1. MinMaxScaler
                                                  1. MaxAbsScaler
                                                  2. Unit Vector Scaling
                                                  3. Data Splitting
                                                    1. Train-Test Split
                                                      1. Stratified Splitting
                                                        1. Time Series Splitting
                                                          1. Cross-Validation Splits
                                                        2. Feature Engineering
                                                          1. Feature Creation
                                                            1. Polynomial Features
                                                              1. Interaction Features
                                                                1. Mathematical Transformations
                                                                  1. Domain-Specific Features
                                                                  2. Feature Selection
                                                                    1. Filter Methods
                                                                      1. Univariate Selection
                                                                        1. Correlation-based Selection
                                                                          1. Variance Threshold
                                                                          2. Wrapper Methods
                                                                            1. Recursive Feature Elimination
                                                                              1. Sequential Feature Selection
                                                                              2. Embedded Methods
                                                                                1. L1 Regularization
                                                                                  1. Tree-based Importance
                                                                                2. Dimensionality Reduction
                                                                                  1. Linear Methods
                                                                                    1. Principal Component Analysis
                                                                                      1. Linear Discriminant Analysis
                                                                                        1. Factor Analysis
                                                                                        2. Non-linear Methods
                                                                                          1. Kernel PCA
                                                                                            1. Manifold Learning
                                                                                        3. Model Training and Evaluation
                                                                                          1. Model Training Process
                                                                                            1. Fitting Models
                                                                                              1. Parameter Learning
                                                                                                1. Convergence Monitoring
                                                                                                2. Model Evaluation Metrics
                                                                                                  1. Classification Metrics
                                                                                                    1. Accuracy
                                                                                                      1. Precision
                                                                                                        1. Recall
                                                                                                          1. F1-Score
                                                                                                            1. ROC-AUC
                                                                                                              1. Precision-Recall AUC
                                                                                                                1. Confusion Matrix
                                                                                                                  1. Classification Report
                                                                                                                  2. Regression Metrics
                                                                                                                    1. Mean Absolute Error
                                                                                                                      1. Mean Squared Error
                                                                                                                        1. Root Mean Squared Error
                                                                                                                          1. R-squared
                                                                                                                            1. Adjusted R-squared
                                                                                                                              1. Mean Absolute Percentage Error
                                                                                                                            2. Model Validation
                                                                                                                              1. Holdout Validation
                                                                                                                                1. Cross-Validation
                                                                                                                                  1. K-Fold Cross-Validation
                                                                                                                                    1. Stratified K-Fold
                                                                                                                                      1. Leave-One-Out
                                                                                                                                        1. Time Series Cross-Validation
                                                                                                                                        2. Bootstrap Validation
                                                                                                                                      2. Model Optimization
                                                                                                                                        1. Hyperparameter Tuning
                                                                                                                                          1. Grid Search
                                                                                                                                            1. Exhaustive Search
                                                                                                                                              1. Parameter Grids
                                                                                                                                              2. Random Search
                                                                                                                                                1. Random Sampling
                                                                                                                                                  1. Search Distributions
                                                                                                                                                  2. Bayesian Optimization
                                                                                                                                                    1. Gaussian Processes
                                                                                                                                                      1. Acquisition Functions
                                                                                                                                                    2. Model Selection
                                                                                                                                                      1. Cross-Validation Scoring
                                                                                                                                                        1. Model Comparison
                                                                                                                                                          1. Statistical Testing
                                                                                                                                                          2. Pipeline Construction
                                                                                                                                                            1. Creating Pipelines
                                                                                                                                                              1. Pipeline Components
                                                                                                                                                                1. Nested Pipelines
                                                                                                                                                                  1. Pipeline Optimization