Data Science

  1. Machine Learning Fundamentals
    1. Core Concepts
      1. What is Machine Learning
        1. Definition and Scope
        2. AI vs Machine Learning vs Deep Learning
        3. Applications and Use Cases
      2. Types of Machine Learning
        1. Supervised Learning
          1. Classification
          2. Regression
        2. Unsupervised Learning
          1. Clustering
          2. Dimensionality Reduction
          3. Association Rules
        3. Semi-supervised Learning
        4. Reinforcement Learning
        5. Online Learning
        6. Transfer Learning
      3. The Machine Learning Workflow
        1. Problem Definition
        2. Data Collection
        3. Data Preprocessing
        4. Feature Engineering
        5. Model Selection
        6. Training
        7. Evaluation
        8. Deployment
        9. Monitoring
      4. Training and Testing
        1. Training Set
        2. Validation Set
        3. Test Set
        4. Cross-validation
        5. Hold-out Validation
      5. Overfitting and Underfitting
        1. Definitions
        2. Causes
        3. Detection Methods
        4. Prevention Strategies
      6. Bias-Variance Tradeoff
        1. Bias Definition
        2. Variance Definition
        3. Tradeoff Implications
        4. Model Complexity Effects
      7. No Free Lunch Theorem
      8. Curse of Dimensionality
    2. Supervised Learning - Regression
      1. Linear Regression
        1. Simple Linear Regression
          1. Mathematical Foundation
          2. Least Squares Method
          3. Assumptions
          4. Interpretation
        2. Multiple Linear Regression
          1. Matrix Formulation
          2. Parameter Estimation
          3. Statistical Inference
          4. Model Diagnostics
        3. Polynomial Regression
          1. Polynomial Features
          2. Degree Selection
          3. Overfitting Concerns
      2. Regularized Regression
        1. Ridge Regression
          1. L2 Regularization
          2. Hyperparameter Tuning
          3. Geometric Interpretation
        2. Lasso Regression
          1. L1 Regularization
          2. Feature Selection Properties
          3. Coordinate Descent
        3. Elastic Net
          1. Combined L1 and L2
          2. Parameter Selection
      3. Non-linear Regression
        1. Support Vector Regression
          1. Kernel Trick
          2. Hyperparameter Tuning
          3. Advantages and Disadvantages
        2. Decision Tree Regression
          1. Tree Construction
          2. Splitting Criteria
          3. Pruning Techniques
        3. Ensemble Methods
          1. Random Forest Regression
          2. Gradient Boosting Regression
          3. XGBoost
          4. LightGBM
      4. Regression Evaluation
        1. Mean Absolute Error
        2. Mean Squared Error
        3. Root Mean Squared Error
        4. R-squared
        5. Adjusted R-squared
        6. Mean Absolute Percentage Error
    3. Supervised Learning - Classification
      1. Linear Classification
        1. Logistic Regression
          1. Sigmoid Function
          2. Maximum Likelihood Estimation
          3. Multiclass Extensions
          4. Regularization
        2. Linear Discriminant Analysis
          1. Assumptions
          2. Decision Boundaries
          3. Quadratic Discriminant Analysis
      2. Instance-based Learning
        1. k-Nearest Neighbors
          1. Distance Metrics
          2. Choosing k
          3. Weighted Voting
          4. Curse of Dimensionality
      3. Probabilistic Classifiers
        1. Naive Bayes
          1. Bayes' Theorem Foundation
          2. Independence Assumption
          3. Gaussian Naive Bayes
          4. Multinomial Naive Bayes
          5. Bernoulli Naive Bayes
      4. Support Vector Machines
        1. Linear SVM
          1. Maximum Margin Principle
          2. Support Vectors
          3. Soft Margin
        2. Non-linear SVM
          1. Kernel Functions
          2. RBF Kernel
          3. Polynomial Kernel
        3. Hyperparameter Tuning
      5. Tree-based Methods
        1. Decision Trees
          1. Splitting Criteria
          2. Information Gain
          3. Gini Impurity
          4. Pruning
        2. Ensemble Methods
          1. Random Forest
          2. Gradient Boosting
          3. AdaBoost
          4. XGBoost
          5. LightGBM
      6. Neural Networks
        1. Perceptron
        2. Multi-layer Perceptron
        3. Backpropagation
        4. Activation Functions
      7. Classification Evaluation
        1. Accuracy
        2. Precision
        3. Recall
        4. F1-Score
        5. Confusion Matrix
        6. ROC Curve
        7. AUC Score
        8. Precision-Recall Curve
        9. Classification Report
    4. Unsupervised Learning
      1. Clustering
        1. k-Means Clustering
          1. Algorithm Steps
          2. Initialization Methods
          3. Choosing k
          4. Limitations
        2. Hierarchical Clustering
          1. Agglomerative Clustering
          2. Divisive Clustering
          3. Linkage Criteria
          4. Dendrograms
        3. Density-based Clustering
          1. DBSCAN
          2. OPTICS
          3. Mean Shift
        4. Model-based Clustering
          1. Gaussian Mixture Models
          2. Expectation-Maximization Algorithm
        5. Clustering Evaluation
          1. Silhouette Score
          2. Davies-Bouldin Index
          3. Calinski-Harabasz Index
          4. Adjusted Rand Index
      2. Dimensionality Reduction
        1. Principal Component Analysis
          1. Mathematical Foundation
          2. Eigenvalue Decomposition
          3. Explained Variance
          4. Component Interpretation
        2. Factor Analysis
          1. Latent Variables
          2. Factor Loadings
          3. Rotation Methods
        3. Independent Component Analysis
          1. Non-Gaussian Assumption
        4. Non-linear Methods
          1. t-SNE
          2. UMAP
          3. Isomap
          4. Locally Linear Embedding
                                                                                                                                                                                                                                                                                                        2. Association Rule Mining
                                                                                                                                                                                                                                                                                                          1. Market Basket Analysis
                                                                                                                                                                                                                                                                                                            1. Apriori Algorithm
                                                                                                                                                                                                                                                                                                              1. FP-Growth Algorithm
                                                                                                                                                                                                                                                                                                                1. Support and Confidence
                                                                                                                                                                                                                                                                                                                  1. Lift and Conviction
                                                                                                                                                                                                                                                                                                                2. Model Selection and Evaluation
                                                                                                                                                                                                                                                                                                                  1. Cross-validation Techniques
                                                                                                                                                                                                                                                                                                                    1. k-Fold Cross-validation
                                                                                                                                                                                                                                                                                                                      1. Stratified k-Fold
                                                                                                                                                                                                                                                                                                                        1. Leave-One-Out Cross-validation
                                                                                                                                                                                                                                                                                                                          1. Time Series Cross-validation
                                                                                                                                                                                                                                                                                                                            1. Nested Cross-validation
                                                                                                                                                                                                                                                                                                                            2. Hyperparameter Optimization
                                                                                                                                                                                                                                                                                                                              1. Grid Search
                                                                                                                                                                                                                                                                                                                                1. Random Search
                                                                                                                                                                                                                                                                                                                                  1. Bayesian Optimization
                                                                                                                                                                                                                                                                                                                                    1. Hyperband
                                                                                                                                                                                                                                                                                                                                      1. Population-based Training
                                                                                                                                                                                                                                                                                                                                      2. Model Comparison
                                                                                                                                                                                                                                                                                                                                        1. Statistical Significance Testing
                                                                                                                                                                                                                                                                                                                                          1. McNemar's Test
                                                                                                                                                                                                                                                                                                                                            1. Paired t-test
                                                                                                                                                                                                                                                                                                                                              1. Wilcoxon Signed-rank Test
                                                                                                                                                                                                                                                                                                                                              2. Learning Curves
                                                                                                                                                                                                                                                                                                                                                1. Training vs Validation Error
                                                                                                                                                                                                                                                                                                                                                  1. Sample Size Effects
                                                                                                                                                                                                                                                                                                                                                    1. Diagnosing Bias and Variance
                                                                                                                                                                                                                                                                                                                                                    2. Feature Importance
                                                                                                                                                                                                                                                                                                                                                      1. Permutation Importance
                                                                                                                                                                                                                                                                                                                                                        1. Tree-based Importance
                                                                                                                                                                                                                                                                                                                                                          1. Coefficient Analysis
                                                                                                                                                                                                                                                                                                                                                            1. SHAP Values