Useful Links
Computer Science
Data Science
Data Science
1. Foundations of Data Science
2. Mathematical and Statistical Foundations
3. Computational Foundations and Tools
4. Data Acquisition and Management
5. Exploratory Data Analysis
6. Feature Engineering and Selection
7. Machine Learning Fundamentals
8. Advanced Machine Learning Topics
9. Big Data and Distributed Computing
10. Data Visualization and Communication
11. Model Deployment and MLOps
12. Ethics and Responsible AI
Machine Learning Fundamentals
Core Concepts
What is Machine Learning
Definition and Scope
AI vs Machine Learning vs Deep Learning
Applications and Use Cases
Types of Machine Learning
Supervised Learning
Classification
Regression
Unsupervised Learning
Clustering
Dimensionality Reduction
Association Rules
Semi-supervised Learning
Reinforcement Learning
Online Learning
Transfer Learning
The Machine Learning Workflow
Problem Definition
Data Collection
Data Preprocessing
Feature Engineering
Model Selection
Training
Evaluation
Deployment
Monitoring
Training and Testing
Training Set
Validation Set
Test Set
Cross-validation
Hold-out Validation
Overfitting and Underfitting
Definitions
Causes
Detection Methods
Prevention Strategies
Bias-Variance Tradeoff
Bias Definition
Variance Definition
Tradeoff Implications
Model Complexity Effects
No Free Lunch Theorem
Curse of Dimensionality
Supervised Learning - Regression
Linear Regression
Simple Linear Regression
Mathematical Foundation
Least Squares Method
Assumptions
Interpretation
Multiple Linear Regression
Matrix Formulation
Parameter Estimation
Statistical Inference
Model Diagnostics
Polynomial Regression
Polynomial Features
Degree Selection
Overfitting Concerns
Regularized Regression
Ridge Regression
L2 Regularization
Hyperparameter Tuning
Geometric Interpretation
Lasso Regression
L1 Regularization
Feature Selection Properties
Coordinate Descent
Elastic Net
Combined L1 and L2
Parameter Selection
Non-linear Regression
Support Vector Regression
Kernel Trick
Hyperparameter Tuning
Advantages and Disadvantages
Decision Tree Regression
Tree Construction
Splitting Criteria
Pruning Techniques
Ensemble Methods
Random Forest Regression
Gradient Boosting Regression
XGBoost
LightGBM
Regression Evaluation
Mean Absolute Error
Mean Squared Error
Root Mean Squared Error
R-squared
Adjusted R-squared
Mean Absolute Percentage Error
Supervised Learning - Classification
Linear Classification
Logistic Regression
Sigmoid Function
Maximum Likelihood Estimation
Multiclass Extensions
Regularization
Linear Discriminant Analysis
Assumptions
Decision Boundaries
Quadratic Discriminant Analysis
Instance-based Learning
k-Nearest Neighbors
Distance Metrics
Choosing k
Weighted Voting
Curse of Dimensionality
Probabilistic Classifiers
Naive Bayes
Bayes' Theorem Foundation
Independence Assumption
Gaussian Naive Bayes
Multinomial Naive Bayes
Bernoulli Naive Bayes
Support Vector Machines
Linear SVM
Maximum Margin Principle
Support Vectors
Soft Margin
Non-linear SVM
Kernel Functions
RBF Kernel
Polynomial Kernel
Hyperparameter Tuning
Tree-based Methods
Decision Trees
Splitting Criteria
Information Gain
Gini Impurity
Pruning
Ensemble Methods
Random Forest
Gradient Boosting
AdaBoost
XGBoost
LightGBM
Neural Networks
Perceptron
Multi-layer Perceptron
Backpropagation
Activation Functions
Classification Evaluation
Accuracy
Precision
Recall
F1-Score
Confusion Matrix
ROC Curve
AUC Score
Precision-Recall Curve
Classification Report
Unsupervised Learning
Clustering
k-Means Clustering
Algorithm Steps
Initialization Methods
Choosing k
Limitations
Hierarchical Clustering
Agglomerative Clustering
Divisive Clustering
Linkage Criteria
Dendrograms
Density-based Clustering
DBSCAN
OPTICS
Mean Shift
Model-based Clustering
Gaussian Mixture Models
Expectation-Maximization Algorithm
Clustering Evaluation
Silhouette Score
Davies-Bouldin Index
Calinski-Harabasz Index
Adjusted Rand Index
Dimensionality Reduction
Principal Component Analysis
Mathematical Foundation
Eigenvalue Decomposition
Explained Variance
Component Interpretation
Factor Analysis
Latent Variables
Factor Loadings
Rotation Methods
Independent Component Analysis
Non-Gaussian Assumption
Non-linear Methods
t-SNE
UMAP
Isomap
Locally Linear Embedding
Association Rule Mining
Market Basket Analysis
Apriori Algorithm
FP-Growth Algorithm
Support and Confidence
Lift and Conviction
Model Selection and Evaluation
Cross-validation Techniques
k-Fold Cross-validation
Stratified k-Fold
Leave-One-Out Cross-validation
Time Series Cross-validation
Nested Cross-validation
Hyperparameter Optimization
Grid Search
Random Search
Bayesian Optimization
Hyperband
Population-based Training
Model Comparison
Statistical Significance Testing
McNemar's Test
Paired t-test
Wilcoxon Signed-rank Test
Learning Curves
Training vs Validation Error
Sample Size Effects
Diagnosing Bias and Variance
Feature Importance
Permutation Importance
Tree-based Importance
Coefficient Analysis
SHAP Values
Previous
6. Feature Engineering and Selection
Go to top
Next
8. Advanced Machine Learning Topics