Useful Links
Statistics
Machine Learning
1. Introduction to Machine Learning
2. Mathematical and Statistical Foundations
3. Data Preprocessing and Feature Engineering
4. Supervised Learning
5. Unsupervised Learning
6. Model Evaluation and Validation
7. Ensemble Methods and Advanced Techniques
8. Deep Learning and Neural Networks
9. Reinforcement Learning
10. Advanced Topics and Specialized Areas
11. Machine Learning Operations and Deployment
Data Preprocessing and Feature Engineering
Understanding Data
Data Types
Numerical Data
Categorical Data
Ordinal Data
Text Data
Time Series Data
Data Quality Assessment
Completeness
Accuracy
Consistency
Timeliness
Exploratory Data Analysis
Descriptive Statistics
Data Visualization
Pattern Recognition
Outlier Identification
Data Collection and Storage
Data Sources
Databases
APIs
Web Scraping
Sensors and IoT
Data Formats
CSV
JSON
XML
Parquet
HDF5
Data Storage Systems
Relational Databases
NoSQL Databases
Data Warehouses
Data Lakes
Data Cleaning
Handling Missing Values
Types of Missing Data
Missing Data Mechanisms
Deletion Methods
Imputation Techniques
Mean Imputation
Median Imputation
Mode Imputation
Forward Fill
Backward Fill
Interpolation
Model-Based Imputation
Correcting Inconsistent Data
Data Type Conversion
Format Standardization
Unit Conversion
Encoding Issues
Outlier Detection and Treatment
Statistical Methods
Z-Score Method
IQR Method
Modified Z-Score
Visualization Techniques
Box Plots
Scatter Plots
Histograms
Machine Learning Methods
Isolation Forest
Local Outlier Factor
Outlier Treatment
Removal
Transformation
Capping
Dealing with Duplicates
Exact Duplicates
Near Duplicates
Deduplication Strategies
Feature Scaling and Normalization
Standardization
Z-Score Normalization
Robust Standardization
Normalization
Min-Max Scaling
Max Scaling
Unit Vector Scaling
Other Transformations
Log Transformation
Square Root Transformation
Box-Cox Transformation
Yeo-Johnson Transformation
When to Apply Scaling
Algorithm Requirements
Feature Magnitude Differences
Feature Engineering
Feature Creation
Polynomial Features
Interaction Features
Ratio Features
Aggregation Features
Time-Based Features
Domain-Specific Features
Encoding Categorical Variables
One-Hot Encoding
Label Encoding
Ordinal Encoding
Binary Encoding
Target Encoding
Frequency Encoding
Hash Encoding
Binning and Discretization
Equal-Width Binning
Equal-Frequency Binning
K-Means Binning
Custom Binning
Feature Extraction
Text Feature Extraction
Bag of Words
TF-IDF
N-grams
Image Feature Extraction
Pixel Values
Color Histograms
Texture Features
Time Series Feature Extraction
Statistical Features
Frequency Domain Features
Feature Selection
Filter Methods
Univariate Statistical Tests
Correlation Coefficient
Chi-Squared Test
Mutual Information
Variance Threshold
Wrapper Methods
Forward Selection
Backward Elimination
Recursive Feature Elimination
Exhaustive Search
Embedded Methods
Lasso Regression
Ridge Regression
Elastic Net
Tree-Based Feature Importance
Regularization Paths
Dimensionality Reduction
Principal Component Analysis
Linear Discriminant Analysis
Independent Component Analysis
Factor Analysis
Previous
2. Mathematical and Statistical Foundations
Go to top
Next
4. Supervised Learning