Useful Links
Computer Science
Artificial Intelligence
Machine Learning
Machine Learning with Scikit-Learn
1. Introduction to Scikit-Learn
2. Core Scikit-Learn Concepts and API
3. Machine Learning Fundamentals
4. Data Preprocessing and Feature Engineering
5. Supervised Learning: Regression
6. Supervised Learning: Classification
7. Model Evaluation and Metrics
8. Improving Model Performance
9. Unsupervised Learning
10. Building Machine Learning Pipelines
11. Working with Text Data
12. Advanced Topics
13. Model Persistence and Deployment
14. Performance Optimization
15. Best Practices and Common Pitfalls
Model Evaluation and Metrics
Data Splitting Strategies
Train-Test Split
Purpose and Importance
Split Ratios
Random State
Stratified Splitting
Preserving Class Distribution
Use in Classification
Time Series Splitting
Temporal Data Considerations
TimeSeriesSplit
Group-based Splitting
GroupKFold
LeaveOneGroupOut
Cross-Validation
Purpose and Benefits
K-Fold Cross-Validation
Fold Selection
Computational Considerations
Stratified K-Fold
Class Balance Preservation
Leave-One-Out Cross-Validation
Extreme Case of K-Fold
Computational Cost
Leave-P-Out Cross-Validation
Shuffle Split
Random Sampling
Cross-Validation Utilities
cross_val_score
cross_validate
cross_val_predict
Classification Metrics
Accuracy
Definition and Calculation
Limitations with Imbalanced Data
Precision
True Positive Rate
Class-specific Precision
Recall
Sensitivity
Class-specific Recall
F1-Score
Harmonic Mean
Macro and Micro Averaging
F-beta Score
Weighted Harmonic Mean
Confusion Matrix
True Positives
False Positives
True Negatives
False Negatives
Visualization
Classification Report
Comprehensive Metrics
ROC Curve
True Positive Rate vs False Positive Rate
Threshold Selection
Multi-class Extension
AUC Score
Area Under the ROC Curve
Interpretation
Precision-Recall Curve
Imbalanced Dataset Evaluation
Average Precision Score
Log Loss
Probabilistic Predictions
Matthews Correlation Coefficient
Balanced Measure
Cohen's Kappa
Inter-rater Agreement
Hamming Loss
Multi-label Classification
Jaccard Score
Set Similarity
Regression Metrics
Mean Absolute Error
L1 Loss
Robustness to Outliers
Mean Squared Error
L2 Loss
Sensitivity to Outliers
Root Mean Squared Error
Interpretability
Mean Absolute Percentage Error
Relative Error
R-squared
Coefficient of Determination
Explained Variance
Adjusted R-squared
Penalty for Additional Features
Mean Squared Log Error
Relative Errors
Median Absolute Error
Robust Metric
Explained Variance Score
Custom Scoring Functions
Creating Custom Metrics
make_scorer Function
Scorer Objects
Previous
6. Supervised Learning: Classification
Go to top
Next
8. Improving Model Performance