Useful Links
Computer Science
Artificial Intelligence
Machine Learning
Machine Learning with Scikit-Learn
1. Introduction to Scikit-Learn
2. Core Scikit-Learn Concepts and API
3. Machine Learning Fundamentals
4. Data Preprocessing and Feature Engineering
5. Supervised Learning: Regression
6. Supervised Learning: Classification
7. Model Evaluation and Metrics
8. Improving Model Performance
9. Unsupervised Learning
10. Building Machine Learning Pipelines
11. Working with Text Data
12. Advanced Topics
13. Model Persistence and Deployment
14. Performance Optimization
15. Best Practices and Common Pitfalls
Unsupervised Learning
Clustering
Clustering Concepts
Similarity and Distance Measures
Cluster Validation
Choosing Number of Clusters
K-Means Clustering
Algorithm Steps
Centroid Initialization
Convergence Criteria
Limitations and Assumptions
Mini-Batch K-Means
Hierarchical Clustering
Agglomerative Clustering
Linkage Criteria
Single Linkage
Complete Linkage
Average Linkage
Ward Linkage
Dendrogram Interpretation
Density-Based Clustering
DBSCAN
Core Points
Border Points
Noise Points
Parameter Selection
OPTICS
Ordering Points
Gaussian Mixture Models
Probabilistic Clustering
Expectation-Maximization
Model Selection
Spectral Clustering
Graph-based Clustering
Affinity Matrices
Mean Shift Clustering
Mode Seeking
Bandwidth Selection
Clustering Evaluation
Silhouette Score
Calinski-Harabasz Index
Davies-Bouldin Index
Adjusted Rand Index
Dimensionality Reduction
Curse of Dimensionality
Linear Methods
Principal Component Analysis
Eigenvalue Decomposition
Variance Explained
Component Interpretation
Choosing Number of Components
Truncated SVD
Sparse Data Handling
Latent Semantic Analysis
Independent Component Analysis
Signal Separation
Factor Analysis
Latent Variables
Non-linear Methods
Manifold Learning
Locally Linear Embedding
Isomap
Geodesic Distances
Multi-dimensional Scaling
t-SNE
Perplexity Parameter
Learning Rate
Visualization Applications
UMAP
Feature Selection vs Feature Extraction
Dimensionality Reduction Evaluation
Anomaly Detection
Outlier vs Anomaly
Statistical Methods
Z-score Method
Modified Z-score
Isolation Forest
Random Partitioning
Anomaly Score
Local Outlier Factor
Density-based Detection
Local Density Comparison
One-Class SVM
Support Vector Description
Kernel Methods
Elliptic Envelope
Gaussian Distribution Assumption
Evaluation of Anomaly Detection
Precision and Recall
ROC Analysis
Previous
8. Improving Model Performance
Go to top
Next
10. Building Machine Learning Pipelines