Useful Links
Computer Science
Data Science
Data Mining and Knowledge Discovery
1. Introduction to Data Mining and Knowledge Discovery
2. Data Types and Sources
3. Data Preprocessing Fundamentals
4. Classification Methods
5. Regression Analysis
6. Clustering Analysis
7. Association Rule Mining
8. Advanced Mining Techniques
9. Model Evaluation and Validation
10. Model Interpretation and Explainability
11. Deployment and Production Systems
12. Ethics, Privacy, and Security
Data Preprocessing Fundamentals
Understanding Data Quality
Accuracy Assessment
Completeness Evaluation
Consistency Checking
Timeliness Considerations
Believability Factors
Data Profiling Techniques
Data Cleaning Processes
Missing Value Handling
Missing Data Patterns
Deletion Methods
Imputation Techniques
Mean and Median Substitution
Forward and Backward Fill
Multiple Imputation
Noise Reduction
Noise Identification Methods
Smoothing Techniques
Binning Approaches
Regression-Based Smoothing
Clustering for Outlier Detection
Inconsistency Resolution
Data Validation Rules
Constraint Checking
Reference Data Validation
Cross-Field Validation
Data Integration Techniques
Multi-Source Data Combination
Schema Integration
Schema Matching Algorithms
Schema Mapping Techniques
Ontology Alignment
Entity Resolution
Duplicate Detection Methods
Record Linkage Algorithms
Similarity Measures
Blocking Techniques
Redundancy Management
Correlation Analysis
Covariance Computation
Statistical Dependency Tests
Data Reduction Strategies
Dimensionality Reduction
Feature Selection Methods
Filter-Based Selection
Wrapper-Based Selection
Embedded Selection
Univariate Selection
Recursive Feature Elimination
Feature Extraction Techniques
Principal Component Analysis
Linear Discriminant Analysis
Independent Component Analysis
t-Distributed Stochastic Neighbor Embedding
Multidimensional Scaling
Numerosity Reduction
Sampling Techniques
Simple Random Sampling
Stratified Sampling
Systematic Sampling
Cluster Sampling
Data Aggregation
Histogram Construction
Clustering-Based Reduction
Data Compression
Lossless Compression Methods
Lossy Compression Techniques
Wavelet Transforms
Data Transformation Methods
Normalization Techniques
Min-Max Normalization
Z-Score Standardization
Decimal Scaling
Robust Scaling
Discretization Approaches
Equal-Width Binning
Equal-Frequency Binning
Entropy-Based Discretization
Chi-Square-Based Discretization
Attribute Construction
Feature Engineering Principles
Domain-Specific Features
Interaction Features
Polynomial Features
Previous
2. Data Types and Sources
Go to top
Next
4. Classification Methods