Data Cleaning

  1. Techniques for Fixing Structural Errors
    1. Data Type Conversion and Validation
      1. String to Numeric Conversion
        1. Integer Conversion
          1. Float Conversion
            1. Decimal Conversion
              1. Error Handling
              2. String to DateTime Conversion
                1. Format Detection
                  1. Parsing Strategies
                    1. Time Zone Handling
                      1. Invalid Date Handling
                      2. Numeric to Categorical Conversion
                        1. Binning Strategies
                          1. Equal Width Binning
                            1. Equal Frequency Binning
                              1. Custom Binning
                                1. Optimal Binning
                                2. Threshold-Based Categorization
                                  1. Quantile-Based Categorization
                                  2. Categorical to Numeric Encoding
                                    1. Label Encoding
                                      1. One-Hot Encoding
                                        1. Binary Encoding
                                          1. Target Encoding
                                            1. Frequency Encoding
                                            2. Boolean Data Handling
                                              1. True/False Standardization
                                                1. Yes/No Conversion
                                                  1. 1/0 Mapping
                                                    1. Null Handling
                                                  2. String and Text Processing
                                                    1. Basic String Operations
                                                      1. Trimming Operations
                                                        1. Padding Operations
                                                          1. Substring Extraction
                                                            1. String Concatenation
                                                            2. Advanced Text Processing
                                                              1. Regular Expression Matching
                                                                1. Pattern Extraction
                                                                  1. Text Parsing
                                                                    1. Token Extraction
                                                                    2. Text Cleaning
                                                                      1. Special Character Removal
                                                                        1. HTML Tag Removal
                                                                          1. URL Extraction and Removal
                                                                            1. Email Extraction and Validation
                                                                            2. Text Normalization
                                                                              1. Unicode Normalization
                                                                                1. Accent Removal
                                                                                  1. Punctuation Standardization
                                                                                    1. Case Normalization
                                                                                    2. Column Splitting and Merging
                                                                                      1. Delimiter-Based Splitting
                                                                                        1. Fixed-Width Splitting
                                                                                          1. Pattern-Based Splitting
                                                                                            1. Intelligent Column Merging
                                                                                          2. Schema and Structure Optimization
                                                                                            1. Column Management
                                                                                              1. Column Renaming
                                                                                                1. Column Reordering
                                                                                                  1. Column Addition
                                                                                                    1. Column Removal
                                                                                                    2. Data Reshaping
                                                                                                      1. Wide to Long Format
                                                                                                        1. Long to Wide Format
                                                                                                          1. Pivot Operations
                                                                                                            1. Unpivot Operations
                                                                                                            2. Hierarchical Data Handling
                                                                                                              1. Nested Structure Flattening
                                                                                                                1. JSON Data Processing
                                                                                                                  1. XML Data Processing
                                                                                                                    1. Tree Structure Normalization
                                                                                                                    2. Relational Structure Creation
                                                                                                                      1. Primary Key Assignment
                                                                                                                        1. Foreign Key Relationships
                                                                                                                          1. Normalization Processes
                                                                                                                            1. Denormalization Strategies
                                                                                                                          2. Multi-Value Field Processing
                                                                                                                            1. List and Array Handling
                                                                                                                              1. List Parsing
                                                                                                                                1. Array Splitting
                                                                                                                                  1. Element Extraction
                                                                                                                                    1. List Normalization
                                                                                                                                    2. Delimited Value Processing
                                                                                                                                      1. Multi-Value Splitting
                                                                                                                                        1. Value Standardization
                                                                                                                                          1. Duplicate Removal
                                                                                                                                            1. Order Preservation
                                                                                                                                            2. Nested Data Extraction
                                                                                                                                              1. JSON Field Extraction
                                                                                                                                                1. XML Element Extraction
                                                                                                                                                  1. Nested Object Flattening
                                                                                                                                                    1. Recursive Processing
                                                                                                                                                    2. Relationship Extraction
                                                                                                                                                      1. Many-to-Many Relationships
                                                                                                                                                        1. Association Tables
                                                                                                                                                          1. Reference Tables