Statistics with R

  1. Data Cleaning and Preprocessing
    1. Data Quality Assessment
      1. Identifying Data Issues
      2. Data Profiling
      3. Completeness Checks
      4. Consistency Checks
    2. Missing Data Handling
      1. Types of Missing Data
      2. Identifying Missing Values
        1. is.na() Function
        2. complete.cases() Function
        3. Missing Data Patterns
      3. Removing Missing Values
        1. na.omit() Function
        2. drop_na() Function
      4. Imputation Methods
        1. Mean Imputation
        2. Median Imputation
        3. Mode Imputation
        4. Forward Fill
        5. Backward Fill
    3. Duplicate Data
      1. Identifying Duplicates
        1. duplicated() Function
        2. distinct() Function
      2. Removing Duplicates
        1. unique() Function
        2. Deduplication Strategies
    4. Data Type Conversion
      1. Type Checking
      2. Numeric Conversion
      3. Character Conversion
      4. Factor Conversion
      5. Date and Time Conversion
        1. as.Date() Function
        2. lubridate Package
        3. Date Formats
    5. Variable Transformation
      1. Renaming Variables
      2. Recoding Variables
      3. Creating New Variables
      4. Binning Continuous Variables
      5. Standardization
      6. Normalization
    6. Text Data Cleaning
      1. String Manipulation
      2. Regular Expressions
      3. Case Conversion
      4. Trimming Whitespace
      5. Pattern Matching