Python for Data Science

  1. Data Manipulation with Pandas
    1. Pandas Data Structures
      1. Series Objects
        1. Series Creation
          1. From Lists and Arrays
          2. From Dictionaries
          3. From Scalar Values
          4. Series Constructor Parameters
        2. Series Attributes
          1. Index and Values
          2. Data Type Information
          3. Shape and Size
        3. Series Indexing
          1. Label-based Indexing
          2. Position-based Indexing
          3. Boolean Indexing
        4. Series Operations
          1. Arithmetic Operations
          2. String Operations
          3. Statistical Methods
            1. Descriptive Statistics
            2. Aggregation Functions
          4. Missing Data Handling
      2. DataFrame Objects
        1. DataFrame Creation
          1. From Dictionaries
            1. Column-oriented Data
            2. Row-oriented Data
          2. From Lists of Lists
          3. From NumPy Arrays
          4. From Series Objects
          5. DataFrame Constructor Parameters
        2. DataFrame Structure
          1. Index and Columns
          2. Data Types
          3. Shape and Info
          4. Memory Usage
        3. DataFrame Attributes
          1. columns Attribute
          2. index Attribute
          3. dtypes Attribute
          4. shape Attribute
      3. Index Objects
        1. Index Types
          1. RangeIndex
          2. Int64Index
          3. DatetimeIndex
          4. MultiIndex
        2. Index Operations
          1. Index Creation
          2. Index Manipulation
          3. Index Alignment
    2. Data Input and Output
      1. Reading Data Files
        1. CSV Files
          1. read_csv() Function
          2. Common Parameters
            1. sep and delimiter
            2. header and names
            3. index_col
            4. dtype Specification
            5. parse_dates
            6. na_values
          3. Handling Large Files
            1. chunksize Parameter
            2. nrows Parameter
        2. Excel Files
          1. read_excel() Function
          2. Sheet Selection
          3. Range Specification
          4. Multiple Sheets
        3. JSON Data
          1. read_json() Function
          2. JSON Normalization
          3. Nested JSON Handling
        4. Database Connections
          1. read_sql() Function
          2. Database Engines
          3. SQL Query Execution
        5. Other Formats
          1. Parquet Files
          2. HDF5 Files
          3. Pickle Files
      2. Writing Data Files
        1. CSV Export
          1. to_csv() Method
          2. Index and Header Control
          3. Encoding Options
        2. Excel Export
          1. to_excel() Method
          2. Multiple Sheets
          3. Formatting Options
        3. Database Export
          1. to_sql() Method
          2. Append vs Replace
          3. Index Handling
        4. Other Export Formats
          1. JSON Export
          2. Parquet Export
          3. Pickle Export
    3. Data Selection and Filtering
      1. Column Operations
        1. Single Column Selection
        2. Multiple Column Selection
        3. Column Reordering
        4. Column Renaming
          1. rename() Method
          2. Column Assignment
      2. Row Operations
        1. Label-based Selection
          1. loc Indexer
          2. Single Row Selection
          3. Multiple Row Selection
          4. Row Slicing
        2. Position-based Selection
          1. iloc Indexer
          2. Integer Indexing
          3. Position Slicing
        3. Boolean Indexing
          1. Single Conditions
          2. Multiple Conditions
          3. Complex Boolean Logic
          4. query() Method
      3. Advanced Selection
        1. Conditional Selection
          1. where() Method
          2. mask() Method
        2. Random Sampling
          1. sample() Method
          2. Sampling Parameters
        3. Top and Bottom Selection
          1. head() and tail() Methods
          2. nlargest() and nsmallest()
    4. Data Cleaning and Preparation
      1. Missing Data Handling
        1. Identifying Missing Data
          1. isnull() and isna() Methods
          2. notnull() and notna() Methods
          3. Missing Data Patterns
        2. Removing Missing Data
          1. dropna() Method
          2. Axis Parameter
          3. Threshold Parameters
          4. Subset Parameter
        3. Filling Missing Data
          1. fillna() Method
          2. Fill Strategies
            1. Forward Fill
            2. Backward Fill
            3. Constant Values
            4. Statistical Measures
          3. Interpolation Methods
            1. Linear Interpolation
            2. Polynomial Interpolation
            3. Time-based Interpolation
      2. Data Type Conversion
        1. Automatic Type Inference
        2. Explicit Type Conversion
          1. astype() Method
          2. Numeric Conversion
          3. Categorical Data
        3. String to Numeric Conversion
          1. to_numeric() Function
          2. Error Handling
      3. Duplicate Data Handling
        1. Identifying Duplicates
          1. duplicated() Method
          2. Duplicate Criteria
        2. Removing Duplicates
          1. drop_duplicates() Method
          2. Keep Parameters
          3. Subset Specification
    5. Data Transformation
      1. Applying Functions
        1. Element-wise Operations
          1. apply() Method
          2. Lambda Functions
          3. Custom Functions
        2. Column-wise Operations
          1. map() Method
          2. replace() Method
        3. DataFrame-wide Operations
          1. applymap() Method
          2. Vectorized Operations
      2. String Operations
        1. String Accessor (.str)
        2. String Methods
          1. Case Conversion
          2. String Splitting
          3. Pattern Matching
          4. String Replacement
        3. Regular Expressions
          1. Pattern Extraction
          2. Pattern Replacement
          3. String Validation
    6. Data Reshaping
      1. Adding and Removing Columns
        1. Column Assignment
        2. Column Deletion
        3. Computed Columns
      2. Row Operations
        1. Row Addition
        2. Row Deletion
                                                                                                                                                                                                                                                                                              1. Row Reordering
                                                                                                                                                                                                                                                                                            2. Categorical Data
                                                                                                                                                                                                                                                                                              1. Creating Categorical Data
                                                                                                                                                                                                                                                                                                1. Categorical Operations
                                                                                                                                                                                                                                                                                                  1. Memory Benefits
                                                                                                                                                                                                                                                                                                    1. Ordered Categories
                                                                                                                                                                                                                                                                                                  2. Grouping and Aggregation
                                                                                                                                                                                                                                                                                                    1. GroupBy Operations
                                                                                                                                                                                                                                                                                                      1. Basic Grouping
                                                                                                                                                                                                                                                                                                        1. Single Column Grouping
                                                                                                                                                                                                                                                                                                          1. Multiple Column Grouping
                                                                                                                                                                                                                                                                                                            1. GroupBy Object Properties
                                                                                                                                                                                                                                                                                                            2. Group Iteration
                                                                                                                                                                                                                                                                                                              1. Iterating Over Groups
                                                                                                                                                                                                                                                                                                                1. Group Selection
                                                                                                                                                                                                                                                                                                                2. Group Filtering
                                                                                                                                                                                                                                                                                                                  1. filter() Method
                                                                                                                                                                                                                                                                                                                    1. Group-level Conditions
                                                                                                                                                                                                                                                                                                                  2. Aggregation Methods
                                                                                                                                                                                                                                                                                                                    1. Built-in Aggregations
                                                                                                                                                                                                                                                                                                                      1. sum(), mean(), count()
                                                                                                                                                                                                                                                                                                                        1. min(), max(), std()
                                                                                                                                                                                                                                                                                                                          1. first(), last()
                                                                                                                                                                                                                                                                                                                          2. Multiple Aggregations
                                                                                                                                                                                                                                                                                                                            1. agg() Method
                                                                                                                                                                                                                                                                                                                              1. Dictionary Aggregation
                                                                                                                                                                                                                                                                                                                                1. Named Aggregations
                                                                                                                                                                                                                                                                                                                                2. Custom Aggregation Functions
                                                                                                                                                                                                                                                                                                                                  1. User-defined Functions
                                                                                                                                                                                                                                                                                                                                    1. Lambda Aggregations
                                                                                                                                                                                                                                                                                                                                  2. Transformation Operations
                                                                                                                                                                                                                                                                                                                                    1. transform() Method
                                                                                                                                                                                                                                                                                                                                      1. Group-wise Transformations
                                                                                                                                                                                                                                                                                                                                        1. Standardization and Normalization
                                                                                                                                                                                                                                                                                                                                        2. Advanced Grouping
                                                                                                                                                                                                                                                                                                                                          1. Hierarchical Grouping
                                                                                                                                                                                                                                                                                                                                            1. MultiIndex Results
                                                                                                                                                                                                                                                                                                                                              1. Index Manipulation
                                                                                                                                                                                                                                                                                                                                              2. Time-based Grouping
                                                                                                                                                                                                                                                                                                                                                1. Grouper Object
                                                                                                                                                                                                                                                                                                                                                  1. Frequency-based Grouping
                                                                                                                                                                                                                                                                                                                                                  2. Binning and Discretization
                                                                                                                                                                                                                                                                                                                                                    1. cut() Function
                                                                                                                                                                                                                                                                                                                                                      1. qcut() Function
                                                                                                                                                                                                                                                                                                                                                        1. Custom Bins
                                                                                                                                                                                                                                                                                                                                                    2. Combining and Merging Data
                                                                                                                                                                                                                                                                                                                                                      1. Concatenation
                                                                                                                                                                                                                                                                                                                                                        1. Basic Concatenation
                                                                                                                                                                                                                                                                                                                                                          1. concat() Function
                                                                                                                                                                                                                                                                                                                                                            1. Axis Parameter
                                                                                                                                                                                                                                                                                                                                                              1. Ignore Index
                                                                                                                                                                                                                                                                                                                                                              2. Handling Overlapping Data
                                                                                                                                                                                                                                                                                                                                                                1. Duplicate Handling
                                                                                                                                                                                                                                                                                                                                                                  1. Key Conflicts
                                                                                                                                                                                                                                                                                                                                                                  2. Multi-level Concatenation
                                                                                                                                                                                                                                                                                                                                                                    1. Hierarchical Indexing
                                                                                                                                                                                                                                                                                                                                                                      1. Key Assignment
                                                                                                                                                                                                                                                                                                                                                                    2. Database-style Joins
                                                                                                                                                                                                                                                                                                                                                                      1. Merge Operations
                                                                                                                                                                                                                                                                                                                                                                        1. merge() Function
                                                                                                                                                                                                                                                                                                                                                                          1. Join Types
                                                                                                                                                                                                                                                                                                                                                                            1. Inner Join
                                                                                                                                                                                                                                                                                                                                                                              1. Left Join
                                                                                                                                                                                                                                                                                                                                                                                1. Right Join
                                                                                                                                                                                                                                                                                                                                                                                  1. Outer Join
                                                                                                                                                                                                                                                                                                                                                                                2. Join Keys
                                                                                                                                                                                                                                                                                                                                                                                  1. Single Key Joins
                                                                                                                                                                                                                                                                                                                                                                                    1. Multiple Key Joins
                                                                                                                                                                                                                                                                                                                                                                                      1. Key Specification
                                                                                                                                                                                                                                                                                                                                                                                      2. Merge Validation
                                                                                                                                                                                                                                                                                                                                                                                        1. Validate Parameter
                                                                                                                                                                                                                                                                                                                                                                                          1. Merge Indicators
                                                                                                                                                                                                                                                                                                                                                                                        2. Index-based Joining
                                                                                                                                                                                                                                                                                                                                                                                          1. join() Method
                                                                                                                                                                                                                                                                                                                                                                                            1. Index Alignment
                                                                                                                                                                                                                                                                                                                                                                                              1. Suffix Handling
                                                                                                                                                                                                                                                                                                                                                                                              2. Advanced Merging
                                                                                                                                                                                                                                                                                                                                                                                                1. Fuzzy Matching
                                                                                                                                                                                                                                                                                                                                                                                                  1. Cross Joins
                                                                                                                                                                                                                                                                                                                                                                                                    1. Merge Performance Optimization
                                                                                                                                                                                                                                                                                                                                                                                                  2. Time Series Data Analysis
                                                                                                                                                                                                                                                                                                                                                                                                    1. DateTime Handling
                                                                                                                                                                                                                                                                                                                                                                                                      1. DateTime Parsing
                                                                                                                                                                                                                                                                                                                                                                                                        1. to_datetime() Function
                                                                                                                                                                                                                                                                                                                                                                                                          1. Date Formats
                                                                                                                                                                                                                                                                                                                                                                                                            1. Error Handling
                                                                                                                                                                                                                                                                                                                                                                                                            2. DateTime Components
                                                                                                                                                                                                                                                                                                                                                                                                              1. Date and Time Extraction
                                                                                                                                                                                                                                                                                                                                                                                                                1. Weekday and Month Names
                                                                                                                                                                                                                                                                                                                                                                                                                  1. Custom Date Attributes
                                                                                                                                                                                                                                                                                                                                                                                                                2. Time Series Indexing
                                                                                                                                                                                                                                                                                                                                                                                                                  1. DatetimeIndex Creation
                                                                                                                                                                                                                                                                                                                                                                                                                    1. Time-based Selection
                                                                                                                                                                                                                                                                                                                                                                                                                      1. Date Range Selection
                                                                                                                                                                                                                                                                                                                                                                                                                        1. Partial String Indexing
                                                                                                                                                                                                                                                                                                                                                                                                                        2. Time Zone Handling
                                                                                                                                                                                                                                                                                                                                                                                                                          1. Time Zone Conversion
                                                                                                                                                                                                                                                                                                                                                                                                                            1. Localization
                                                                                                                                                                                                                                                                                                                                                                                                                          2. Resampling Operations
                                                                                                                                                                                                                                                                                                                                                                                                                            1. Frequency Conversion
                                                                                                                                                                                                                                                                                                                                                                                                                              1. Downsampling
                                                                                                                                                                                                                                                                                                                                                                                                                                1. Upsampling
                                                                                                                                                                                                                                                                                                                                                                                                                                  1. Custom Frequencies
                                                                                                                                                                                                                                                                                                                                                                                                                                  2. Aggregation During Resampling
                                                                                                                                                                                                                                                                                                                                                                                                                                    1. Statistical Aggregations
                                                                                                                                                                                                                                                                                                                                                                                                                                      1. Custom Aggregation Functions
                                                                                                                                                                                                                                                                                                                                                                                                                                      2. Resampling Parameters
                                                                                                                                                                                                                                                                                                                                                                                                                                        1. Label and Closed Parameters
                                                                                                                                                                                                                                                                                                                                                                                                                                          1. Offset and Base Parameters
                                                                                                                                                                                                                                                                                                                                                                                                                                        2. Time Series Manipulation
                                                                                                                                                                                                                                                                                                                                                                                                                                          1. Shifting Data
                                                                                                                                                                                                                                                                                                                                                                                                                                            1. shift() Method
                                                                                                                                                                                                                                                                                                                                                                                                                                              1. Lead and Lag Operations
                                                                                                                                                                                                                                                                                                                                                                                                                                              2. Rolling Windows
                                                                                                                                                                                                                                                                                                                                                                                                                                                1. rolling() Method
                                                                                                                                                                                                                                                                                                                                                                                                                                                  1. Window Size and Types
                                                                                                                                                                                                                                                                                                                                                                                                                                                    1. Rolling Aggregations
                                                                                                                                                                                                                                                                                                                                                                                                                                                    2. Expanding Windows
                                                                                                                                                                                                                                                                                                                                                                                                                                                      1. expanding() Method
                                                                                                                                                                                                                                                                                                                                                                                                                                                        1. Cumulative Operations