Distributed Database Systems

  1. Distributed Query Processing
    1. Overview of Query Processing
      1. Query Decomposition
        1. Data Localization
          1. Global Query Optimization
            1. Local Query Optimization
              1. Query Execution
              2. Query Decomposition Steps
                1. Parsing and Validation
                  1. Query Normalization
                    1. Semantic Analysis
                      1. Simplification and Restructuring
                        1. Query Graph Generation
                          1. Algebraic Optimization
                          2. Data Localization
                            1. Mapping Global Queries to Local Fragments
                              1. Fragment Query Generation
                                1. Localization Program
                                  1. Fragment Substitution
                                  2. Global Query Optimization
                                    1. Objective Function
                                      1. I/O Costs
                                        1. CPU Costs
                                          1. Communication Costs
                                            1. Response Time
                                            2. Search Space of Execution Plans
                                              1. Plan Enumeration
                                                1. Heuristic Approaches
                                                  1. Dynamic Programming
                                                  2. Optimization Strategies
                                                    1. Centralized Optimization
                                                      1. Distributed Optimization
                                                        1. Hybrid Optimization
                                                        2. Cost Models
                                                          1. Statistical Cost Models
                                                            1. Parametric Cost Models
                                                          2. Local Query Optimization
                                                            1. Local Plan Generation
                                                              1. Local Cost Estimation
                                                                1. Local Execution Strategies
                                                                2. Distributed Join Strategies
                                                                  1. Nested Loop Join
                                                                    1. Sort-Merge Join
                                                                      1. Hash Join
                                                                        1. Semijoin Operation
                                                                          1. Reducing Data Transfer
                                                                            1. Semijoin Variants
                                                                              1. Bloom Join
                                                                              2. Ship Operations
                                                                                1. Ship Whole Relations
                                                                                  1. Ship Fragments
                                                                                    1. Ship Results
                                                                                    2. Join Ordering
                                                                                      1. Parallel Join Execution