Data Engineering

  1. Cloud Data Engineering Platforms
    1. Cloud Computing Fundamentals
      1. Service Models
        1. Infrastructure as a Service
        2. Platform as a Service
        3. Software as a Service
      2. Deployment Models
        1. Public Cloud
        2. Private Cloud
        3. Hybrid Cloud
        4. Multi-Cloud Strategies
      3. Cloud Adoption Benefits
        1. Scalability and Elasticity
        2. Cost Optimization
        3. Managed Services
        4. Global Availability
      4. Cloud Migration Challenges
        1. Data Transfer Costs
        2. Vendor Lock-In Concerns
        3. Security and Compliance
        4. Skills and Training Requirements
    2. Amazon Web Services Data Services
      1. Storage Services
        1. S3 Object Storage
          1. Bucket Organization
          2. Storage Classes
          3. Lifecycle Policies
          4. Security and Access Control
        2. Glacier Archival Storage
          1. Archive Retrieval Options
          2. Cost Optimization Strategies
      2. Database Services
        1. RDS Managed Databases
          1. Multi-AZ Deployments
          2. Read Replicas
          3. Backup and Recovery
        2. DynamoDB NoSQL Database
          1. Partition Key Design
          2. Global Secondary Indexes
          3. DynamoDB Streams
      3. Data Warehousing
        1. Amazon Redshift
          1. Cluster Architecture
          2. Distribution Strategies
          3. Query Optimization
          4. Workload Management
      4. Data Processing Services
        1. EMR Managed Hadoop
          1. Cluster Configuration
          2. Auto Scaling
          3. Cost Optimization
        2. AWS Glue ETL Service
          1. Data Catalog
          2. Job Development
          3. Serverless Processing
      5. Streaming Services
        1. Kinesis Data Streams
          1. Shard Management
          2. Producer and Consumer APIs
        2. Kinesis Data Firehose
          1. Delivery Stream Configuration
          2. Data Transformation
      6. Orchestration Services
        1. Step Functions
          1. State Machine Definition
          2. Error Handling
          3. Integration Patterns
    3. Google Cloud Platform Data Services
      1. Storage Services
        1. Cloud Storage
          1. Storage Classes
          2. Object Lifecycle Management
          3. Access Control Models
      2. Database Services
        1. Cloud SQL
          1. High Availability Configuration
          2. Read Replicas
          3. Automated Backups
        2. Bigtable NoSQL Database
          1. Schema Design
          2. Performance Optimization
      3. Data Warehousing
        1. BigQuery
          1. Dataset Organization
          2. Query Optimization
          3. Partitioning and Clustering
          4. Streaming Inserts
      4. Data Processing Services
        1. Dataproc Managed Spark
          1. Cluster Management
          2. Job Submission
          3. Auto Scaling
        2. Dataflow Stream Processing
          1. Apache Beam Integration
          2. Template Development
      5. Messaging Services
        1. Pub/Sub
          1. Topic and Subscription Management
          2. Message Ordering
          3. Dead Letter Topics
      6. Orchestration Services
        1. Cloud Composer
          1. Airflow-Based Workflows
          2. Environment Management
    4. Microsoft Azure Data Services
      1. Storage Services
        1. Blob Storage
          1. Container Organization
          2. Access Tiers
          3. Lifecycle Management
        2. Data Lake Storage
          1. Hierarchical Namespace
          2. Access Control Lists
      2. Database Services
        1. Azure SQL Database
          1. Elastic Pools
          2. Geo-Replication
          3. Automated Tuning
        2. Cosmos DB
          1. Multi-Model Support
          2. Global Distribution
          3. Consistency Levels
      3. Data Warehousing
        1. Azure Synapse Analytics
          1. SQL Pool Architecture
          2. Data Integration Pipelines
          3. Apache Spark Integration
      4. Data Processing Services
        1. HDInsight
          1. Cluster Types
          2. Enterprise Security Package
        2. Azure Data Factory
          1. Pipeline Development
          2. Mapping Data Flows
          3. Integration Runtime
      5. Streaming Services
        1. Event Hubs
          1. Partition Management
          2. Capture Feature
          3. Auto-Inflate