Big Data Technologies

  1. Cloud-Based Big Data Platforms
    1. Big Data as a Service (BDaaS)
      1. Managed Services
        1. Scalability and Elasticity
          1. Cost Models
            1. Service Level Agreements
            2. Amazon Web Services (AWS)
              1. Storage
                1. S3
                  1. Object Storage
                    1. Lifecycle Management
                      1. Storage Classes
                        1. Security Features
                        2. EFS
                          1. Elastic File System
                          2. EBS
                            1. Block Storage
                          3. Managed Hadoop/Spark
                            1. EMR
                              1. Cluster Provisioning
                                1. Integration with AWS Services
                                  1. Auto Scaling
                                    1. Spot Instances
                                  2. Data Warehousing
                                    1. Redshift
                                      1. Columnar Storage
                                        1. Massively Parallel Processing
                                          1. Spectrum
                                            1. Concurrency Scaling
                                          2. Stream Processing
                                            1. Kinesis
                                              1. Data Streams
                                                1. Firehose
                                                  1. Analytics
                                                    1. Video Streams
                                                  2. NoSQL Databases
                                                    1. DynamoDB
                                                      1. Key-Value and Document Store
                                                        1. Global Tables
                                                          1. DynamoDB Streams
                                                        2. Analytics Services
                                                          1. Athena
                                                            1. Serverless Query Service
                                                            2. QuickSight
                                                              1. Business Intelligence
                                                            3. Machine Learning
                                                              1. SageMaker
                                                                1. Model Development
                                                                  1. Training and Deployment
                                                              2. Google Cloud Platform (GCP)
                                                                1. Storage
                                                                  1. Cloud Storage
                                                                    1. Buckets
                                                                      1. Access Control
                                                                        1. Storage Classes
                                                                      2. Managed Hadoop/Spark
                                                                        1. Dataproc
                                                                          1. Cluster Management
                                                                            1. Autoscaling
                                                                              1. Preemptible Instances
                                                                            2. Data Warehousing
                                                                              1. BigQuery
                                                                                1. Serverless Architecture
                                                                                  1. SQL Analytics
                                                                                    1. ML Integration
                                                                                      1. Streaming Inserts
                                                                                    2. Stream Processing
                                                                                      1. Pub/Sub
                                                                                        1. Messaging Service
                                                                                          1. Global Distribution
                                                                                          2. Dataflow
                                                                                            1. Unified Batch and Stream Processing
                                                                                              1. Apache Beam
                                                                                            2. NoSQL Databases
                                                                                              1. Firestore
                                                                                                1. Document Database
                                                                                                2. Bigtable
                                                                                                  1. Wide-Column Database
                                                                                                3. Analytics and ML
                                                                                                  1. Data Studio
                                                                                                    1. Visualization
                                                                                                    2. AI Platform
                                                                                                      1. Machine Learning
                                                                                                  2. Microsoft Azure
                                                                                                    1. Storage
                                                                                                      1. Azure Data Lake Storage
                                                                                                        1. Hierarchical Namespace
                                                                                                          1. Security Features
                                                                                                            1. Gen1 vs Gen2
                                                                                                            2. Blob Storage
                                                                                                              1. Object Storage
                                                                                                            3. Managed Hadoop/Spark
                                                                                                              1. HDInsight
                                                                                                                1. Cluster Types
                                                                                                                  1. Integration with Azure Services
                                                                                                                    1. Enterprise Security Package
                                                                                                                  2. Data Analytics
                                                                                                                    1. Azure Synapse Analytics
                                                                                                                      1. Data Integration
                                                                                                                        1. Analytics Workspaces
                                                                                                                          1. SQL Pools
                                                                                                                            1. Spark Pools
                                                                                                                          2. Stream Processing
                                                                                                                            1. Event Hubs
                                                                                                                              1. Event Ingestion
                                                                                                                                1. Kafka Compatibility
                                                                                                                                2. Stream Analytics
                                                                                                                                  1. Real-Time Analytics
                                                                                                                                    1. SQL-Based Processing
                                                                                                                                  2. NoSQL Databases
                                                                                                                                    1. Cosmos DB
                                                                                                                                      1. Multi-Model Database
                                                                                                                                        1. Global Distribution
                                                                                                                                      2. Machine Learning
                                                                                                                                        1. Azure Machine Learning
                                                                                                                                          1. Model Development
                                                                                                                                            1. MLOps