Big Data Technologies

  1. NoSQL Databases
    1. Introduction to NoSQL
      1. Contrasting with Relational (SQL) Databases
        1. Schema Flexibility
          1. Scalability Differences
            1. ACID vs. BASE
            2. BASE Properties
              1. Basically Available
                1. Soft State
                  1. Eventual Consistency
                  2. Data Models
                    1. Key-Value
                      1. Document
                        1. Column-Family
                          1. Graph
                          2. NoSQL Use Cases
                            1. Web Applications
                              1. Content Management
                                1. Real-Time Analytics
                                  1. IoT Applications
                                2. Key-Value Stores
                                  1. Concept and Use Cases
                                    1. Caching
                                      1. Session Management
                                        1. Shopping Carts
                                          1. User Preferences
                                          2. Data Model Characteristics
                                            1. Simple Key-Value Pairs
                                              1. No Complex Queries
                                                1. High Performance
                                                2. Example Technologies
                                                  1. Redis
                                                    1. In-Memory Storage
                                                      1. Data Structures
                                                        1. Persistence Options
                                                          1. Clustering
                                                          2. Amazon DynamoDB
                                                            1. Managed Service
                                                              1. Auto-Scaling
                                                                1. Global Tables
                                                                2. Riak
                                                                  1. Distributed Architecture
                                                                    1. Consistency Models
                                                                      1. Multi-Datacenter Replication
                                                                  2. Document Stores
                                                                    1. Concept and Use Cases
                                                                      1. Content Management
                                                                        1. Catalogs
                                                                          1. User Profiles
                                                                            1. Configuration Management
                                                                            2. Data Format
                                                                              1. JSON
                                                                                1. BSON
                                                                                  1. XML
                                                                                  2. Document Model Features
                                                                                    1. Nested Documents
                                                                                      1. Arrays
                                                                                        1. Schema Evolution
                                                                                        2. Example Technologies
                                                                                          1. MongoDB
                                                                                            1. Aggregation Framework
                                                                                              1. Sharding
                                                                                                1. Replica Sets
                                                                                                  1. GridFS
                                                                                                  2. Couchbase
                                                                                                    1. Memory-First Architecture
                                                                                                      1. Indexing
                                                                                                        1. N1QL Query Language
                                                                                                        2. Amazon DocumentDB
                                                                                                          1. MongoDB Compatibility
                                                                                                            1. Managed Service
                                                                                                        3. Column-Family Stores
                                                                                                          1. Concept and Use Cases
                                                                                                            1. Time-Series Data
                                                                                                              1. Analytical Workloads
                                                                                                                1. IoT Data Storage
                                                                                                                  1. Log Data
                                                                                                                  2. Architecture
                                                                                                                    1. Column Families
                                                                                                                      1. Rows
                                                                                                                        1. Timestamps
                                                                                                                          1. Wide Rows
                                                                                                                          2. Data Model Characteristics
                                                                                                                            1. Sparse Data
                                                                                                                              1. Flexible Schema
                                                                                                                                1. Efficient Compression
                                                                                                                                2. Example Technologies
                                                                                                                                  1. Apache Cassandra
                                                                                                                                    1. Peer-to-Peer Architecture
                                                                                                                                      1. Tunable Consistency
                                                                                                                                        1. CQL Query Language
                                                                                                                                          1. Multi-Datacenter Replication
                                                                                                                                          2. Apache HBase
                                                                                                                                            1. HDFS Integration
                                                                                                                                              1. Region Servers
                                                                                                                                                1. Strong Consistency
                                                                                                                                                  1. Hadoop Ecosystem Integration
                                                                                                                                                  2. Amazon SimpleDB
                                                                                                                                                    1. Managed Service
                                                                                                                                                      1. Automatic Indexing
                                                                                                                                                  3. Graph Databases
                                                                                                                                                    1. Concept and Use Cases
                                                                                                                                                      1. Social Networks
                                                                                                                                                        1. Recommendation Engines
                                                                                                                                                          1. Fraud Detection
                                                                                                                                                            1. Knowledge Graphs
                                                                                                                                                            2. Data Model
                                                                                                                                                              1. Nodes
                                                                                                                                                                1. Edges
                                                                                                                                                                  1. Labels
                                                                                                                                                                  2. Graph Algorithms
                                                                                                                                                                    1. Shortest Path
                                                                                                                                                                      1. Centrality Measures
                                                                                                                                                                        1. Community Detection
                                                                                                                                                                        2. Example Technologies
                                                                                                                                                                          1. Neo4j
                                                                                                                                                                            1. Cypher Query Language
                                                                                                                                                                              1. ACID Compliance
                                                                                                                                                                                1. Graph Algorithms Library
                                                                                                                                                                                2. Amazon Neptune
                                                                                                                                                                                  1. Multi-Model Support
                                                                                                                                                                                    1. Integration with AWS Services
                                                                                                                                                                                      1. Gremlin and SPARQL Support
                                                                                                                                                                                      2. ArangoDB
                                                                                                                                                                                        1. Multi-Model Database
                                                                                                                                                                                          1. AQL Query Language