Apache Hadoop

1. Introduction to Big Data and Hadoop
2. Hadoop Architecture Overview
3. Hadoop Distributed File System (HDFS)
4. Yet Another Resource Negotiator (YARN)
5. MapReduce Programming Model
6. Hadoop Ecosystem Tools
7. Hadoop Administration
8. Advanced Hadoop Topics

6. Hadoop Ecosystem Tools
  1. Data Ingestion Tools
    1. Apache Sqoop
      1. Relational Database Integration
      2. Import Operations
      3. Export Operations
      4. Incremental Imports
      5. Command Structure
    2. Apache Flume
      1. Log Data Collection
      2. Streaming Data Ingestion
      3. Agent Architecture
      4. Source Components
      5. Channel Components
      6. Sink Components
      7. Reliability Features
  2. Data Processing and Querying
    1. Apache Hive
      1. SQL-like Interface
      2. HiveQL Language
      3. Metastore Service
      4. Table Management
      5. Partitioning Strategies
      6. Bucketing Concepts
      7. SerDe Framework
    2. Apache Pig
      1. Pig Latin Language
      2. Data Flow Programming
      3. Execution Modes
      4. Built-in Functions
      5. User-Defined Functions
  3. NoSQL Database Integration
    1. Apache HBase
      1. Column-Family Model
      2. Real-Time Access
      3. HBase Architecture
      4. HMaster Functions
      5. RegionServer Operations
      6. Data Model Concepts
      7. Table Design Principles
  4. Workflow Management
    1. Apache Oozie
      1. Workflow Scheduling
      2. Workflow Definition
      3. Coordinator Jobs
      4. Bundle Jobs
      5. Action Types
      6. Dependency Management
    2. Apache ZooKeeper
      1. Coordination Service
      2. Configuration Management
      3. Naming Service
      4. Synchronization Primitives
      5. Leader Election
  5. Cluster Management
    1. Apache Ambari
      1. Cluster Provisioning
      2. Service Management
      3. Configuration Management
      4. Monitoring Dashboard
      5. Alert Management
      6. REST API

© 2025 Useful Links. All rights reserved.
