Useful Links
Computer Science
Big Data
Apache Hadoop
1. Introduction to Big Data and Hadoop
2. Hadoop Architecture Overview
3. Hadoop Distributed File System (HDFS)
4. Yet Another Resource Negotiator (YARN)
5. MapReduce Programming Model
6. Hadoop Ecosystem Tools
7. Hadoop Administration
8. Advanced Hadoop Topics
Hadoop Ecosystem Tools
Data Ingestion Tools
Apache Sqoop
Relational Database Integration
Import Operations
Export Operations
Incremental Imports
Command Structure
Apache Flume
Log Data Collection
Streaming Data Ingestion
Agent Architecture
Source Components
Channel Components
Sink Components
Reliability Features
Data Processing and Querying
Apache Hive
SQL-like Interface
HiveQL Language
Metastore Service
Table Management
Partitioning Strategies
Bucketing Concepts
SerDe Framework
Apache Pig
Pig Latin Language
Data Flow Programming
Execution Modes
Built-in Functions
User-Defined Functions
NoSQL Database Integration
Apache HBase
Column-Family Model
Real-Time Access
HBase Architecture
HMaster Functions
RegionServer Operations
Data Model Concepts
Table Design Principles
Workflow Management
Apache Oozie
Workflow Scheduling
Workflow Definition
Coordinator Jobs
Bundle Jobs
Action Types
Dependency Management
Apache ZooKeeper
Coordination Service
Configuration Management
Naming Service
Synchronization Primitives
Leader Election
Cluster Management
Apache Ambari
Cluster Provisioning
Service Management
Configuration Management
Monitoring Dashboard
Alert Management
REST API
Previous
5. MapReduce Programming Model
Go to top
Next
7. Hadoop Administration