Useful Links
Computer Science
Big Data
Apache Airflow
1. Introduction to Apache Airflow
2. Core Concepts of Airflow
3. Airflow Architecture and Components
4. Setting Up an Airflow Environment
5. Authoring Your First DAG
6. Comprehensive Guide to Operators
7. Managing Workflows with the Airflow UI
8. Scheduling and Triggers
9. Data Sharing and Communication
10. Advanced DAG Authoring Techniques
11. Airflow Providers and Extensibility
12. Testing and Debugging Airflow DAGs
13. Operational Airflow Management
14. Scaling and Production Deployment
15. Airflow Best Practices and Patterns
Airflow Architecture and Components
Overall Architecture Overview
Multi-Component Architecture
Communication Between Components
Deployment Patterns
The Metastore Database
Role in State Management
Storing DAG Metadata
Storing Task and DAG Run States
Connection and Variable Storage
Supported Database Backends
SQLite
PostgreSQL
MySQL
SQL Server
Database Schema Overview
The Webserver
User Interface Overview
Web Framework and Technology Stack
Navigating the UI
REST API Endpoints
Authentication and Authorization
Configuration Options
The Scheduler
Core Scheduling Logic
Parsing and Loading DAGs
DAG Bag Management
Triggering DAG Runs
Task Scheduling Logic
Handling Backfills
Scheduler Performance Considerations
Executors
Role in Task Execution
Executor Selection Criteria
Types of Executors
SequentialExecutor
LocalExecutor
CeleryExecutor
KubernetesExecutor
CeleryKubernetesExecutor
DebugExecutor
Executor Configuration
Executor Comparison and Use Cases
The Triggerer
Asynchronous Task Execution Model
Event-driven Task Handling
Use Cases for Deferrable Operators
Resource Efficiency Benefits
Configuration and Setup
Workers
Worker Processes in Different Executors
Worker Configuration
Worker Scaling Strategies
Previous
2. Core Concepts of Airflow
Go to top
Next
4. Setting Up an Airflow Environment