Useful Links
Computer Science
Big Data
Apache Airflow
1. Introduction to Apache Airflow
2. Core Concepts of Airflow
3. Airflow Architecture and Components
4. Setting Up an Airflow Environment
5. Authoring Your First DAG
6. Comprehensive Guide to Operators
7. Managing Workflows with the Airflow UI
8. Scheduling and Triggers
9. Data Sharing and Communication
10. Advanced DAG Authoring Techniques
11. Airflow Providers and Extensibility
12. Testing and Debugging Airflow DAGs
13. Operational Airflow Management
14. Scaling and Production Deployment
15. Airflow Best Practices and Patterns
Authoring Your First DAG
DAG File Structure and Organization
File Naming Conventions
Directory Structure
Python Module Organization
Basic DAG Definition
Required Imports
Instantiating the DAG Object
DAG Parameters
Setting Default Arguments
owner
start_date
retries
retry_delay
email_on_failure
email_on_retry
Defining Tasks with Operators
BashOperator
Syntax and Parameters
Command Execution
Environment Variables
PythonOperator
Syntax and Parameters
Function Definition
Passing Arguments
Task Naming Conventions
Task Configuration
Setting Task Dependencies
Using set_upstream and set_downstream
Using Bitshift Operators
Chaining Dependencies for Linear Workflows
Complex Dependency Patterns
Fan-out and Fan-in Patterns
DAG Testing and Validation
Syntax Validation
Import Testing
Dependency Validation
Running Your First DAG
Manual Triggers via UI
Manual Triggers via CLI
Scheduled Triggers
Monitoring DAG Runs
Interpreting Results
Previous
4. Setting Up an Airflow Environment
Go to top
Next
6. Comprehensive Guide to Operators