Useful Links
Computer Science
Data Science
Data Engineering
1. Introduction to Data Engineering
2. Foundational Programming Skills
3. Computer Science and Software Engineering Foundations
4. Database Systems and Data Storage
5. Data Warehousing and Analytics
6. Modern Data Storage Architectures
7. Batch Data Processing Systems
8. Stream Processing and Real-Time Data
9. Data Pipeline Architecture and Orchestration
10. Cloud Data Engineering Platforms
11. Data Operations and Infrastructure Management
12. Data Governance, Quality, and Security
13. Advanced Data Engineering Topics
Data Pipeline Architecture and Orchestration
Pipeline Design Patterns
Data Ingestion Patterns
Batch Ingestion Workflows
Real-Time Ingestion Streams
Hybrid Ingestion Approaches
Change Data Capture Implementation
Data Processing Patterns
Linear Processing Pipelines
Fan-Out Processing Patterns
Fan-In Aggregation Patterns
Lambda Architecture
Kappa Architecture
Data Quality Patterns
Data Validation Rules
Data Profiling Integration
Anomaly Detection Workflows
Data Lineage Tracking
Pipeline Orchestration Concepts
Directed Acyclic Graphs
Task Dependency Modeling
Parallel Execution Paths
Critical Path Analysis
Scheduling Strategies
Time-Based Scheduling
Event-Driven Triggers
Dependency-Based Execution
Resource-Aware Scheduling
Error Handling and Recovery
Retry Mechanisms
Circuit Breaker Patterns
Dead Letter Queues
Rollback Strategies
Apache Airflow
DAG Development
Python-Based DAG Definition
Task Operators
Task Dependencies
Dynamic DAG Generation
Airflow Components
Scheduler Service
Executor Types
Web Server Interface
Metadata Database
Monitoring and Operations
Task Instance Monitoring
Log Management
Alert Configuration
Performance Metrics
Alternative Orchestration Tools
Prefect Framework
Flow and Task Concepts
Functional API Design
Cloud Execution Options
Dagster Platform
Software-Defined Assets
Type System Integration
Data Quality Testing
Tool Selection Criteria
Ease of Development
Operational Requirements
Scalability Needs
Community and Support
Previous
8. Stream Processing and Real-Time Data
Go to top
Next
10. Cloud Data Engineering Platforms