Useful Links
Computer Science
Data Science
Data Engineering
1. Introduction to Data Engineering
2. Foundational Programming Skills
3. Computer Science and Software Engineering Foundations
4. Database Systems and Data Storage
5. Data Warehousing and Analytics
6. Modern Data Storage Architectures
7. Batch Data Processing Systems
8. Stream Processing and Real-Time Data
9. Data Pipeline Architecture and Orchestration
10. Cloud Data Engineering Platforms
11. Data Operations and Infrastructure Management
12. Data Governance, Quality, and Security
13. Advanced Data Engineering Topics
Computer Science and Software Engineering Foundations
Data Structures
Arrays and Dynamic Arrays
Linked Lists
Singly Linked Lists
Doubly Linked Lists
Stacks
Stack Operations
Stack Applications
Queues
Queue Operations
Priority Queues
Trees
Binary Trees
Binary Search Trees
Tree Traversal Methods
Graphs
Graph Representation
Graph Traversal Algorithms
Hash Tables
Hash Functions
Collision Resolution
Hash Table Performance
Algorithms and Complexity Analysis
Sorting Algorithms
Bubble Sort
Selection Sort
Insertion Sort
Merge Sort
Quick Sort
Heap Sort
Searching Algorithms
Linear Search
Binary Search
Hash-based Search
Algorithm Complexity
Big O Notation
Time Complexity Analysis
Space Complexity Analysis
Best, Average, and Worst Case Analysis
Recursion and Iteration
Recursive Problem Solving
Iterative Solutions
Tail Recursion
Dynamic Programming Basics
Networking Fundamentals
HTTP Protocol
HTTP Methods
Status Codes
Headers and Cookies
HTTPS and SSL/TLS
TCP/IP Fundamentals
Network Layers
IP Addressing
Port Numbers
Network Troubleshooting
RESTful Web Services
REST Principles
Resource Identification
HTTP Verbs in REST
API Design Best Practices
Operating Systems Concepts
Linux/Unix File Systems
File System Hierarchy
File Permissions
File System Types
Disk Management
Process Management
Process Creation and Termination
Process Scheduling
Inter-process Communication
Process Monitoring
User and Permission Management
User Accounts and Groups
File and Directory Permissions
Sudo and Root Access
Security Best Practices
Software Engineering Principles
Version Control with Git
Git Fundamentals
Repository Management
Branching Strategies
Merging and Conflict Resolution
Remote Repositories
Pull Requests and Code Reviews
Git Workflow Patterns
Code Organization and Modularity
Function Design Principles
Module Structure
Package Management
Code Reusability Patterns
Dependency Management
Object-Oriented Programming Concepts
Classes and Objects
Inheritance
Encapsulation
Polymorphism
Design Patterns for Data Engineering
Testing and Quality Assurance
Unit Testing Principles
Test-Driven Development
Integration Testing
Mock Objects and Stubs
Testing Frameworks
Code Coverage Analysis
Debugging and Troubleshooting
Debugging Strategies
Logging Best Practices
Error Handling Patterns
Performance Profiling
Debugging Tools and IDEs
Documentation and Communication
Code Comments and Docstrings
README File Creation
API Documentation
Technical Writing
Code Review Practices
Previous
2. Foundational Programming Skills
Go to top
Next
4. Database Systems and Data Storage