GPU Programming
8. Alternative GPU Programming Frameworks
8.1. SYCL and DPC++
8.1.1. Single-Source Programming Model
8.1.1.1. Host and Device Code Integration
8.1.1.2. C++ Template Usage
8.1.1.3. Modern C++ Features
8.1.2. Abstraction Layers
8.1.2.1. Backend Independence
8.1.2.2. CUDA Backend
8.1.2.3. OpenCL Backend
8.1.2.4. CPU Backend
8.1.3. Programming Constructs
8.1.3.1. Queue and Handler Objects
8.1.3.2. Buffer and Accessor Model
8.1.3.3. Parallel Algorithms
8.2. DirectCompute
8.2.1. DirectX Integration
8.2.1.1. Graphics Pipeline Integration
8.2.1.2. Resource Sharing
8.2.1.3. Compute Shader Model
8.2.2. Programming Model
8.2.2.1. HLSL Compute Shaders
8.2.2.2. Thread Group Organization
8.2.2.3. Resource Binding
8.2.3. Performance Considerations
8.2.3.1. GPU Scheduling
8.2.3.2. Memory Management
8.2.3.3. Optimization Techniques
8.3. Vulkan Compute
8.3.1. Low-Level API Design
8.3.1.1. Explicit Control
8.3.1.2. Minimal Driver Overhead
8.3.1.3. Cross-Platform Support
8.3.2. Compute Pipeline
8.3.2.1. Pipeline Creation
8.3.2.2. Descriptor Sets
8.3.2.3. Command Buffer Recording
8.3.3. Advanced Features
8.3.3.1. Multi-Queue Execution
8.3.3.2. Memory Management
8.3.3.3. Synchronization Primitives
8.4. High-Level Frameworks
8.4.1. OpenACC
8.4.1.1. Directive-Based Programming
8.4.1.2. Compiler Pragmas
8.4.1.3. Incremental Parallelization
8.4.2. OpenMP Target Offloading
8.4.2.1. Target Directives
8.4.2.2. Data Mapping
8.4.2.3. Device Selection
8.4.3. Python GPU Libraries
8.4.3.1. PyCUDA
8.4.3.2. Numba
8.4.3.3. CuPy
8.4.3.4. JAX