AMD GPU Programming Training Course

ROCm is an open-source platform for GPU programming on AMD GPUs. It supports OpenCL and, through HIP, provides a porting path for CUDA code. ROCm exposes hardware details to the programmer, giving full control over how work is parallelised. That control comes at a cost: it requires a solid understanding of device architecture, memory models, execution models, and optimisation techniques.

HIP is a C++ runtime API and kernel language that enables you to write portable code capable of running on both AMD and NVIDIA GPUs. HIP provides a lightweight abstraction layer over native GPU APIs such as ROCm and CUDA, allowing you to leverage existing GPU libraries and tools.

This instructor-led, live training (available online or on-site) is designed for beginner- to intermediate-level developers who wish to use ROCm and HIP to program AMD GPUs and exploit their parallelism.

By the end of this training, participants will be able to:

Set up a development environment that includes the ROCm Platform, an AMD GPU, and Visual Studio Code.
Create a basic ROCm program that performs vector addition on the GPU and retrieves results from GPU memory.
Use the ROCm API to query device information, allocate and deallocate device memory, copy data between host and device, launch kernels, and synchronise threads.
Use the HIP language to write kernels that execute on the GPU and manipulate data.
Utilise HIP built-in functions, variables, and libraries to perform common tasks and operations.
Apply ROCm and HIP memory spaces—such as global, shared, constant, and local—to optimise data transfers and memory accesses.
Leverage ROCm and HIP execution models to control the threads, blocks, and grids that define parallelism.
Debug and test ROCm and HIP programs using tools such as the ROCm debugger (ROCgdb) and the ROCm profiler (rocprof).
Optimise ROCm and HIP programs using techniques like coalescing, caching, prefetching, and profiling.
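
To give a feel for the vector-addition exercise and the thread, block, and grid model listed above, here is a minimal sketch in plain C++ that emulates a one-dimensional kernel launch on the CPU. It is not HIP code and is not taken from the course material: the `Dim` struct and the functions `vector_add_kernel` and `vector_add_emulated` are hypothetical names introduced for illustration, and the two nested loops stand in for what the GPU would run in parallel. The parameter names `blockIdx`, `blockDim`, and `threadIdx` mirror the HIP built-in variables of the same names.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the GPU's built-in index types (x component only).
struct Dim {
    std::size_t x;
};

// "Kernel" body: each logical thread computes at most one output element.
void vector_add_kernel(const float* a, const float* b, float* c, std::size_t n,
                       Dim blockIdx, Dim blockDim, Dim threadIdx) {
    // Global index of this thread within the whole grid.
    const std::size_t i = blockIdx.x * blockDim.x + threadIdx.x;
    // Bounds guard: the last block may contain threads past the end of the data.
    if (i < n) {
        c[i] = a[i] + b[i];
    }
}

// CPU emulation of a 1-D launch: iterate over every block and every thread in it.
std::vector<float> vector_add_emulated(const std::vector<float>& a,
                                       const std::vector<float>& b,
                                       std::size_t threads_per_block = 256) {
    assert(a.size() == b.size());
    assert(threads_per_block > 0);

    const std::size_t n = a.size();
    std::vector<float> c(n, 0.0f);

    // Round up so that blocks * threads_per_block >= n.
    const std::size_t blocks = (n + threads_per_block - 1) / threads_per_block;

    for (std::size_t block = 0; block < blocks; ++block) {
        for (std::size_t thread = 0; thread < threads_per_block; ++thread) {
            vector_add_kernel(a.data(), b.data(), c.data(), n,
                              Dim{block}, Dim{threads_per_block}, Dim{thread});
        }
    }
    return c;
}
```

Calling `vector_add_emulated` on two five-element vectors with two threads per block launches three blocks; the sixth logical thread falls outside the data and is skipped by the bounds guard. In a real HIP program the loops would be replaced by a kernel launch, and the input and output buffers would be allocated in device memory and copied to and from the host.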

Course Format

Interactive lectures and discussions.
Plenty of exercises and hands-on practice.
Hands-on implementation in a live-lab environment.

Course Customisation Options

To request a customised version of this training, please contact us.

28 hours

Wellington, Plimmer Towers

10880 NZD (Online)

25760 NZD (Classroom)
