Commit 4572066

Create 2024-01-05-preliminary-report-initial-evaluation-of-stdpar-implementations-on-amd-gpus-for-hpc.md
Adds "Preliminary report: Initial evaluation of StdPar implementations on AMD GPUs for HPC"
1 parent 401b6fa commit 4572066

1 file changed: +26 -0 lines changed

@@ -0,0 +1,26 @@
+---
+contributor: max
+date: '2024-01-05T08:08:10.490000+00:00'
+title: 'Preliminary report: Initial evaluation of StdPar implementations on AMD GPUs for HPC'
+external_url: https://arxiv.org/pdf/2401.02680
+authors:
+- name: Wei-Chen Lin
+- name: Simon McIntosh-Smith
+- name: Tom Deakin
+tags:
+- sycl
+- gpu
+- hip
+---
+
+Until recently, AMD platforms did not support offloading C++17 PSTL (StdPar) programs to the GPU. Our previous work highlights
+how StdPar can achieve good performance across NVIDIA and Intel GPU platforms. In that work, we acknowledged AMD’s past
+efforts such as HCC, which unfortunately is deprecated and does not support newer hardware platforms. Recent developments by AMD, Codeplay,
+and AdaptiveCpp (previously known as hipSYCL or OpenSYCL) have enabled multiple paths for StdPar programs to run on AMD GPUs.
+This informal report discusses our experiences with, and evaluation of, currently available StdPar implementations for AMD GPUs.
+We conduct benchmarks using our suite of HPC mini-apps with ports in many heterogeneous programming models, including StdPar.
+We then compare the performance of StdPar, using all available StdPar compilers, to contemporary heterogeneous programming models
+supported on AMD GPUs: HIP, OpenCL, Thrust, Kokkos, OpenMP, and SYCL. Where appropriate, we discuss issues encountered and workarounds
+applied during our evaluation. Finally, the StdPar model discussed in this report largely depends on Unified Shared Memory (USM) performance,
+and very few AMD GPUs have proper support for this feature. As such, this report demonstrates a proof-of-concept host-side userspace page-fault
+solution for models that use the HIP API. We discuss performance improvements achieved with our solution using the same set of benchmarks.
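
For readers unfamiliar with the term, a StdPar program is ordinary C++17 that calls standard algorithms with a parallel execution policy. The sketch below is illustrative only and is not taken from the report or its mini-apps: the function names `triad` and `dot` and their signatures are assumptions made for this example. Whether the algorithms run on a GPU is decided at build time by the chosen StdPar compiler; the source itself stays the same. Note that the data lives in plain `std::vector` host allocations, which is why offloading implementations tend to rely on unified or managed memory.

```cpp
#include <algorithm>
#include <execution>
#include <numeric>
#include <vector>

// Hypothetical STREAM-style triad kernel: a[i] = b[i] + scalar * c[i].
// Precondition: a and c hold at least b.size() elements.
inline void triad(std::vector<double>& a, const std::vector<double>& b,
                  const std::vector<double>& c, double scalar) {
  // par_unseq permits parallel and vectorised execution; a StdPar compiler
  // may map this call to a GPU kernel.
  std::transform(std::execution::par_unseq, b.begin(), b.end(), c.begin(),
                 a.begin(),
                 [scalar](double bi, double ci) { return bi + scalar * ci; });
}

// Hypothetical dot-product kernel: sum over i of a[i] * b[i].
// Precondition: b holds at least a.size() elements.
inline double dot(const std::vector<double>& a, const std::vector<double>& b) {
  // This overload multiplies element pairs and sums the products.
  return std::transform_reduce(std::execution::par_unseq, a.begin(), a.end(),
                               b.begin(), 0.0);
}
```

Calling `triad(a, b, c, 0.5)` on equally sized vectors fills `a` in place, and `dot(b, c)` returns the reduction result. On a CPU-only toolchain the same code still builds, although with GCC's libstdc++ the parallel policies may require linking against Intel TBB.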