File tree 2 files changed +16
-0
lines changed
2 files changed +16
-0
lines changed Original file line number Diff line number Diff line change @@ -21,6 +21,14 @@ CUDA device images can be generated by a CUDA-capable compiler toolchain. Most
21
21
CUDA compiler toolchains are capable of generating PTX, SASS and/or bundles of
22
22
PTX and SASS.
23
23
24
+ When generating device code to be launched using Unified Runtime, it is
25
+ recommended to use a programming model with explicit kernel parameters, such as
26
+ OpenCL or CUDA. This is because kernels generated by a programming model with
27
+ implicit kernel parameters, such as SYCL, cannot guarantee any specific number
28
+ or ordering of kernel parameters. It has been observed that kernel signatures
29
+ for the same SYCL kernel may vary significantly when compiled for different
30
+ architectures.
31
+
24
32
PTX
25
33
---
26
34
Original file line number Diff line number Diff line change @@ -27,6 +27,14 @@ HIPCC can generate device code for a particular arch using the ``--genco`` flag
27
27
28
28
$ hipcc --genco hello.cu --amdgpu-target=gfx906 -o hello.hsaco
29
29
30
+ When generating device code to be launched using Unified Runtime, it is
31
+ recommended to use a programming model with explicit kernel parameters, such as
32
+ OpenCL or HIP. This is because kernels generated by a programming model with
33
+ implicit kernel parameters, such as SYCL, cannot guarantee any specific number
34
+ or ordering of kernel parameters. It has been observed that kernel signatures
35
+ for the same SYCL kernel may vary significantly when compiled for different
36
+ architectures.
37
+
30
38
UR Programs
31
39
===========
32
40
You can’t perform that action at this time.
0 commit comments