File tree Expand file tree Collapse file tree 1 file changed +20
-2
lines changed
docs/doc_sources/beginners_guides Expand file tree Collapse file tree 1 file changed +20
-2
lines changed Original file line number Diff line number Diff line change @@ -159,13 +159,31 @@ The following plugins from CodePlay are supported:
159
159
.. _codeplay_nv_plugin : https://developer.codeplay.com/products/oneapi/nvidia/
160
160
.. _codeplay_amd_plugin : https://developer.codeplay.com/products/oneapi/amd/
161
161
162
- ``dpctl `` can be built for CUDA devices as follows:
162
+ ``dpctl `` can be built for CUDA devices using the ``DPCTL_TARGET_CUDA `` CMake option,
163
+ which accepts a specific compute architecture string:
164
+
165
+ .. code-block :: bash
166
+
167
+ python scripts/build_locally.py --verbose --cmake-opts=" -DDPCTL_TARGET_CUDA=sm_80"
168
+
169
+ To use the default architecture (``sm_50 ``), use:
163
170
164
171
.. code-block :: bash
165
172
166
173
python scripts/build_locally.py --verbose --cmake-opts=" -DDPCTL_TARGET_CUDA=ON"
167
174
168
- And for AMD devices
175
+ Note that kernels are built for ``sm_50 `` by default, allowing them to work on a wider
176
+ range of architectures, but limiting the usage of more recent CUDA features.
177
+
178
+ For reference, compute architecture strings like ``sm_80 `` are based on
179
+ CUDA Compute Capability. A complete mapping between NVIDIA GPU models and their
180
+ respective ``sm_XX `` values can be found in the official
181
+ `CUDA GPU Compute Capability <https://developer.nvidia.com/cuda-gpus >`_.
182
+
183
+ A full list of available SYCL alias targets is available in the
184
+ `DPC++ Compiler User Manual <https://intel.github.io/llvm/UsersManual.html >`_.
185
+
186
+ To build for AMD devices, use:
169
187
170
188
.. code-block :: bash
171
189
You can’t perform that action at this time.
0 commit comments