threadgroup_bitonic_sort<T> HLSL

What is it?

This is an implementation of a threadgroup-wide bitonic sort in HLSL.

Purpose

Sometimes it is necessary to sort elements within a thread group on the GPU. The threadgroup_bitonic_sort.hlsli header file provides several variants of bitonic sort that support any power-of-2 thread group size and up to 4096 sortable elements.
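
For readers unfamiliar with the algorithm, the following is a minimal CPU sketch of the classic bitonic sorting network in C++. It is not taken from the header: the function name `bitonic_sort_reference` is hypothetical, and the code only illustrates the compare-exchange pattern that a thread group would execute in parallel, with one barrier between passes.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// CPU reference of the bitonic sorting network (ascending result).
// Illustrative only; this is NOT the code from threadgroup_bitonic_sort.hlsli.
// The element count must be a power of two.
template <typename T>
void bitonic_sort_reference(std::vector<T>& data)
{
    const std::size_t n = data.size();
    assert(n == 0 || (n & (n - 1)) == 0);  // power-of-two precondition

    // k: length of the bitonic sequences being merged in this stage.
    for (std::size_t k = 2; k <= n; k <<= 1) {
        // j: distance between the two elements of each compare-exchange.
        for (std::size_t j = k >> 1; j > 0; j >>= 1) {
            // On a GPU, each iteration of this loop would be handled by a
            // thread, followed by a barrier before the next value of j.
            for (std::size_t i = 0; i < n; ++i) {
                const std::size_t partner = i ^ j;
                if (partner > i) {
                    // Blocks of size k alternate between ascending and descending.
                    const bool ascending = (i & k) == 0;
                    const bool out_of_order = ascending
                        ? (data[partner] < data[i])
                        : (data[i] < data[partner]);
                    if (out_of_order) {
                        std::swap(data[i], data[partner]);
                    }
                }
            }
        }
    }
}
```

Calling `bitonic_sort_reference` on a `std::vector<int>` with 8 elements runs 6 compare-exchange passes (1 + 2 + 3), and in general log2(n) * (log2(n) + 1) / 2 passes for n elements. The fixed, data-independent access pattern is what makes the network a good fit for a thread group.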

Features

It is agnostic of wave/warp sizes
It automatically switches to sorting and shuffling within waves/warps, using wave intrinsics, once the sorted/shuffled blocks become smaller than the wave/warp size in the thread group (see the AMD RGA codegen on godbolt.org)
It supports GPUs without wave intrinsic support
It supports sorting up to 4096 elements within a thread group (sorting 4096 elements requires a thread group of 1024 threads)
For a thread group with N threads, it supports sorting N, N * 2 or N * 4 elements
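
The size constraints listed above can be summarised as a small host-side check. This is a sketch only: `is_supported_sort_config` and `is_power_of_two` are hypothetical helpers, not part of the header or demo.cpp, and the 1024-thread cap reflects the Direct3D limit on threads per group rather than anything defined by the header itself.

```cpp
#include <cstdint>

// Returns true when x is a non-zero power of two.
constexpr bool is_power_of_two(std::uint32_t x)
{
    return x != 0 && (x & (x - 1)) == 0;
}

// Hypothetical helper (not part of threadgroup_bitonic_sort.hlsli):
// checks a (thread count, element count) pair against the constraints
// stated in the feature list.
constexpr bool is_supported_sort_config(std::uint32_t threads,
                                        std::uint32_t elements)
{
    return is_power_of_two(threads)
        && threads <= 1024u                 // Direct3D threads-per-group limit
        && elements <= 4096u                // maximum sortable elements
        && (elements == threads             // N elements
            || elements == threads * 2u     // N * 2 elements
            || elements == threads * 4u);   // N * 4 elements
}
```

For example, 1024 threads with 4096 elements and 64 threads with 128 elements pass the check, while 64 threads with 192 elements or 48 threads with 96 elements do not.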

Building Demo

To build demo.cpp, run build.bat from a Visual Studio Command Prompt. The batch file should automatically download the required packages (D3D12, DXC), then build and run all shader variants as benchmarks.

The header file can be compiled with the DX Compiler (DXC) release of February 2025 or earlier.

License

This header file is available to anybody free of charge, under the terms of the MIT License (see LICENSE.md).

pm4rtx/threadgroup_bitonic_sort_hlsl
