Releases: ROCm/rocPRIM
Releases · ROCm/rocPRIM
rocPRIM 3.1.0 for ROCm 6.1.5
rocPRIM code for ROCm 6.1.5 did not change. The library was rebuilt for the updated ROCm 6.1.5 stack.
rocPRIM 3.1.0 for ROCm 6.1.2
rocPRIM code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.
rocPRIM 3.1.0 for ROCm 6.1.1
rocPRIM code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.
rocPRIM 3.1.0 for ROCm 6.1.0
Additions
- New primitive:
block_run_length_decode
- New primitive:
batch_memcpy
Changes
- Renamed:
scan_config_v2
toscan_config
scan_by_key_config_v2
toscan_by_key_config
radix_sort_config_v2
toradix_sort_config
reduce_by_key_config_v2
toreduce_by_key_config
radix_sort_config_v2
toradix_sort_config
- Removed support for custom config types for device algorithms
host_warp_size()
was moved intorocprim/device/config_types.hpp
; it now uses eitherdevice_id
or
astream
parameter to query the proper device and adevice_id
out parameter- The return type is
hipError_t
- The return type is
- Added support for
__int128_t
indevice_radix_sort
andblock_radix_sort
- Improved the performance of
match_any
, andblock_histogram
which uses it
Deprecations
- Removed
reduce_by_key_config
,MatchAny
,scan_config
,scan_by_key_config
, and
radix_sort_config
Fixes
- Build issues with
rmake.py
on Windows when using VS 2017 15.8 or later (due to a breaking fix with
extended aligned storage)
rocPRIM 3.0.0 for ROCm 6.0.2
rocPRIM code for ROCm 6.0.2 did not change. The library was rebuilt for the updated ROCm 6.0.2 stack.
rocPRIM 3.0.0 for ROCm 6.0.0
Added
block_sort::sort()
overload for keys and values with a dynamic size, for all block sort algorithms. Additionally, allblock_sort::sort()
overloads with a dynamic size are now supported forblock_sort_algorithm::merge_sort
andblock_sort_algorithm::bitonic_sort
.- New two-way partition primitive
partition_two_way
which can write to two separate iterators.
Optimizations
- Improved the performance of
partition
.
Fixed
- Fixed
rocprim::MatchAny
for devices with 64-bit warp size. The functionrocprim::MatchAny
is deprecated androcprim::match_any
is preferred instead.
rocPRIM 2.13.1 for ROCm 5.7.1
rocPRIM code for ROCm 5.7.1 did not change. The library was rebuilt for the updated ROCm 5.7.1 stack.
rocPRIM 2.13.1 for ROCm 5.7.0
Changed
- Deprecated configuration
radix_sort_config
for device-level radix sort as it no longer matches the algorithm's parameters. New configurationradix_sort_config_v2
is preferred instead. - Removed erroneous implementation of device-level
inclusive_scan
andexclusive_scan
. The prior default implementation using lookback-scan now is the only available implementation. - The benchmark metric indicating the bytes processed for
exclusive_scan_by_key
andinclusive_scan_by_key
has been changed to incorporate the key type. Furthermore, the benchmark log has been changed such that these algorithms are reported asscan
andscan_by_key
instead ofscan_exclusive
andscan_inclusive
. - Deprecated configurations
scan_config
andscan_by_key_config
for device-level scans, as they no longer match the algorithm's parameters. New configurationsscan_config_v2
andscan_by_key_config_v2
are preferred instead.
Fixed
- Fixed build issue caused by missing header in
thread/thread_search.hpp
.
rocPRIM 2.13.0 for ROCm 5.6.1
rocPRIM code for ROCm 5.6.1 did not change. The library was rebuilt for the updated ROCm 5.6.1 stack.
rocPRIM 2.13.0 for ROCm 5.6.0
rocPRIM code for ROCm 5.6.0 did not change. The library was rebuilt for the updated ROCm 5.6.0 stack.