@@ -4,9 +4,9 @@ Open MPI v5.0.x series
4
4
This file contains all the NEWS updates for the Open MPI v5.0.x
5
5
series, in reverse chronological order.
6
6
7
- Open MPI version 5.0.0rc9
8
- -------------------------
9
- :Date: 21 October 2022
7
+ Open MPI version 5.0.0rc10
8
+ --------------------------
9
+ :Date: 27 January 2023
10
10
11
11
.. admonition :: MPIR API has been removed
12
12
:class: warning
@@ -40,23 +40,27 @@ Open MPI version 5.0.0rc9
40
40
libraries, rather than linked into the Open MPI core libraries.
41
41
42
42
43
- - Changes since rc8:
44
-
45
- - Added new ``Accelerator `` gpu framework. ``CUDA `` specific code was replaced with
46
- a generic framework that standardizes various device features such as copies or
47
- pointer type detection. This allows for modularized implementation of various
48
- devices such as the newly introduced ROCm Accelerator component. The redesign
49
- also allows for Open MPI builds to be shipped with ``CUDA `` support enabled
50
- without requiring ``CUDA `` libraries.
51
- - Various bug fixes related to ``MPI Sessions ``.
52
- - autogen.pl: Fixed ifort support on OSX. Thanks to Tien Quang, NGUYEN
53
- for providing the fix.
54
- - Initial implementation of ``MPI_COMM_TYPE_HW_UNGUIDED `` for ``MPI_Comm_split_type ``.
55
- - Fixed a bug where ``MPI_Pack `` and ``MPI_Unpack `` external32 with long doubles
56
- could fail. See #8918 for details.
57
- - Updated MCA mutexes to use the Qthreads user-level-threading backend. Thanks
58
- to Jan Ciesko for the contribution.
59
- - Various other bug fixes and cleanups.
43
+ - Changes since rc9:
44
+
45
+ - coll/ucc: Added support for Scatter and Iscatter collectives.
46
+ - Added cache bypass mechanism to the ``OFI `` BTL. This fixes conflicts
47
+ with Libfabric, which has its own registration cache. This addes a bypass
48
+ flag which can be used for providers known to have their own registration cache.
49
+ - common/ompio: implement pipelined read and write operation.
50
+ This new new code path shows significant performance improvements for reading/writing
51
+ device buffers compared to the previous implementation, and reduces the memory
52
+ footprint of ``OMPIO `` by allocating smaller temporary buffers.
53
+ - 32-bit builds have been disabled. Building Open MPI in a 32-bit environment
54
+ is no longer supported.
55
+ - MPI-4: MPI_Info_get() and MPI_Infi_get_valuelen() are now deprecated.
56
+ - MPI-4: Issue a deprecation warning when MPI_Cancel() is called for a non-blocking send request.
57
+ - Fixed various bugs encountered when running MPI under a debugger.
58
+ - Cleaned up a number of memory leaks.
59
+ - Cleaned up global symbol pollution that was leaking out of libmpi.
60
+ - Removed opal_list_insert(), it was buggy and not used.
61
+ Thanks to Jinyuan Wang for the contribution.
62
+ - Many other bug fixes and cleanups.
63
+ - Various documentation updates.
60
64
61
65
- All other notable updates for v5.0.0:
62
66
@@ -78,6 +82,12 @@ Open MPI version 5.0.0rc9
78
82
this effort.
79
83
- New Thread Local Storage API: Removes global visibility of TLS structures
80
84
and allows for dynamic TLS handling.
85
+ - Added new ``Accelerator `` gpu framework. ``CUDA `` specific code was replaced with
86
+ a generic framework that standardizes various device features such as copies or
87
+ pointer type detection. This allows for modularized implementation of various
88
+ devices such as the newly introduced ROCm Accelerator component. The redesign
89
+ also allows for Open MPI builds to be shipped with ``CUDA `` support enabled
90
+ without requiring ``CUDA `` libraries.
81
91
- Added load-linked, store-conditional atomics support for AArch64.
82
92
- Added atomicity support to the ``ompio `` component.
83
93
- Added support for MPI minimum alignment key to the one-sided ``RDMA `` component.
0 commit comments