Commit e5237a9

Update CHANGELOG to describe ROCm 5.7 contribution (#2075) (#2083)
1 parent 9fee723 commit e5237a9

1 file changed: +42 -6 lines changed

CHANGELOG.md

Lines changed: 42 additions & 6 deletions
@@ -2,8 +2,49 @@

 Full documentation for MIGraphX is available at [MIGraphX Documentation](https://rocmdocs.amd.com/projects/AMDMIGraphX/en/latest/).

-## MIGraphX 2.5 for ROCm 5.5.0
+## MIGraphX 2.7 for ROCm 5.7.0
+### Added
+- Enabled hipRTC to not require dev packages for the MIGraphX runtime, and allowed the ROCm install to be in a different directory than it was at build time
+- Added support for multi-target execution
+- Added Dynamic Batch support with C++/Python APIs
+- Added migraphx.create_argument to the Python API
+- Added Dockerfile example for Ubuntu 22.04
+- Added TensorFlow supported ops in the driver, similar to the existing ONNX operator list
+- Added a MIGRAPHX_TRACE_MATCHES_FOR env variable to filter the matcher trace
+- Improved debugging by printing max, min, mean, and stddev values for TRACE_EVAL = 2
+- Use the fast_math flag instead of the ENV flag for GELU
+- Print a message from the driver if offload copy is set for a compiled program
### Optimizations
18+
- Optimized for ONNX Runtime 1.14.0
19+
- Improved compile times by only building for the GPU on the system
20+
- Improve performance of pointwise/reduction kernels when using NHWC layouts
21+
- Load specific version of the migraphx_py library
22+
- Annotate functions with the block size so the compiler can do a better job of optimizing
23+
- Enable reshape on nonstandard shapes
24+
- Use half HIP APIs to compute max and min
25+
- Added support for broadcasted scalars to unsqueeze operator
26+
- Improved multiplies with dot operator
27+
- Handle broadcasts across dot and concat
28+
- Add verify namespace for better symbol resolution
29+
### Fixed
30+
- Resolved accuracy issues with FP16 resnet50
31+
- Update cpp generator to handle inf from float
32+
- Fix assertion error during verify and make DCE work with tuples
33+
- Fix convert operation for NaNs
34+
- Fix shape typo in API test
35+
- Fix compile warnings for shadowing variable names
36+
- Add missing specialization for the `nullptr` for the hash function
37+
### Changed
38+
- Bumped version of half library to 5.6.0
39+
- Bumped CI to support rocm 5.6
40+
- Make building tests optional
41+
- replace np.bool with bool as per numpy request
42+
### Removed
43+
- Removed int8x4 rocBlas calls due to deprecation
44+
- removed std::reduce usage since not all OS' support it
45+
646

47+
## MIGraphX 2.5 for ROCm 5.5.0
 ### Added
 - Y-Model feature to store tuning information with the optimized model
 - Added Python 3.10 bindings
@@ -12,15 +53,11 @@ Full documentation for MIGraphX is available at [MIGraphX Documentation](https:/
 - Build support for ROCm MLIR
 - Added migraphx-driver flag to print optimizations in Python (--python)
 - Added JIT implementation of the Gather and Pad operators, which results in better handling of larger tensor sizes.
-
-
### Optimizations
1857
- Improved performance of Transformer based models
1958
- Improved performance of the Pad, Concat, Gather, and Pointwise operators
2059
- Improved onnx/pb file loading speed
2160
- Added general optimize pass which runs several passes such as simplify_reshapes/algebra and DCE in loop.
22-
23-
2461
### Fixed
2562
- Improved parsing Tensorflow Protobuf files
2663
- Resolved various accuracy issues with some onnx models
@@ -29,6 +66,5 @@ Full documentation for MIGraphX is available at [MIGraphX Documentation](https:/
 - Use --offload-arch instead of --cuda-gpu-arch for the HIP compiler
 - Changes inside JIT to use float accumulator for large reduce ops of half type to avoid overflow.
 - Changes inside JIT to temporarily use cosine to compute sine function.
-
 ### Changed
 - Changed version/location of 3rd party build dependencies to pick up fixes
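
The 2.5 "Fixed" entry about using a float accumulator for large half-type reduce ops can be illustrated with a small sketch. This is not MIGraphX's JIT code: it only simulates half-precision rounding with Python's `struct` module, and the helper names (`to_half`, `mean_half_accumulator`, `mean_float_accumulator`) are hypothetical. A Python float (a double) stands in for the wider accumulator.

```python
import math
import struct


def to_half(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct rejects finite values beyond the half range (max 65504);
        # this sketch maps them to infinity, as an overflowing half would be.
        return math.copysign(math.inf, x)


def mean_half_accumulator(values):
    """Mean where the running sum is rounded to half after every add."""
    acc = 0.0
    for v in values:
        acc = to_half(acc + to_half(v))
    return to_half(acc / len(values))


def mean_float_accumulator(values):
    """Mean where the running sum stays wide; only the result is half."""
    acc = 0.0
    for v in values:
        acc += to_half(v)
    return to_half(acc / len(values))


values = [100.0] * 1000  # true sum is 100000, above the half maximum of 65504
print(mean_half_accumulator(values))   # inf
print(mean_float_accumulator(values))  # 100.0
```

The half accumulator overflows partway through the sum even though the final mean fits comfortably in half precision, which is the failure mode a wider accumulator avoids.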
