CHANGELOG.md: 42 additions & 6 deletions
@@ -2,8 +2,49 @@
 
 Full documentation for MIGraphX is available at [MIGraphX Documentation](https://rocmdocs.amd.com/projects/AMDMIGraphX/en/latest/).
 
-## MIGraphX 2.5 for ROCm 5.5.0
+## MIGraphX 2.7 for ROCm 5.7.0
+### Added
+- Enabled hipRTC to not require dev packages for the MIGraphX runtime and to allow the ROCm install to be in a different directory than it was at build time
+- Added support for multi-target execution
+- Added Dynamic Batch support with C++/Python APIs
+- Added migraphx.create_argument to the Python API
+- Added a Dockerfile example for Ubuntu 22.04
+- Added TensorFlow supported ops to the driver, similar to the existing ONNX operator list
+- Added a MIGRAPHX_TRACE_MATCHES_FOR env variable to filter the matcher trace
+- Improved debugging by printing max, min, mean, and stddev values for TRACE_EVAL = 2
+- Use the fast_math flag instead of an ENV flag for GELU
+- Print a message from the driver if offload copy is set for a compiled program
+### Optimizations
+- Optimized for ONNX Runtime 1.14.0
+- Improved compile times by only building for the GPU on the system
+- Improved performance of pointwise/reduction kernels when using NHWC layouts
+- Load a specific version of the migraphx_py library
+- Annotate functions with the block size so the compiler can do a better job of optimizing
+- Enable reshape on nonstandard shapes
+- Use half HIP APIs to compute max and min
+- Added support for broadcasted scalars to the unsqueeze operator
+- Improved multiplies with the dot operator
+- Handle broadcasts across dot and concat
+- Add a verify namespace for better symbol resolution
+### Fixed
+- Resolved accuracy issues with FP16 ResNet50
+- Update the C++ generator to handle inf from float
+- Fix assertion error during verify and make DCE work with tuples
+- Fix convert operation for NaNs
+- Fix shape typo in API test
+- Fix compile warnings for shadowing variable names
+- Add the missing specialization of the hash function for `nullptr`
+### Changed
+- Bumped version of half library to 5.6.0
+- Bumped CI to support ROCm 5.6
+- Make building tests optional
+- Replace np.bool with bool as per NumPy's request
+### Removed
+- Removed int8x4 rocBLAS calls due to deprecation
+- Removed std::reduce usage since not all OSes support it
+
 
+## MIGraphX 2.5 for ROCm 5.5.0
 ### Added
 - Y-Model feature to store tuning information with the optimized model
 - Added Python 3.10 bindings
@@ -12,15 +53,11 @@ Full documentation for MIGraphX is available at [MIGraphX Documentation](https:/
 - Build support for ROCm MLIR
 - Added migraphx-driver flag to print optimizations in python (--python)
 - Added JIT implementation of the Gather and Pad operator which results in better handling of larger tensor sizes.
-
-
 ### Optimizations
 - Improved performance of Transformer based models
 - Improved performance of the Pad, Concat, Gather, and Pointwise operators
 - Improved onnx/pb file loading speed
 - Added general optimize pass which runs several passes such as simplify_reshapes/algebra and DCE in loop.
-
-
 ### Fixed
 - Improved parsing Tensorflow Protobuf files
 - Resolved various accuracy issues with some onnx models
@@ -29,6 +66,5 @@ Full documentation for MIGraphX is available at [MIGraphX Documentation](https:/
 - Use --offload-arch instead of --cuda-gpu-arch for the HIP compiler
 - Changes inside JIT to use float accumulator for large reduce ops of half type to avoid overflow.
 - Changes inside JIT to temporarily use cosine to compute sine function.
-
 ### Changed
 - Changed version/location of 3rd party build dependencies to pick up fixes