Commit 3599f2d

Merge pull request #4421 from ChipKerchner/power10Copies_DGEMM
Replace two vector loads with one vector pair load and fix endianness of stores - DGEMM PowerPC versions.
2 parents 7a6a246 + 5c5c1a1 commit 3599f2d

3 files changed: +664 -0 lines changed

kernel/power/dgemm_tcopy_16_power8.S

Lines changed: 3 additions & 0 deletions
@@ -107,6 +107,9 @@ USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
 
 #define o0 0
 
+#ifdef POWER10
+#include "dgemm_tcopy_macros_16_power10.S"
+#endif
 #include "dgemm_tcopy_macros_16_power8.S"
 
 #define STACKSIZE 144
