Commit 8d3d29e

Merge pull request #2049 from Celelibi/fix_crash_sgemm_sse_x64
Fix crash in sgemm SSE/nano kernel on x86_64
2 parents db3dc9e + b7f59da commit 8d3d29e

2 files changed: 2 additions, 2 deletions

kernel/x86_64/gemm_kernel_4x8_nano.S

Lines changed: 1 addition & 1 deletion

@@ -135,7 +135,7 @@
 #endif
 
 	movq	%rsp, %rbx	# save old stack
-	subq	$128 + LOCAL_BUFFER_SIZE, %rsp
+	subq	$256 + LOCAL_BUFFER_SIZE, %rsp
 	andq	$-4096, %rsp	# align stack
 
 	STACK_TOUCHING

kernel/x86_64/gemm_kernel_8x4_sse.S

Lines changed: 1 addition & 1 deletion

@@ -383,7 +383,7 @@
 	EMMS
 
 	movq	%rsp, %rbx	# save old stack
-	subq	$128 + LOCAL_BUFFER_SIZE, %rsp
+	subq	$256 + LOCAL_BUFFER_SIZE, %rsp
 	andq	$-4096, %rsp	# align stack
 
 	STACK_TOUCHING
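
Both hunks change the same three-instruction prologue: save the old stack pointer, subtract a fixed headroom plus LOCAL_BUFFER_SIZE, then round the result down to a 4096-byte boundary. The C sketch below models only that arithmetic, to show how many bytes are guaranteed to lie between the new and the old stack pointer. The names `reserve_and_align`, `reserved_bytes` and `EXAMPLE_LOCAL_BUFFER_SIZE` are hypothetical, and the buffer size used is an illustrative assumption, since the real LOCAL_BUFFER_SIZE is not part of this diff. It does not attempt to explain which stores in the kernels needed the extra 128 bytes.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical value for illustration only; the real LOCAL_BUFFER_SIZE
 * is defined elsewhere in the kernel sources and is not shown in this diff. */
#define EXAMPLE_LOCAL_BUFFER_SIZE 4096u

/* Models:  subq $headroom + LOCAL_BUFFER_SIZE, %rsp
 *          andq $-4096, %rsp
 * and returns the resulting stack pointer value. */
static uint64_t reserve_and_align(uint64_t rsp, uint64_t headroom,
                                  uint64_t buffer_size)
{
    uint64_t new_rsp = rsp - (headroom + buffer_size);
    new_rsp &= ~(uint64_t)4095;      /* same bit mask as andq $-4096 */
    assert(new_rsp % 4096 == 0);     /* result is page-aligned */
    return new_rsp;
}

/* Bytes between the aligned stack pointer and the saved one (%rbx). */
static uint64_t reserved_bytes(uint64_t rsp, uint64_t headroom,
                               uint64_t buffer_size)
{
    return rsp - reserve_and_align(rsp, headroom, buffer_size);
}
```

Because the alignment step only ever moves the pointer down, the reserved region is at least headroom + buffer_size bytes, and exactly that much when the subtraction already lands on a 4096-byte boundary. In that worst case, changing the headroom from 128 to 256 is the only thing that enlarges the region, and the alignment step then rounds down to the next boundary.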
