Skip to content

Commit 55a7a4a

Browse files
authored
[SYCL][UR][Bench] Run less variants of MemcpyExecute (#17456)
to speed up the benchmarks. Performance difference between v1 and v2 is represented best by the remaining variants.
1 parent 397cf9f commit 55a7a4a

File tree

1 file changed

+1
-7
lines changed
  • unified-runtime/scripts/benchmarks/benches

1 file changed

+1
-7
lines changed

unified-runtime/scripts/benchmarks/benches/compute.py

Lines changed: 1 addition & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -74,14 +74,8 @@ def benchmarks(self) -> list[Benchmark]:
7474
ExecImmediateCopyQueue(self, 1, 1, "Device", "Host", 1024),
7575
VectorSum(self),
7676
MemcpyExecute(self, 400, 1, 102400, 10, 1, 1, 1),
77-
MemcpyExecute(self, 100, 8, 102400, 10, 1, 1, 1),
78-
MemcpyExecute(self, 400, 8, 1024, 1000, 1, 1, 1),
79-
MemcpyExecute(self, 10, 16, 1024, 10000, 1, 1, 1),
77+
MemcpyExecute(self, 400, 8, 1024, 100, 1, 1, 1),
8078
MemcpyExecute(self, 400, 1, 102400, 10, 0, 1, 1),
81-
MemcpyExecute(self, 100, 8, 102400, 10, 0, 1, 1),
82-
MemcpyExecute(self, 400, 8, 1024, 1000, 0, 1, 1),
83-
MemcpyExecute(self, 10, 16, 1024, 10000, 0, 1, 1),
84-
MemcpyExecute(self, 4096, 1, 1024, 10, 0, 1, 0),
8579
MemcpyExecute(self, 4096, 4, 1024, 10, 0, 1, 0),
8680
GraphApiSinKernelGraph(self, RUNTIMES.SYCL, 0, 5),
8781
GraphApiSinKernelGraph(self, RUNTIMES.SYCL, 1, 5),

0 commit comments

Comments
 (0)