Skip to content

use AirspeedVelocity for benchmark CI #502

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Aug 7, 2024
Merged

use AirspeedVelocity for benchmark CI #502

merged 1 commit into from
Aug 7, 2024

Conversation

vchuravy
Copy link
Member

@vchuravy vchuravy commented Aug 7, 2024

No description provided.

Copy link
Contributor

github-actions bot commented Aug 7, 2024

Benchmark Results

main 19b83c2... main/19b83c272ae7c8...
saxpy/default/Float16/1024 2.79 ± 0.19 μs 2.79 ± 0.19 μs 0.999
saxpy/default/Float16/1048576 2.07 ± 0.012 ms 2.07 ± 0.011 ms 1
saxpy/default/Float16/16384 0.0328 ± 0.00014 ms 0.0328 ± 0.00014 ms 0.999
saxpy/default/Float16/2048 5.21 ± 0.02 μs 5.22 ± 0.022 μs 0.998
saxpy/default/Float16/256 1.7 ± 0.019 μs 1.7 ± 0.017 μs 0.999
saxpy/default/Float16/262144 0.516 ± 0.0013 ms 0.516 ± 0.0014 ms 0.999
saxpy/default/Float16/32768 0.065 ± 0.00017 ms 0.065 ± 0.00016 ms 1
saxpy/default/Float16/4096 10.1 ± 0.05 μs 10.1 ± 0.05 μs 0.997
saxpy/default/Float16/512 2.08 ± 0.041 μs 2.09 ± 0.049 μs 0.995
saxpy/default/Float16/64 1.42 ± 0.0089 μs 1.43 ± 0.009 μs 0.994
saxpy/default/Float16/65536 0.129 ± 0.0003 ms 0.129 ± 0.00026 ms 1
saxpy/default/Float32/1024 1.12 ± 0.13 μs 1.04 ± 0.17 μs 1.08
saxpy/default/Float32/1048576 0.884 ± 0.012 ms 0.885 ± 0.012 ms 0.999
saxpy/default/Float32/16384 14.4 ± 0.11 μs 14.4 ± 0.11 μs 0.998
saxpy/default/Float32/2048 1.73 ± 0.17 μs 1.72 ± 0.17 μs 1.01
saxpy/default/Float32/256 1.23 ± 0.012 μs 1.22 ± 0.013 μs 1.01
saxpy/default/Float32/262144 0.221 ± 0.00065 ms 0.221 ± 0.00066 ms 0.998
saxpy/default/Float32/32768 28.2 ± 0.12 μs 28.2 ± 0.12 μs 1
saxpy/default/Float32/4096 3.04 ± 0.02 μs 3.03 ± 0.02 μs 1
saxpy/default/Float32/512 1.28 ± 0.013 μs 1.27 ± 0.011 μs 1.01
saxpy/default/Float32/64 1.2 ± 0.01 μs 1.19 ± 0.011 μs 1.01
saxpy/default/Float32/65536 0.0558 ± 0.00019 ms 0.0558 ± 0.0002 ms 0.999
saxpy/default/Float64/1024 1.09 ± 0.16 μs 1.07 ± 0.17 μs 1.02
saxpy/default/Float64/1048576 0.999 ± 0.027 ms 0.999 ± 0.029 ms 1
saxpy/default/Float64/16384 16.4 ± 0.47 μs 16.4 ± 0.4 μs 1
saxpy/default/Float64/2048 1.79 ± 0.16 μs 1.78 ± 0.16 μs 1
saxpy/default/Float64/256 1.37 ± 0.011 μs 1.34 ± 0.009 μs 1.02
saxpy/default/Float64/262144 0.244 ± 0.006 ms 0.245 ± 0.0078 ms 0.996
saxpy/default/Float64/32768 0.0319 ± 0.0011 ms 0.0322 ± 0.0011 ms 0.992
saxpy/default/Float64/4096 3.06 ± 0.046 μs 3.05 ± 0.049 μs 1
saxpy/default/Float64/512 1.37 ± 0.013 μs 1.37 ± 0.014 μs 1
saxpy/default/Float64/64 1.35 ± 0.024 μs 1.35 ± 0.018 μs 0.998
saxpy/default/Float64/65536 0.0617 ± 0.0021 ms 0.0618 ± 0.0019 ms 0.998
saxpy/static workgroup=(1024,)/Float16/1024 2.07 ± 0.2 μs 2.09 ± 0.19 μs 0.991
saxpy/static workgroup=(1024,)/Float16/1048576 0.162 ± 0.0084 ms 0.162 ± 0.0058 ms 1
saxpy/static workgroup=(1024,)/Float16/16384 4.31 ± 0.22 μs 4.27 ± 0.22 μs 1.01
saxpy/static workgroup=(1024,)/Float16/2048 2.1 ± 0.2 μs 2.11 ± 0.2 μs 0.997
saxpy/static workgroup=(1024,)/Float16/256 2.65 ± 0.032 μs 2.65 ± 0.031 μs 1
saxpy/static workgroup=(1024,)/Float16/262144 0.0434 ± 0.0019 ms 0.044 ± 0.0025 ms 0.986
saxpy/static workgroup=(1024,)/Float16/32768 6.81 ± 0.3 μs 6.84 ± 0.16 μs 0.996
saxpy/static workgroup=(1024,)/Float16/4096 2.41 ± 0.03 μs 2.42 ± 0.03 μs 0.999
saxpy/static workgroup=(1024,)/Float16/512 3.15 ± 0.1 μs 3.17 ± 0.061 μs 0.995
saxpy/static workgroup=(1024,)/Float16/64 2.26 ± 0.021 μs 2.28 ± 0.021 μs 0.991
saxpy/static workgroup=(1024,)/Float16/65536 12.8 ± 0.46 μs 13 ± 0.45 μs 0.987
saxpy/static workgroup=(1024,)/Float32/1024 1.93 ± 0.021 μs 1.94 ± 0.026 μs 0.993
saxpy/static workgroup=(1024,)/Float32/1048576 0.249 ± 0.012 ms 0.251 ± 0.0096 ms 0.992
saxpy/static workgroup=(1024,)/Float32/16384 4.84 ± 0.41 μs 4.82 ± 0.24 μs 1
saxpy/static workgroup=(1024,)/Float32/2048 2.25 ± 0.22 μs 2.28 ± 0.22 μs 0.989
saxpy/static workgroup=(1024,)/Float32/256 2.7 ± 0.91 μs 2.72 ± 1.6 μs 0.995
saxpy/static workgroup=(1024,)/Float32/262144 0.0626 ± 0.0059 ms 0.0631 ± 0.006 ms 0.991
saxpy/static workgroup=(1024,)/Float32/32768 8.08 ± 1.2 μs 8.42 ± 0.73 μs 0.959
saxpy/static workgroup=(1024,)/Float32/4096 2.42 ± 0.19 μs 2.4 ± 0.19 μs 1.01
saxpy/static workgroup=(1024,)/Float32/512 2.46 ± 0.21 μs 2.45 ± 0.21 μs 1
saxpy/static workgroup=(1024,)/Float32/64 2.43 ± 0.049 μs 2.44 ± 0.048 μs 0.996
saxpy/static workgroup=(1024,)/Float32/65536 17.4 ± 3.5 μs 17.1 ± 2.5 μs 1.02
saxpy/static workgroup=(1024,)/Float64/1024 2.06 ± 0.038 μs 2.02 ± 0.069 μs 1.02
saxpy/static workgroup=(1024,)/Float64/1048576 0.474 ± 0.052 ms 0.455 ± 0.052 ms 1.04
saxpy/static workgroup=(1024,)/Float64/16384 8.32 ± 0.81 μs 8.39 ± 0.66 μs 0.992
saxpy/static workgroup=(1024,)/Float64/2048 2.54 ± 0.26 μs 2.5 ± 0.24 μs 1.02
saxpy/static workgroup=(1024,)/Float64/256 2.4 ± 0.042 μs 2.41 ± 0.048 μs 0.996
saxpy/static workgroup=(1024,)/Float64/262144 0.106 ± 0.011 ms 0.106 ± 0.012 ms 0.996
saxpy/static workgroup=(1024,)/Float64/32768 16.7 ± 3.2 μs 16.6 ± 2.3 μs 1.01
saxpy/static workgroup=(1024,)/Float64/4096 3.13 ± 0.35 μs 2.98 ± 0.3 μs 1.05
saxpy/static workgroup=(1024,)/Float64/512 2.42 ± 0.061 μs 2.39 ± 0.039 μs 1.01
saxpy/static workgroup=(1024,)/Float64/64 2.38 ± 0.075 μs 2.38 ± 0.072 μs 1
saxpy/static workgroup=(1024,)/Float64/65536 31 ± 3.2 μs 0.0318 ± 0.0036 ms 0.974
time_to_load 0.454 ± 0.0015 s 0.456 ± 0.0026 s 0.996

Benchmark Plots

A plot of the benchmark results have been uploaded as an artifact to the workflow run for this PR.
Go to "Actions"->"Benchmark a pull request"->[the most recent run]->"Artifacts" (at the bottom).

@vchuravy vchuravy merged commit abf0bcf into main Aug 7, 2024
34 of 45 checks passed
@vchuravy vchuravy deleted the vc/benchmark_pr branch August 7, 2024 08:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant