Restore Enzyme to CI checks #2807
Open
wsmoses wants to merge 1 commit into master from wsmoses-patch-1
CUDA.jl Benchmarks
Benchmark suite | Current: 79bb632 | Previous: 66ab572 | Ratio |
---|---|---|---|
latency/precompile | 43357408181.5 ns | 43493459423 ns | 1.00 |
latency/ttfp | 7071621793 ns | 7080947094 ns | 1.00 |
latency/import | 3452965242 ns | 3436925241 ns | 1.00 |
integration/volumerhs | 9615706 ns | 9607544 ns | 1.00 |
integration/byval/slices=1 | 147368 ns | 147107 ns | 1.00 |
integration/byval/slices=3 | 426211 ns | 425588 ns | 1.00 |
integration/byval/reference | 145372 ns | 145074 ns | 1.00 |
integration/byval/slices=2 | 286792 ns | 286272 ns | 1.00 |
integration/cudadevrt | 103652 ns | 103409 ns | 1.00 |
kernel/indexing | 14554 ns | 14466 ns | 1.01 |
kernel/indexing_checked | 15193 ns | 15124 ns | 1.00 |
kernel/occupancy | 720.7714285714286 ns | 706.0551724137931 ns | 1.02 |
kernel/launch | 2278 ns | 2284.6666666666665 ns | 1.00 |
kernel/rand | 15111 ns | 14928 ns | 1.01 |
array/reverse/1d | 20081 ns | 19689 ns | 1.02 |
array/reverse/2d | 25364 ns | 23897 ns | 1.06 |
array/reverse/1d_inplace | 11478 ns | 10329 ns | 1.11 |
array/reverse/2d_inplace | 13431 ns | 11801 ns | 1.14 |
array/copy | 21587 ns | 21231 ns | 1.02 |
array/iteration/findall/int | 159896 ns | 159142 ns | 1.00 |
array/iteration/findall/bool | 140579 ns | 139322.5 ns | 1.01 |
array/iteration/findfirst/int | 163904 ns | 164128.5 ns | 1.00 |
array/iteration/findfirst/bool | 165138.5 ns | 161608.5 ns | 1.02 |
array/iteration/scalar | 72215 ns | 73479 ns | 0.98 |
array/iteration/logical | 222002 ns | 218722 ns | 1.01 |
array/iteration/findmin/1d | 47346 ns | 46880 ns | 1.01 |
array/iteration/findmin/2d | 97289 ns | 97416 ns | 1.00 |
array/reductions/reduce/1d | 36819 ns | 36474 ns | 1.01 |
array/reductions/reduce/2d | 51578.5 ns | 41172 ns | 1.25 |
array/reductions/mapreduce/1d | 35014 ns | 34723 ns | 1.01 |
array/reductions/mapreduce/2d | 41539.5 ns | 51622 ns | 0.80 |
array/broadcast | 21682 ns | 21035 ns | 1.03 |
array/copyto!/gpu_to_gpu | 12890 ns | 11372 ns | 1.13 |
array/copyto!/cpu_to_gpu | 218876 ns | 219501 ns | 1.00 |
array/copyto!/gpu_to_cpu | 285727 ns | 286844 ns | 1.00 |
array/accumulate/1d | 109468 ns | 109787 ns | 1.00 |
array/accumulate/2d | 81398 ns | 80534 ns | 1.01 |
array/construct | 1278.7 ns | 1267.7 ns | 1.01 |
array/random/randn/Float32 | 49531 ns | 48385.5 ns | 1.02 |
array/random/randn!/Float32 | 25322 ns | 25222 ns | 1.00 |
array/random/rand!/Int64 | 27361 ns | 27316 ns | 1.00 |
array/random/rand!/Float32 | 9039.666666666666 ns | 8940.666666666666 ns | 1.01 |
array/random/rand/Int64 | 30329 ns | 30144 ns | 1.01 |
array/random/rand/Float32 | 13709 ns | 13204 ns | 1.04 |
array/permutedims/4d | 61859.5 ns | 61641 ns | 1.00 |
array/permutedims/2d | 55887 ns | 55255 ns | 1.01 |
array/permutedims/3d | 56544 ns | 56276 ns | 1.00 |
array/sorting/1d | 2778409 ns | 2779172 ns | 1.00 |
array/sorting/by | 3369370 ns | 3370605.5 ns | 1.00 |
array/sorting/2d | 1086976 ns | 1087247 ns | 1.00 |
cuda/synchronization/stream/auto | 1015.2727272727273 ns | 1019.1 ns | 1.00 |
cuda/synchronization/stream/nonblocking | 7293.700000000001 ns | 8133.4 ns | 0.90 |
cuda/synchronization/stream/blocking | 844.0246913580247 ns | 829.5851063829788 ns | 1.02 |
cuda/synchronization/context/auto | 1167.8 ns | 1202 ns | 0.97 |
cuda/synchronization/context/nonblocking | 7141 ns | 7569.299999999999 ns | 0.94 |
cuda/synchronization/context/blocking | 913.1590909090909 ns | 910.7058823529412 ns | 1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
Enzyme CI fails.
@vchuravy looks like your fix missed the tape_type function?
now that @vchuravy fixed the GPUCompiler compat