Check cuda::memcpy_async preconditions #4700

Merged: 6 commits, May 28, 2025

davebayer
Contributor

The preconditions of cuda::memcpy_async are described in the documentation; however, they are not checked in the code.

This PR adds assert statements to cuda::aligned_size_t and cuda::memcpy_async that check the preconditions.
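To illustrate the kind of check meant here, below is a minimal host-only sketch. It is not the code in this PR and not the libcu++ implementation: `is_aligned_to`, `checked_aligned_size`, and `checked_copy` are hypothetical names, and plain `assert` with `std::memcpy` stands in for the library's internal assertion macro and the asynchronous copy. It assumes the documented preconditions are that the size is a multiple of the alignment and that both pointers are aligned to it.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not a libcu++ API): true if `ptr` sits on an `align`-byte boundary.
inline bool is_aligned_to(const void* ptr, std::size_t align) {
    return reinterpret_cast<std::uintptr_t>(ptr) % align == 0;
}

// Hypothetical stand-in for cuda::aligned_size_t<Align>.
// Assumed precondition: the stored size is a multiple of Align.
template <std::size_t Align>
struct checked_aligned_size {
    static_assert(Align != 0 && (Align & (Align - 1)) == 0,
                  "Align must be a power of two");
    static constexpr std::size_t align = Align;
    std::size_t value;

    explicit checked_aligned_size(std::size_t size) : value(size) {
        // Size precondition is checked at construction time.
        assert(size % Align == 0 && "size must be a multiple of Align");
    }
    operator std::size_t() const { return value; }
};

// Hypothetical synchronous stand-in for the aligned memcpy_async overload.
// Assumed precondition: both pointers are aligned to Align.
template <std::size_t Align>
void checked_copy(void* dst, const void* src, checked_aligned_size<Align> size) {
    assert(is_aligned_to(dst, Align) && "destination must be aligned to Align");
    assert(is_aligned_to(src, Align) && "source must be aligned to Align");
    std::memcpy(dst, src, size);  // size converts to std::size_t
}
```

With this sketch, `checked_copy(dst, src, checked_aligned_size<16>(32))` succeeds for 16-byte-aligned buffers, while `checked_aligned_size<16>(30)` or a misaligned pointer trips an assertion in debug builds. As with any `assert`, the checks disappear when `NDEBUG` is defined.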

@davebayer davebayer requested a review from a team as a code owner May 14, 2025 17:59
@davebayer davebayer requested a review from griwes May 14, 2025 17:59
@github-project-automation github-project-automation bot moved this to Todo in CCCL May 14, 2025

copy-pr-bot bot commented May 14, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL May 14, 2025
Contributor

@miscco miscco left a comment

Looks good to me, but I would like some other eyes on this.

@miscco miscco requested a review from fbusato May 14, 2025 18:21
@github-project-automation github-project-automation bot moved this from In Review to In Progress in CCCL May 14, 2025
Contributor

miscco commented May 15, 2025

/ok to test 123f07d

Contributor

🟨 CI finished in 1h 11m: Pass: 76%/174 | Total: 1d 03h | Avg: 9m 27s | Max: 35m 51s | Hits: 95%/165318
  • 🟨 libcudacxx: Pass: 8%/45 | Total: 4h 44m | Avg: 6m 19s | Max: 33m 02s | Hits: 34%/9291

    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 11m 51s | Avg:  2m 57s | Max:  3m 00s
      🟥 Clang15            Pass:   0%/2   | Total:  6m 19s | Avg:  3m 09s | Max:  3m 12s
      🟥 Clang16            Pass:   0%/2   | Total:  6m 00s | Avg:  3m 00s | Max:  3m 00s
      🟥 Clang17            Pass:   0%/2   | Total:  6m 23s | Avg:  3m 11s | Max:  3m 23s
      🟥 Clang18            Pass:   0%/2   | Total:  6m 09s | Avg:  3m 04s | Max:  3m 06s
      🟥 Clang19            Pass:   0%/6   | Total: 15m 07s | Avg:  2m 31s | Max:  3m 14s
      🟥 GCC7               Pass:   0%/2   | Total:  5m 46s | Avg:  2m 53s | Max:  3m 00s
      🟥 GCC8               Pass:   0%/1   | Total:  2m 45s | Avg:  2m 45s | Max:  2m 45s
      🟥 GCC9               Pass:   0%/2   | Total:  5m 37s | Avg:  2m 48s | Max:  2m 52s
      🟥 GCC10              Pass:   0%/2   | Total:  5m 53s | Avg:  2m 56s | Max:  2m 57s
      🟥 GCC11              Pass:   0%/2   | Total:  5m 57s | Avg:  2m 58s | Max:  2m 59s
      🟥 GCC12              Pass:   0%/2   | Total:  5m 51s | Avg:  2m 55s | Max:  3m 01s
      🟨 GCC13              Pass:  10%/10  | Total:  1h 11m | Avg:  7m 09s | Max: 27m 04s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 40s | Max: 33m 02s | Hits:  34%/6189  
      🟨 MSVC14.42          Pass:  50%/2   | Total: 47m 43s | Avg: 23m 51s | Max: 32m 21s | Hits:  34%/3102  
      🟥 NVHPC25.3          Pass:   0%/2   | Total: 16m 02s | Avg:  8m 01s | Max:  8m 19s
    🟨 jobs
      🟨 Build              Pass:   7%/39  | Total:  3h 48m | Avg:  5m 51s | Max: 33m 02s | Hits:  34%/9291  
      🟥 NVRTC              Pass:   0%/2   | Total: 53m 45s | Avg: 26m 52s | Max: 27m 04s
      🟥 Test               Pass:   0%/3  
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s
    🟨 cpu
      🟨 amd64              Pass:   9%/43  | Total:  4h 38m | Avg:  6m 29s | Max: 33m 02s | Hits:  34%/9291  
      🟥 arm64              Pass:   0%/2   | Total:  5m 19s | Avg:  2m 39s | Max:  2m 41s
    🟨 ctk
      🟨 12.0               Pass:  20%/5   | Total: 43m 43s | Avg:  8m 44s | Max: 32m 19s | Hits:  34%/3089  
      🟨 12.8               Pass:   7%/40  | Total:  4h 00m | Avg:  6m 00s | Max: 33m 02s | Hits:  34%/6202  
    🟨 cudacxx
      🟥 ClangCUDA19        Pass:   0%/2   | Total:  6m 13s | Avg:  3m 06s | Max:  3m 11s
      🟨 nvcc12.0           Pass:  20%/5   | Total: 43m 43s | Avg:  8m 44s | Max: 32m 19s | Hits:  34%/3089  
      🟨 nvcc12.8           Pass:   7%/38  | Total:  3h 54m | Avg:  6m 10s | Max: 33m 02s | Hits:  34%/6202  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  6m 13s | Avg:  3m 06s | Max:  3m 11s
      🟨 nvcc               Pass:   9%/43  | Total:  4h 38m | Avg:  6m 28s | Max: 33m 02s | Hits:  34%/9291  
    🟨 cxx_family
      🟥 Clang              Pass:   0%/18  | Total: 51m 49s | Avg:  2m 52s | Max:  3m 23s
      🟨 GCC                Pass:   4%/21  | Total:  1h 43m | Avg:  4m 55s | Max: 27m 04s
      🟨 MSVC               Pass:  75%/4   | Total:  1h 53m | Avg: 28m 16s | Max: 33m 02s | Hits:  34%/9291  
      🟥 NVHPC              Pass:   0%/2   | Total: 16m 02s | Avg:  8m 01s | Max:  8m 19s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  2m 53s | Avg:  1m 26s | Max:  2m 53s
      🟨 rtx2080            Pass:   9%/43  | Total:  4h 41m | Avg:  6m 32s | Max: 33m 02s | Hits:  34%/9291  
    🟥 sm
      🟥 75                 Pass:   0%/2   | Total: 53m 45s | Avg: 26m 52s | Max: 27m 04s
      🟥 90                 Pass:   0%/2   | Total:  2m 53s | Avg:  1m 26s | Max:  2m 53s
      🟥 90;90a;100         Pass:   0%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
    🟨 std
      🟨 17                 Pass:  13%/22  | Total:  3h 03m | Avg:  8m 20s | Max: 33m 02s | Hits:  34%/9291  
      🟥 20                 Pass:   0%/22  | Total:  1h 38m | Avg:  4m 29s | Max: 27m 04s
    
  • 🟩 cub: Pass: 100%/47 | Total: 10h 31m | Avg: 13m 25s | Max: 34m 59s | Hits: 99%/56985

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 10h 17m | Avg: 13m 42s | Max: 34m 59s | Hits:  99%/54507 
      🟩 arm64              Pass: 100%/2   | Total: 14m 09s | Avg:  7m 04s | Max:  8m 04s | Hits:  99%/2478  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 57m 16s | Avg: 11m 27s | Max: 28m 39s | Hits:  99%/6021  
      🟩 12.8               Pass: 100%/42  | Total:  9h 34m | Avg: 13m 40s | Max: 34m 59s | Hits:  99%/50964 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 48s | Avg:  5m 24s | Max:  5m 31s | Hits: 100%/2134  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 57m 16s | Avg: 11m 27s | Max: 28m 39s | Hits:  99%/6021  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  9h 23m | Avg: 14m 04s | Max: 34m 59s | Hits:  99%/48830 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 48s | Avg:  5m 24s | Max:  5m 31s | Hits: 100%/2134  
      🟩 nvcc               Pass: 100%/45  | Total: 10h 20m | Avg: 13m 47s | Max: 34m 59s | Hits:  99%/54851 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 27m 12s | Avg:  6m 48s | Max:  7m 53s | Hits: 100%/4964  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 50s | Avg:  6m 55s | Max:  7m 07s | Hits: 100%/2478  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 50s | Avg:  6m 55s | Max:  6m 57s | Hits: 100%/2478  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 21s | Avg:  7m 10s | Max:  7m 23s | Hits: 100%/2478  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 26s | Avg:  6m 43s | Max:  6m 47s | Hits: 100%/2478  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 21m | Avg: 11m 35s | Max: 29m 04s | Hits: 100%/8329  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 08s | Avg:  8m 04s | Max:  8m 23s | Hits:  99%/2482  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 39s | Avg:  8m 39s | Max:  8m 39s | Hits:  99%/1241  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 52s | Avg:  8m 56s | Max:  9m 27s | Hits:  99%/2482  
      🟩 GCC10              Pass: 100%/2   | Total: 17m 20s | Avg:  8m 40s | Max:  8m 51s | Hits:  99%/2482  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max:  8m 47s | Hits:  99%/2478  
      🟩 GCC12              Pass: 100%/2   | Total: 18m 55s | Avg:  9m 27s | Max:  9m 34s | Hits:  99%/2478  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 38m | Avg: 19m 50s | Max: 34m 59s | Hits:  99%/13629 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 13s | Avg: 29m 06s | Max: 29m 34s | Hits:  99%/2114  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 57s | Max: 33m 43s | Hits:  99%/2114  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 29m 02s | Avg: 14m 31s | Max: 14m 36s | Hits:  98%/2280  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 43m | Avg:  8m 37s | Max: 29m 04s | Hits: 100%/23205 
      🟩 GCC                Pass: 100%/22  | Total:  5h 14m | Avg: 14m 17s | Max: 34m 59s | Hits:  99%/27272 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 04m | Avg: 31m 01s | Max: 33m 43s | Hits:  99%/4228  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 02s | Avg: 14m 31s | Max: 14m 36s | Hits:  98%/2280  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 57m 28s | Avg: 19m 09s | Max: 27m 37s | Hits:  99%/3717  
      🟩 rtx2080            Pass: 100%/36  | Total:  6h 23m | Avg: 10m 38s | Max: 33m 43s | Hits:  99%/43356 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 10m | Avg: 23m 51s | Max: 34m 59s | Hits:  99%/9912  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 45m | Avg: 10m 23s | Max: 33m 43s | Hits:  99%/47073 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 30m 25s | Avg: 30m 25s | Max: 30m 25s | Hits:  99%/1239  
      🟩 GraphCapture       Pass: 100%/1   | Total: 27m 56s | Avg: 27m 56s | Max: 27m 56s | Hits:  99%/1239  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 31m | Avg: 30m 33s | Max: 34m 59s | Hits:  99%/3717  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 16m | Avg: 25m 24s | Max: 31m 09s | Hits:  99%/3717  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 57m 28s | Avg: 19m 09s | Max: 27m 37s | Hits:  99%/3717  
      🟩 90;90a;100         Pass: 100%/1   | Total:  9m 53s | Avg:  9m 53s | Max:  9m 53s | Hits:  99%/1239  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 58m | Avg: 11m 21s | Max: 32m 11s | Hits:  99%/25218 
      🟩 20                 Pass: 100%/26  | Total:  6h 32m | Avg: 15m 06s | Max: 34m 59s | Hits:  99%/31767 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 8h 26m | Avg: 10m 46s | Max: 35m 51s | Hits: 99%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 33s | Avg: 10m 16s | Max: 13m 16s | Hits:  99%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  8h 14m | Avg: 10m 58s | Max: 35m 51s | Hits:  99%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 44s | Hits:  99%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 53m 02s | Avg: 10m 36s | Max: 29m 42s | Hits:  99%/8941  
      🟩 12.8               Pass: 100%/42  | Total:  7h 33m | Avg: 10m 47s | Max: 35m 51s | Hits:  99%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 42s | Hits: 100%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 53m 02s | Avg: 10m 36s | Max: 29m 42s | Hits:  99%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  7h 21m | Avg: 11m 02s | Max: 35m 51s | Hits:  99%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 42s | Hits: 100%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  8h 14m | Avg: 10m 59s | Max: 35m 51s | Hits:  99%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 38s | Avg:  5m 39s | Max:  6m 36s | Hits: 100%/7156  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 32s | Avg:  6m 16s | Max:  6m 25s | Hits: 100%/3578  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 23s | Hits: 100%/3578  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 49s | Avg:  5m 54s | Max:  6m 00s | Hits: 100%/3578  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  5m 53s | Hits: 100%/3578  
      🟩 Clang19            Pass: 100%/7   | Total: 47m 23s | Avg:  6m 46s | Max: 10m 46s | Hits: 100%/12523 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 15s | Avg:  6m 37s | Max:  6m 40s | Hits:  99%/3580  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 46s | Avg:  6m 46s | Max:  6m 46s | Hits:  99%/1790  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 02s | Avg:  7m 01s | Max:  7m 36s | Hits:  99%/3580  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 07s | Avg:  7m 03s | Max:  7m 04s | Hits:  99%/3580  
      🟩 GCC11              Pass: 100%/2   | Total: 14m 43s | Avg:  7m 21s | Max:  7m 38s | Hits:  99%/3580  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 26s | Avg:  7m 13s | Max:  7m 13s | Hits:  99%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 30m | Avg:  9m 05s | Max: 13m 30s | Hits:  99%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 55s | Avg: 29m 27s | Max: 29m 42s | Hits:  99%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 38m | Avg: 32m 56s | Max: 35m 51s | Hits:  99%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 00s | Max: 31m 36s | Hits:  99%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 58m | Avg:  6m 13s | Max: 10m 46s | Hits: 100%/33991 
      🟩 GCC                Pass: 100%/21  | Total:  2h 48m | Avg:  8m 00s | Max: 13m 30s | Hits:  99%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 37m | Avg: 31m 33s | Max: 35m 51s | Hits:  99%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 00s | Max: 31m 36s | Hits:  99%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max: 11m 58s | Hits:  99%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total:  5h 43m | Avg:  9m 49s | Max: 31m 36s | Hits:  99%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 24m | Avg: 14m 26s | Max: 35m 51s | Hits:  99%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  6h 43m | Avg: 10m 04s | Max: 32m 21s | Hits:  99%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 28s | Avg: 17m 49s | Max: 35m 51s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total: 49m 30s | Avg: 12m 22s | Max: 13m 30s | Hits:  99%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max: 11m 58s | Hits:  99%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 30s | Avg:  7m 30s | Max:  7m 30s | Hits:  99%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 52m | Avg: 11m 04s | Max: 31m 36s | Hits:  99%/37560 
      🟩 20                 Pass: 100%/24  | Total:  4h 12m | Avg: 10m 32s | Max: 35m 51s | Hits:  99%/42934 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 29m | Avg: 5m 44s | Max: 15m 20s | Hits: 97%/14642

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 16m | Avg:  6m 11s | Max: 15m 20s | Hits:  97%/12298 
      🟩 arm64              Pass: 100%/4   | Total: 13m 07s | Avg:  3m 16s | Max:  3m 36s | Hits:  99%/2344  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 21m 54s | Avg:  7m 18s | Max: 14m 59s | Hits:  94%/1463  
      🟩 12.8               Pass: 100%/23  | Total:  2h 07m | Avg:  5m 32s | Max: 15m 20s | Hits:  98%/13179 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 21m 54s | Avg:  7m 18s | Max: 14m 59s | Hits:  94%/1463  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 07m | Avg:  5m 32s | Max: 15m 20s | Hits:  98%/13179 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 29m | Avg:  5m 44s | Max: 15m 20s | Hits:  97%/14642 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  3m 50s | Hits: 100%/1176  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s | Hits: 100%/586   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s | Hits: 100%/586   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s | Hits: 100%/586   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits: 100%/586   
      🟩 Clang19            Pass: 100%/4   | Total: 20m 23s | Avg:  5m 05s | Max: 10m 40s | Hits: 100%/2344  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 53s | Hits:  99%/1176  
      🟩 GCC11              Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s | Hits:  99%/586   
      🟩 GCC12              Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s | Hits:  99%/586   
      🟩 GCC13              Pass: 100%/8   | Total: 38m 34s | Avg:  4m 49s | Max:  9m 08s | Hits:  99%/4688  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s | Hits:  74%/287   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 15m 20s | Avg: 15m 20s | Max: 15m 20s | Hits:  74%/287   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 22m 19s | Avg: 11m 09s | Max: 11m 13s | Hits:  86%/1168  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 42m 07s | Avg:  4m 12s | Max: 10m 40s | Hits: 100%/5864  
      🟩 GCC                Pass: 100%/12  | Total: 54m 36s | Avg:  4m 33s | Max:  9m 08s | Hits:  99%/7036  
      🟩 MSVC               Pass: 100%/2   | Total: 30m 19s | Avg: 15m 09s | Max: 15m 20s | Hits:  74%/574   
      🟩 NVHPC              Pass: 100%/2   | Total: 22m 19s | Avg: 11m 09s | Max: 11m 13s | Hits:  86%/1168  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  7m 59s | Hits:  99%/1172  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 18m | Avg:  5m 45s | Max: 15m 20s | Hits:  97%/13470 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 01m | Avg:  5m 17s | Max: 15m 20s | Hits:  97%/12884 
      🟩 Test               Pass: 100%/3   | Total: 27m 47s | Avg:  9m 15s | Max: 10m 40s | Hits:  99%/1758  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 14m 30s | Avg:  4m 50s | Max:  7m 59s | Hits:  99%/1758  
      🟩 90a                Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s | Hits:  99%/586   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max: 11m 06s | Hits:  96%/2342  
      🟩 20                 Pass: 100%/22  | Total:  2h 08m | Avg:  5m 50s | Max: 15m 20s | Hits:  98%/12300 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 21m 10s | Avg: 5m 17s | Max: 6m 31s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 13s | Avg:  6m 06s | Max:  6m 31s
      🟩 arm64              Pass: 100%/2   | Total:  8m 57s | Avg:  4m 28s | Max:  4m 31s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 31s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 42s
      🟩 20                 Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  6m 31s
    
  • 🟩 python: Pass: 100%/3 | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 37m 40s | Avg: 12m 33s | Max: 23m 56s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  6m 53s | Avg:  6m 53s | Max:  6m 53s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 23m 56s | Avg: 23m 56s | Max: 23m 56s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 35s | Avg: 8m 17s | Max: 14m 18s | Hits: 98%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max: 14m 18s | Hits:  98%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s | Hits:  98%/163   
      🟩 Test               Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 174)

# Runner
123 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
10 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Contributor

miscco commented May 20, 2025

/ok to test d7bbc10

Contributor

🟨 CI finished in 2h 45m: Pass: 81%/174 | Total: 1d 05h | Avg: 10m 08s | Max: 35m 08s | Hits: 97%/187940
  • 🟨 libcudacxx: Pass: 28%/45 | Total: 6h 46m | Avg: 9m 01s | Max: 30m 58s | Hits: 83%/31913

    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total: 46m 08s | Avg:  9m 13s | Max: 28m 58s | Hits:  98%/16011 
      🔍 12.8               Pass:  20%/40  | Total:  6h 00m | Avg:  9m 00s | Max: 30m 58s | Hits:  68%/15902 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 48m 33s | Avg: 24m 16s | Max: 25m 35s | Hits:  26%/6515  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 46m 08s | Avg:  9m 13s | Max: 28m 58s | Hits:  98%/16011 
      🔍 nvcc12.8           Pass:  15%/38  | Total:  5h 11m | Avg:  8m 11s | Max: 30m 58s | Hits:  98%/9387  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 33s | Avg: 24m 16s | Max: 25m 35s | Hits:  26%/6515  
      🔍 nvcc               Pass:  25%/43  | Total:  5h 57m | Avg:  8m 18s | Max: 30m 58s | Hits:  98%/25398 
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total: 18m 45s | Avg:  4m 41s | Max:  5m 09s | Hits:  98%/6482  
      🟥 Clang15            Pass:   0%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  4m 53s
      🟥 Clang16            Pass:   0%/2   | Total: 10m 12s | Avg:  5m 06s | Max:  5m 07s
      🟥 Clang17            Pass:   0%/2   | Total: 15m 41s | Avg:  7m 50s | Max: 10m 30s
      🟥 Clang18            Pass:   0%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  4m 56s
      🟨 Clang19            Pass:  33%/6   | Total:  1h 03m | Avg: 10m 32s | Max: 25m 35s | Hits:  26%/6515  
      🟨 GCC7               Pass:  50%/2   | Total:  9m 11s | Avg:  4m 35s | Max:  4m 58s | Hits:  98%/3217  
      🟥 GCC8               Pass:   0%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s
      🟨 GCC9               Pass:  50%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 41s | Hits:  98%/3223  
      🟥 GCC10              Pass:   0%/2   | Total:  9m 28s | Avg:  4m 44s | Max:  4m 49s
      🟥 GCC11              Pass:   0%/2   | Total:  9m 16s | Avg:  4m 38s | Max:  4m 38s
      🟥 GCC12              Pass:   0%/2   | Total: 10m 25s | Avg:  5m 12s | Max:  5m 17s
      🟨 GCC13              Pass:  30%/10  | Total:  1h 23m | Avg:  8m 20s | Max: 22m 28s | Hits:  90%/40    
      🟩 MSVC14.29          Pass: 100%/2   | Total: 59m 09s | Avg: 29m 34s | Max: 30m 11s | Hits:  98%/6189  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 38s | Max: 30m 58s | Hits:  98%/6247  
      🟥 NVHPC25.3          Pass:   0%/2   | Total: 23m 06s | Avg: 11m 33s | Max: 11m 53s
    🟨 cxx_family
      🟨 Clang              Pass:  22%/18  | Total:  2h 07m | Avg:  7m 04s | Max: 25m 35s | Hits:  62%/12997 
      🟨 GCC                Pass:  23%/21  | Total:  2h 15m | Avg:  6m 26s | Max: 22m 28s | Hits:  98%/6480  
      🟩 MSVC               Pass: 100%/4   | Total:  2h 00m | Avg: 30m 06s | Max: 30m 58s | Hits:  98%/12436 
      🟥 NVHPC              Pass:   0%/2   | Total: 23m 06s | Avg: 11m 33s | Max: 11m 53s
    🟨 jobs
      🟨 Build              Pass:  25%/39  | Total:  5h 59m | Avg:  9m 12s | Max: 30m 58s | Hits:  83%/31873 
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 41s | Avg: 22m 20s | Max: 22m 28s | Hits:  90%/40    
      🟥 Test               Pass:   0%/3  
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
    🟨 sm
      🟩 75                 Pass: 100%/2   | Total: 44m 41s | Avg: 22m 20s | Max: 22m 28s | Hits:  90%/40    
      🟥 90                 Pass:   0%/2   | Total:  7m 14s | Avg:  3m 37s | Max:  7m 14s
      🟥 90;90a;100         Pass:   0%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
    🟨 cpu
      🟨 amd64              Pass:  30%/43  | Total:  6h 36m | Avg:  9m 13s | Max: 30m 58s | Hits:  83%/31913 
      🟥 arm64              Pass:   0%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 38s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  7m 14s | Avg:  3m 37s | Max:  7m 14s
      🟨 rtx2080            Pass:  30%/43  | Total:  6h 38m | Avg:  9m 16s | Max: 30m 58s | Hits:  83%/31913 
    🟨 std
      🟨 17                 Pass:  36%/22  | Total:  3h 42m | Avg: 10m 07s | Max: 30m 58s | Hits:  88%/22209 
      🟨 20                 Pass:  18%/22  | Total:  3h 01m | Avg:  8m 13s | Max: 30m 19s | Hits:  74%/9704  
    
  • 🟩 cub: Pass: 100%/47 | Total: 10h 30m | Avg: 13m 24s | Max: 33m 39s | Hits: 99%/56985

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 10h 16m | Avg: 13m 41s | Max: 33m 39s | Hits:  99%/54507 
      🟩 arm64              Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max:  8m 03s | Hits:  99%/2478  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 57m 11s | Avg: 11m 26s | Max: 27m 55s | Hits:  99%/6021  
      🟩 12.8               Pass: 100%/42  | Total:  9h 32m | Avg: 13m 38s | Max: 33m 39s | Hits:  99%/50964 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 46s | Avg:  5m 53s | Max:  6m 30s | Hits: 100%/2134  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 57m 11s | Avg: 11m 26s | Max: 27m 55s | Hits:  99%/6021  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  9h 21m | Avg: 14m 01s | Max: 33m 39s | Hits:  99%/48830 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 46s | Avg:  5m 53s | Max:  6m 30s | Hits: 100%/2134  
      🟩 nvcc               Pass: 100%/45  | Total: 10h 18m | Avg: 13m 44s | Max: 33m 39s | Hits:  99%/54851 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 26m 44s | Avg:  6m 41s | Max:  7m 14s | Hits: 100%/4964  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 30s | Avg:  6m 45s | Max:  6m 47s | Hits: 100%/2478  
      🟩 Clang16            Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max:  7m 30s | Hits: 100%/2478  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 04s | Avg:  7m 02s | Max:  7m 27s | Hits: 100%/2478  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 35s | Avg:  6m 47s | Max:  6m 57s | Hits: 100%/2478  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 24m | Avg: 12m 05s | Max: 26m 13s | Hits: 100%/8329  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 23s | Avg:  8m 11s | Max:  8m 34s | Hits:  99%/2482  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 39s | Avg:  8m 39s | Max:  8m 39s | Hits:  99%/1241  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 04s | Avg:  8m 32s | Max:  8m 38s | Hits:  99%/2482  
      🟩 GCC10              Pass: 100%/2   | Total: 17m 22s | Avg:  8m 41s | Max:  8m 47s | Hits:  99%/2482  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 35s | Avg:  8m 47s | Max:  8m 48s | Hits:  99%/2478  
      🟩 GCC12              Pass: 100%/2   | Total: 17m 28s | Avg:  8m 44s | Max:  8m 50s | Hits:  99%/2478  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 41m | Avg: 20m 07s | Max: 33m 39s | Hits:  99%/13629 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 56m 52s | Avg: 28m 26s | Max: 28m 57s | Hits:  99%/2114  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 12s | Max: 31m 26s | Hits:  99%/2114  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 27m 33s | Avg: 13m 46s | Max: 13m 52s | Hits:  98%/2280  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 47m | Avg:  8m 48s | Max: 26m 13s | Hits: 100%/23205 
      🟩 GCC                Pass: 100%/22  | Total:  5h 15m | Avg: 14m 21s | Max: 33m 39s | Hits:  99%/27272 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 59m | Avg: 29m 49s | Max: 31m 26s | Hits:  99%/4228  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 33s | Avg: 13m 46s | Max: 13m 52s | Hits:  98%/2280  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 58m 17s | Avg: 19m 25s | Max: 27m 54s | Hits:  99%/3717  
      🟩 rtx2080            Pass: 100%/36  | Total:  6h 16m | Avg: 10m 27s | Max: 31m 26s | Hits:  99%/43356 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 15m | Avg: 24m 27s | Max: 33m 39s | Hits:  99%/9912  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 38m | Avg: 10m 13s | Max: 31m 26s | Hits:  99%/47073 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 33m 36s | Avg: 33m 36s | Max: 33m 36s | Hits:  99%/1239  
      🟩 GraphCapture       Pass: 100%/1   | Total: 33m 39s | Avg: 33m 39s | Max: 33m 39s | Hits:  99%/1239  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 24m | Avg: 28m 15s | Max: 30m 39s | Hits:  99%/3717  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 19m | Avg: 26m 30s | Max: 29m 14s | Hits:  99%/3717  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 58m 17s | Avg: 19m 25s | Max: 27m 54s | Hits:  99%/3717  
      🟩 90;90a;100         Pass: 100%/1   | Total:  9m 13s | Avg:  9m 13s | Max:  9m 13s | Hits:  99%/1239  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 55m | Avg: 11m 11s | Max: 30m 59s | Hits:  99%/25218 
      🟩 20                 Pass: 100%/26  | Total:  6h 35m | Avg: 15m 11s | Max: 33m 39s | Hits:  99%/31767 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 8h 30m | Avg: 10m 51s | Max: 35m 08s | Hits: 99%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 21s | Avg: 10m 10s | Max: 12m 50s | Hits:  99%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  8h 18m | Avg: 11m 04s | Max: 35m 08s | Hits:  99%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 11m 51s | Avg:  5m 55s | Max:  6m 44s | Hits:  99%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 53m 48s | Avg: 10m 45s | Max: 29m 27s | Hits:  99%/8941  
      🟩 12.8               Pass: 100%/42  | Total:  7h 36m | Avg: 10m 52s | Max: 35m 08s | Hits:  99%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  5m 35s | Hits: 100%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 53m 48s | Avg: 10m 45s | Max: 29m 27s | Hits:  99%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  7h 25m | Avg: 11m 08s | Max: 35m 08s | Hits:  99%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  5m 35s | Hits: 100%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  8h 19m | Avg: 11m 05s | Max: 35m 08s | Hits:  99%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 17s | Avg:  5m 49s | Max:  5m 59s | Hits: 100%/7156  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 28s | Avg:  6m 14s | Max:  6m 19s | Hits: 100%/3578  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 16s | Hits: 100%/3578  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 28s | Avg:  5m 44s | Max:  5m 47s | Hits: 100%/3578  
      🟩 Clang18            Pass: 100%/2   | Total: 12m 16s | Avg:  6m 08s | Max:  6m 12s | Hits: 100%/3578  
      🟩 Clang19            Pass: 100%/7   | Total: 47m 24s | Avg:  6m 46s | Max: 10m 46s | Hits: 100%/12523 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 09s | Avg:  6m 34s | Max:  6m 41s | Hits:  99%/3580  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 37s | Avg:  6m 37s | Max:  6m 37s | Hits:  99%/1790  
      🟩 GCC9               Pass: 100%/2   | Total: 13m 34s | Avg:  6m 47s | Max:  7m 05s | Hits:  99%/3580  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 18s | Avg:  7m 09s | Max:  7m 17s | Hits:  99%/3580  
      🟩 GCC11              Pass: 100%/2   | Total: 15m 27s | Avg:  7m 43s | Max:  7m 45s | Hits:  99%/3580  
      🟩 GCC12              Pass: 100%/2   | Total: 15m 11s | Avg:  7m 35s | Max:  7m 57s | Hits:  99%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 32m | Avg:  9m 17s | Max: 13m 22s | Hits:  99%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 43s | Max: 31m 59s | Hits:  99%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 39m | Avg: 33m 12s | Max: 35m 08s | Hits:  99%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 58m 53s | Avg: 29m 26s | Max: 32m 12s | Hits:  99%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 59m | Avg:  6m 16s | Max: 10m 46s | Hits: 100%/33991 
      🟩 GCC                Pass: 100%/21  | Total:  2h 51m | Avg:  8m 09s | Max: 13m 22s | Hits:  99%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 41m | Avg: 32m 12s | Max: 35m 08s | Hits:  99%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total: 58m 53s | Avg: 29m 26s | Max: 32m 12s | Hits:  99%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 09s | Avg:  9m 04s | Max: 12m 02s | Hits:  99%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total:  5h 47m | Avg:  9m 55s | Max: 32m 12s | Hits:  99%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 24m | Avg: 14m 29s | Max: 35m 08s | Hits:  99%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  6h 47m | Avg: 10m 11s | Max: 32m 35s | Hits:  99%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 33s | Avg: 17m 51s | Max: 35m 08s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total: 49m 00s | Avg: 12m 15s | Max: 13m 22s | Hits:  99%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 18m 09s | Avg:  9m 04s | Max: 12m 02s | Hits:  99%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 10s | Avg:  8m 10s | Max:  8m 10s | Hits:  99%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 56m | Avg: 11m 17s | Max: 32m 12s | Hits:  99%/37560 
      🟩 20                 Pass: 100%/24  | Total:  4h 12m | Avg: 10m 32s | Max: 35m 08s | Hits:  99%/42934 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 28m | Avg: 5m 42s | Max: 13m 43s | Hits: 99%/14642

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 11m | Avg:  5m 59s | Max: 13m 43s | Hits:  99%/12298 
      🟩 arm64              Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 28s | Hits:  99%/2344  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 20m 26s | Avg:  6m 48s | Max: 12m 56s | Hits:  98%/1463  
      🟩 12.8               Pass: 100%/23  | Total:  2h 08m | Avg:  5m 34s | Max: 13m 43s | Hits:  99%/13179 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 20m 26s | Avg:  6m 48s | Max: 12m 56s | Hits:  98%/1463  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 08m | Avg:  5m 34s | Max: 13m 43s | Hits:  99%/13179 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 28m | Avg:  5m 42s | Max: 13m 43s | Hits:  99%/14642 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  7m 31s | Avg:  3m 45s | Max:  4m 26s | Hits: 100%/1176  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 25s | Avg:  3m 25s | Max:  3m 25s | Hits: 100%/586   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 40s | Avg:  3m 40s | Max:  3m 40s | Hits: 100%/586   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s | Hits: 100%/586   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits: 100%/586   
      🟩 Clang19            Pass: 100%/4   | Total: 26m 03s | Avg:  6m 30s | Max: 13m 43s | Hits: 100%/2344  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 25s | Hits:  99%/1176  
      🟩 GCC11              Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s | Hits:  99%/586   
      🟩 GCC12              Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s | Hits:  99%/586   
      🟩 GCC13              Pass: 100%/8   | Total: 40m 47s | Avg:  5m 05s | Max: 10m 50s | Hits:  99%/4688  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 56s | Avg: 12m 56s | Max: 12m 56s | Hits:  95%/287   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 13m 16s | Avg: 13m 16s | Max: 13m 16s | Hits:  95%/287   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max:  8m 48s | Hits:  97%/1168  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 48m 31s | Avg:  4m 51s | Max: 13m 43s | Hits: 100%/5864  
      🟩 GCC                Pass: 100%/12  | Total: 56m 52s | Avg:  4m 44s | Max: 10m 50s | Hits:  99%/7036  
      🟩 MSVC               Pass: 100%/2   | Total: 26m 12s | Avg: 13m 06s | Max: 13m 16s | Hits:  95%/574   
      🟩 NVHPC              Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max:  8m 48s | Hits:  97%/1168  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  7m 49s | Hits:  99%/1172  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 17m | Avg:  5m 43s | Max: 13m 43s | Hits:  99%/13470 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  1h 56m | Avg:  5m 02s | Max: 13m 16s | Hits:  99%/12884 
      🟩 Test               Pass: 100%/3   | Total: 32m 22s | Avg: 10m 47s | Max: 13m 43s | Hits:  99%/1758  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  7m 49s | Hits:  99%/1758  
      🟩 90a                Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s | Hits:  99%/586   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 20m 02s | Avg:  5m 00s | Max:  8m 06s | Hits:  99%/2342  
      🟩 20                 Pass: 100%/22  | Total:  2h 08m | Avg:  5m 50s | Max: 13m 43s | Hits:  99%/12300 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 19m 26s | Avg: 4m 51s | Max: 5m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  5m 43s
      🟩 arm64              Pass: 100%/2   | Total:  8m 21s | Avg:  4m 10s | Max:  4m 23s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 43s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  5m 43s
      🟩 20                 Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  5m 22s
    
  • 🟩 python: Pass: 100%/3 | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 30m 54s | Avg: 10m 18s | Max: 15m 38s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  6m 38s | Avg:  6m 38s | Max:  6m 38s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 15m 38s | Avg: 15m 38s | Max: 15m 38s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  8m 38s | Avg:  8m 38s | Max:  8m 38s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits: 98%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 16m 03s | Hits:  98%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s | Hits:  98%/163   
      🟩 Test               Pass: 100%/1   | Total: 16m 03s | Avg: 16m 03s | Max: 16m 03s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 174)

Count  Runner
123    linux-amd64-cpu16
15     windows-amd64-cpu16
12     linux-arm64-cpu16
10     linux-amd64-gpu-rtx2080-latest-1
6      linux-amd64-gpu-rtxa6000-latest-1
5      linux-amd64-gpu-h100-latest-1
3      linux-amd64-gpu-rtx4090-latest-1

Contributor

miscco commented May 23, 2025

/ok to test 284945f

Contributor

🟩 CI finished in 1h 15m: Pass: 100%/174 | Total: 1d 06h | Avg: 10m 31s | Max: 36m 23s | Hits: 97%/282514
  • 🟩 cub: Pass: 100%/47 | Total: 10h 16m | Avg: 13m 06s | Max: 33m 36s | Hits: 99%/56985

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 10h 02m | Avg: 13m 23s | Max: 33m 36s | Hits:  99%/54507 
      🟩 arm64              Pass: 100%/2   | Total: 14m 08s | Avg:  7m 04s | Max:  8m 02s | Hits:  99%/2478  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 56m 04s | Avg: 11m 12s | Max: 27m 46s | Hits:  99%/6021  
      🟩 12.8               Pass: 100%/42  | Total:  9h 20m | Avg: 13m 20s | Max: 33m 36s | Hits:  99%/50964 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 20s | Hits: 100%/2134  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 56m 04s | Avg: 11m 12s | Max: 27m 46s | Hits:  99%/6021  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  9h 09m | Avg: 13m 44s | Max: 33m 36s | Hits:  99%/48830 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 20s | Hits: 100%/2134  
      🟩 nvcc               Pass: 100%/45  | Total: 10h 05m | Avg: 13m 27s | Max: 33m 36s | Hits:  99%/54851 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 26m 33s | Avg:  6m 38s | Max:  7m 19s | Hits: 100%/4964  
      🟩 Clang15            Pass: 100%/2   | Total: 14m 08s | Avg:  7m 04s | Max:  7m 16s | Hits: 100%/2478  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 32s | Avg:  6m 46s | Max:  6m 47s | Hits: 100%/2478  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 12s | Avg:  7m 06s | Max:  7m 14s | Hits: 100%/2478  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 36s | Hits: 100%/2478  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 22m | Avg: 11m 47s | Max: 26m 08s | Hits: 100%/8329  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max:  8m 40s | Hits:  99%/2482  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 00s | Avg:  8m 00s | Max:  8m 00s | Hits:  99%/1241  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 06s | Avg:  8m 33s | Max:  9m 04s | Hits:  99%/2482  
      🟩 GCC10              Pass: 100%/2   | Total: 17m 44s | Avg:  8m 52s | Max:  9m 05s | Hits:  99%/2482  
      🟩 GCC11              Pass: 100%/2   | Total: 16m 55s | Avg:  8m 27s | Max:  8m 28s | Hits:  99%/2478  
      🟩 GCC12              Pass: 100%/2   | Total: 17m 32s | Avg:  8m 46s | Max:  8m 48s | Hits:  99%/2478  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 31m | Avg: 19m 15s | Max: 31m 46s | Hits:  99%/13629 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 56m 32s | Avg: 28m 16s | Max: 28m 46s | Hits:  99%/2114  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 44s | Max: 33m 36s | Hits:  99%/2114  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 26m 53s | Avg: 13m 26s | Max: 13m 33s | Hits:  98%/2280  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 43m | Avg:  8m 37s | Max: 26m 08s | Hits: 100%/23205 
      🟩 GCC                Pass: 100%/22  | Total:  5h 05m | Avg: 13m 53s | Max: 31m 46s | Hits:  99%/27272 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 00m | Avg: 30m 00s | Max: 33m 36s | Hits:  99%/4228  
      🟩 NVHPC              Pass: 100%/2   | Total: 26m 53s | Avg: 13m 26s | Max: 13m 33s | Hits:  98%/2280  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 58m 38s | Avg: 19m 32s | Max: 26m 53s | Hits:  99%/3717  
      🟩 rtx2080            Pass: 100%/36  | Total:  6h 11m | Avg: 10m 19s | Max: 33m 36s | Hits:  99%/43356 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 06m | Avg: 23m 17s | Max: 31m 46s | Hits:  99%/9912  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 33m | Avg: 10m 05s | Max: 33m 36s | Hits:  99%/47073 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 31m 03s | Avg: 31m 03s | Max: 31m 03s | Hits:  99%/1239  
      🟩 GraphCapture       Pass: 100%/1   | Total: 27m 18s | Avg: 27m 18s | Max: 27m 18s | Hits:  99%/1239  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 24m | Avg: 28m 15s | Max: 31m 46s | Hits:  99%/3717  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 19m | Avg: 26m 36s | Max: 28m 19s | Hits:  99%/3717  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 58m 38s | Avg: 19m 32s | Max: 26m 53s | Hits:  99%/3717  
      🟩 90;90a;100         Pass: 100%/1   | Total:  9m 06s | Avg:  9m 06s | Max:  9m 06s | Hits:  99%/1239  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 48m | Avg: 10m 54s | Max: 29m 52s | Hits:  99%/25218 
      🟩 20                 Pass: 100%/26  | Total:  6h 27m | Avg: 14m 54s | Max: 33m 36s | Hits:  99%/31767 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 8h 22m | Avg: 10m 42s | Max: 36m 23s | Hits: 99%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 45s | Avg: 10m 22s | Max: 13m 22s | Hits:  99%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  8h 11m | Avg: 10m 54s | Max: 36m 23s | Hits:  99%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 11m 44s | Avg:  5m 52s | Max:  6m 35s | Hits:  99%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 49m 39s | Avg:  9m 55s | Max: 25m 34s | Hits:  99%/8941  
      🟩 12.8               Pass: 100%/42  | Total:  7h 33m | Avg: 10m 47s | Max: 36m 23s | Hits:  99%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  5m 29s | Hits: 100%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 49m 39s | Avg:  9m 55s | Max: 25m 34s | Hits:  99%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  7h 22m | Avg: 11m 03s | Max: 36m 23s | Hits:  99%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  5m 29s | Hits: 100%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  8h 11m | Avg: 10m 55s | Max: 36m 23s | Hits:  99%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 44s | Avg:  5m 41s | Max:  6m 07s | Hits: 100%/7156  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 39s | Avg:  6m 19s | Max:  6m 31s | Hits: 100%/3578  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 37s | Avg:  5m 48s | Max:  5m 51s | Hits: 100%/3578  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 02s | Avg:  6m 01s | Max:  6m 10s | Hits: 100%/3578  
      🟩 Clang18            Pass: 100%/2   | Total: 12m 11s | Avg:  6m 05s | Max:  6m 14s | Hits: 100%/3578  
      🟩 Clang19            Pass: 100%/7   | Total: 46m 50s | Avg:  6m 41s | Max: 10m 16s | Hits: 100%/12523 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 04s | Avg:  6m 32s | Max:  6m 52s | Hits:  99%/3580  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 45s | Avg:  6m 45s | Max:  6m 45s | Hits:  99%/1790  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 07s | Avg:  7m 03s | Max:  7m 05s | Hits:  99%/3580  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 02s | Avg:  7m 01s | Max:  7m 02s | Hits:  99%/3580  
      🟩 GCC11              Pass: 100%/2   | Total: 15m 03s | Avg:  7m 31s | Max:  8m 04s | Hits:  99%/3580  
      🟩 GCC12              Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max:  7m 53s | Hits:  99%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 30m | Avg:  9m 03s | Max: 13m 22s | Hits:  99%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 34s | Avg: 27m 47s | Max: 30m 00s | Hits:  99%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 40m | Avg: 33m 21s | Max: 36m 23s | Hits:  99%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 17s | Max: 30m 41s | Hits:  99%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 58m | Avg:  6m 12s | Max: 10m 16s | Hits: 100%/33991 
      🟩 GCC                Pass: 100%/21  | Total:  2h 48m | Avg:  8m 01s | Max: 13m 22s | Hits:  99%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 35m | Avg: 31m 07s | Max: 36m 23s | Hits:  99%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 17s | Max: 30m 41s | Hits:  99%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 34s | Avg:  8m 47s | Max: 11m 59s | Hits:  99%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total:  5h 39m | Avg:  9m 41s | Max: 30m 41s | Hits:  99%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 26m | Avg: 14m 37s | Max: 36m 23s | Hits:  99%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  6h 40m | Avg: 10m 00s | Max: 34m 01s | Hits:  99%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 55s | Avg: 17m 58s | Max: 36m 23s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total: 48m 40s | Avg: 12m 10s | Max: 13m 22s | Hits:  99%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 17m 34s | Avg:  8m 47s | Max: 11m 59s | Hits:  99%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 05s | Avg:  8m 05s | Max:  8m 05s | Hits:  99%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 46m | Avg: 10m 46s | Max: 30m 41s | Hits:  99%/37560 
      🟩 20                 Pass: 100%/24  | Total:  4h 15m | Avg: 10m 39s | Max: 36m 23s | Hits:  99%/42934 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 7h 39m | Avg: 10m 12s | Max: 32m 47s | Hits: 94%/126487

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  7h 30m | Avg: 10m 28s | Max: 32m 47s | Hits:  94%/119934
      🟩 arm64              Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  4m 32s | Hits:  98%/6553  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 11s | Avg:  9m 26s | Max: 25m 15s | Hits:  96%/16011 
      🟩 12.8               Pass: 100%/40  | Total:  6h 52m | Avg: 10m 18s | Max: 32m 47s | Hits:  94%/110476
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 48m 59s | Avg: 24m 29s | Max: 25m 54s | Hits:  26%/6515  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 11s | Avg:  9m 26s | Max: 25m 15s | Hits:  96%/16011 
      🟩 nvcc12.8           Pass: 100%/38  | Total:  6h 03m | Avg:  9m 33s | Max: 32m 47s | Hits:  98%/103961
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 59s | Avg: 24m 29s | Max: 25m 54s | Hits:  26%/6515  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 50m | Avg:  9m 32s | Max: 32m 47s | Hits:  98%/119972
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 21s | Avg:  5m 50s | Max:  9m 24s | Hits:  97%/12986 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 15s | Avg:  5m 07s | Max:  5m 29s | Hits:  99%/6511  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 50s | Avg:  4m 55s | Max:  4m 55s | Hits:  98%/6511  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 09s | Hits:  99%/6511  
      🟩 Clang18            Pass: 100%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 01s | Hits:  99%/6511  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 25m | Avg: 14m 10s | Max: 25m 54s | Hits:  70%/16302 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 49s | Hits:  99%/6445  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s | Hits:  99%/3233  
      🟩 GCC9               Pass: 100%/2   | Total:  8m 48s | Avg:  4m 24s | Max:  4m 28s | Hits:  99%/6457  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 18s | Avg:  4m 39s | Max:  4m 48s | Hits:  98%/6513  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 40s | Avg:  4m 50s | Max:  5m 05s | Hits:  99%/6509  
      🟩 GCC12              Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  4m 58s | Hits:  99%/6513  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 53m | Avg: 11m 18s | Max: 23m 08s | Hits:  98%/16548 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 50s | Avg: 27m 55s | Max: 30m 35s | Hits:  97%/6189  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 15s | Max: 32m 47s | Hits:  98%/6247  
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 28m 26s | Avg: 14m 13s | Max: 17m 05s | Hits:  94%/6501  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 28m | Avg:  8m 15s | Max: 25m 54s | Hits:  90%/55332 
      🟩 GCC                Pass: 100%/21  | Total:  2h 44m | Avg:  7m 48s | Max: 23m 08s | Hits:  98%/52218 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 58m | Avg: 29m 35s | Max: 32m 47s | Hits:  98%/12436 
      🟩 NVHPC              Pass: 100%/2   | Total: 28m 26s | Avg: 14m 13s | Max: 17m 05s | Hits:  94%/6501  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 06s | Avg:  9m 03s | Max: 13m 13s | Hits:  99%/3359  
      🟩 rtx2080            Pass: 100%/43  | Total:  7h 21m | Avg: 10m 16s | Max: 32m 47s | Hits:  94%/123128
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  5h 59m | Avg:  9m 13s | Max: 32m 47s | Hits:  94%/126447
      🟩 NVRTC              Pass: 100%/2   | Total: 41m 07s | Avg: 20m 33s | Max: 23m 08s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 56m 50s | Avg: 18m 56s | Max: 21m 50s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 41m 07s | Avg: 20m 33s | Max: 23m 08s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 18m 06s | Avg:  9m 03s | Max: 13m 13s | Hits:  99%/3359  
      🟩 90;90a;100         Pass: 100%/1   | Total: 14m 37s | Avg: 14m 37s | Max: 14m 37s | Hits:  97%/3359  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  3h 41m | Avg: 10m 04s | Max: 32m 47s | Hits:  95%/67483 
      🟩 20                 Pass: 100%/22  | Total:  3h 55m | Avg: 10m 43s | Max: 29m 44s | Hits:  94%/59004 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 51m | Avg: 6m 36s | Max: 31m 42s | Hits: 99%/14642

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 39m | Avg:  7m 13s | Max: 31m 42s | Hits:  99%/12298 
      🟩 arm64              Pass: 100%/4   | Total: 12m 57s | Avg:  3m 14s | Max:  3m 29s | Hits:  99%/2344  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 16m 55s | Avg:  5m 38s | Max: 10m 36s | Hits:  98%/1463  
      🟩 12.8               Pass: 100%/23  | Total:  2h 35m | Avg:  6m 44s | Max: 31m 42s | Hits:  99%/13179 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 16m 55s | Avg:  5m 38s | Max: 10m 36s | Hits:  98%/1463  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 35m | Avg:  6m 44s | Max: 31m 42s | Hits:  99%/13179 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 51m | Avg:  6m 36s | Max: 31m 42s | Hits:  99%/14642 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 32s | Avg:  3m 16s | Max:  3m 28s | Hits: 100%/1176  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s | Hits: 100%/586   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s | Hits: 100%/586   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits: 100%/586   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits: 100%/586   
      🟩 Clang19            Pass: 100%/4   | Total: 35m 05s | Avg:  8m 46s | Max: 25m 29s | Hits: 100%/2344  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 50s | Hits:  99%/1176  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s | Hits:  99%/586   
      🟩 GCC12              Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s | Hits:  99%/586   
      🟩 GCC13              Pass: 100%/8   | Total:  1h 01m | Avg:  7m 39s | Max: 31m 42s | Hits:  99%/4688  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 36s | Avg: 10m 36s | Max: 10m 36s | Hits:  95%/287   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 12m 39s | Avg: 12m 39s | Max: 12m 39s | Hits:  95%/287   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 17m 17s | Avg:  8m 38s | Max:  8m 45s | Hits:  97%/1168  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 55m 32s | Avg:  5m 33s | Max: 25m 29s | Hits: 100%/5864  
      🟩 GCC                Pass: 100%/12  | Total:  1h 15m | Avg:  6m 19s | Max: 31m 42s | Hits:  99%/7036  
      🟩 MSVC               Pass: 100%/2   | Total: 23m 15s | Avg: 11m 37s | Max: 12m 39s | Hits:  95%/574   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 17s | Avg:  8m 38s | Max:  8m 45s | Hits:  97%/1168  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  7m 55s | Hits:  99%/1172  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 40m | Avg:  6m 41s | Max: 31m 42s | Hits:  99%/13470 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  1h 46m | Avg:  4m 38s | Max: 12m 39s | Hits:  99%/12884 
      🟩 Test               Pass: 100%/3   | Total:  1h 05m | Avg: 21m 42s | Max: 31m 42s | Hits:  99%/1758  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 14m 54s | Avg:  4m 58s | Max:  7m 55s | Hits:  99%/1758  
      🟩 90a                Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s | Hits:  99%/586   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 43s | Avg:  4m 40s | Max:  8m 45s | Hits:  99%/2342  
      🟩 20                 Pass: 100%/22  | Total:  2h 33m | Avg:  6m 57s | Max: 31m 42s | Hits:  99%/12300 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 18m 57s | Avg: 4m 44s | Max: 5m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 47s | Avg:  5m 23s | Max:  5m 33s
      🟩 arm64              Pass: 100%/2   | Total:  8m 10s | Avg:  4m 05s | Max:  4m 18s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 33s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  9m 06s | Avg:  4m 33s | Max:  5m 14s
      🟩 20                 Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  5m 33s
    
  • 🟩 python: Pass: 100%/3 | Total: 28m 22s | Avg: 9m 27s | Max: 17m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 28m 22s | Avg:  9m 27s | Max: 17m 11s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 17m 11s | Avg: 17m 11s | Max: 17m 11s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  7m 40s | Avg:  7m 40s | Max:  7m 40s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits: 98%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 34m 34s | Avg: 17m 17s | Max: 32m 21s | Hits:  98%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 13s | Avg:  2m 13s | Max:  2m 13s | Hits:  98%/163   
      🟩 Test               Pass: 100%/1   | Total: 32m 21s | Avg: 32m 21s | Max: 32m 21s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 174)

# Runner
123 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
10 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Contributor

@fbusato fbusato left a comment

looks great!! one more thing that can be useful.
We could check that the pointers live in the right memory space, i.e. shared/global

@davebayer
Contributor Author

looks great!! one more thing that can be useful. We could check that the pointers live in the right memory space, i.e. shared/global

It is handled inside the cuda::device::memcpy_async_tx function, and I don't think we can do that in cuda::memcpy_async, because that one supports copying global->shared and shared->global as well as copying on the host.
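
For readers following the exchange above, here is a rough sketch of what an alignment check and a memory-space check could look like side by side. It is not the code from this PR or from libcu++: the names `SKETCH_HD`, `sketch_is_aligned` and `sketch_is_shared_or_global` are made up for illustration. The only CUDA facilities assumed are the device intrinsics `__isShared` and `__isGlobal`, which are guarded so that the same function still works when it runs on the host. Whether "shared or global" is even the right condition for cuda::memcpy_async is the open question in this thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Expands to nothing in plain C++ so the sketch also builds without nvcc.
#if defined(__CUDACC__)
#  define SKETCH_HD __host__ __device__
#else
#  define SKETCH_HD
#endif

// Hypothetical helper: true if `ptr` is aligned to `alignment` bytes.
// Assumes `alignment` is a power of two.
SKETCH_HD inline bool sketch_is_aligned(const void* ptr, std::size_t alignment)
{
  return (reinterpret_cast<std::uintptr_t>(ptr) & (alignment - 1)) == 0;
}

// Hypothetical helper: in device code, true if `ptr` points to shared or
// global memory. On the host there is no memory space to query, so the
// check is skipped and the function returns true.
SKETCH_HD inline bool sketch_is_shared_or_global(const void* ptr)
{
#if defined(__CUDA_ARCH__)
  return __isShared(ptr) || __isGlobal(ptr);
#else
  (void) ptr; // unused on the host path
  return true;
#endif
}
```

A caller could wrap both helpers in `assert(...)` ahead of the copy. The host branch shows why a single unconditional memory-space assert is awkward for a function that is also callable on the host.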

@bernhardmgruber
Contributor

/ok to test 1590854

Contributor

🟩 CI finished in 5h 34m: Pass: 100%/183 | Total: 1d 09h | Avg: 10m 50s | Max: 44m 33s | Hits: 95%/290820
  • 🟩 cub: Pass: 100%/47 | Total: 10h 44m | Avg: 13m 42s | Max: 37m 24s | Hits: 99%/57406

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 10h 29m | Avg: 13m 59s | Max: 37m 24s | Hits:  99%/54908 
      🟩 arm64              Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max:  8m 33s | Hits:  99%/2498  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 56m 37s | Avg: 11m 19s | Max: 25m 58s | Hits:  99%/6062  
      🟩 12.9               Pass: 100%/42  | Total:  9h 47m | Avg: 13m 59s | Max: 37m 24s | Hits:  99%/51344 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  5m 55s | Hits:  99%/2151  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 56m 37s | Avg: 11m 19s | Max: 25m 58s | Hits:  99%/6062  
      🟩 nvcc12.9           Pass: 100%/40  | Total:  9h 36m | Avg: 14m 24s | Max: 37m 24s | Hits:  99%/49193 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  5m 55s | Hits:  99%/2151  
      🟩 nvcc               Pass: 100%/45  | Total: 10h 32m | Avg: 14m 03s | Max: 37m 24s | Hits:  99%/55255 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 28m 05s | Avg:  7m 01s | Max:  7m 17s | Hits:  99%/4998  
      🟩 Clang15            Pass: 100%/2   | Total: 14m 42s | Avg:  7m 21s | Max:  7m 45s | Hits:  99%/2495  
      🟩 Clang16            Pass: 100%/2   | Total: 14m 34s | Avg:  7m 17s | Max:  7m 20s | Hits:  99%/2495  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 37s | Avg:  7m 18s | Max:  7m 23s | Hits:  99%/2495  
      🟩 Clang18            Pass: 100%/2   | Total: 14m 36s | Avg:  7m 18s | Max:  7m 24s | Hits:  99%/2495  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 32m | Avg: 13m 13s | Max: 32m 09s | Hits:  99%/8390  
      🟩 GCC7               Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max:  8m 44s | Hits:  99%/2498  
      🟩 GCC8               Pass: 100%/1   | Total:  9m 09s | Avg:  9m 09s | Max:  9m 09s | Hits:  99%/1249  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 52s | Avg:  8m 56s | Max:  9m 12s | Hits:  99%/2498  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max:  9m 24s | Hits:  99%/2499  
      🟩 GCC11              Pass: 100%/2   | Total: 18m 06s | Avg:  9m 03s | Max:  9m 06s | Hits:  99%/2495  
      🟩 GCC12              Pass: 100%/2   | Total: 19m 39s | Avg:  9m 49s | Max: 10m 02s | Hits:  99%/2495  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 47m | Avg: 20m 40s | Max: 37m 24s | Hits:  99%/13747 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 53m 54s | Avg: 26m 57s | Max: 27m 56s | Hits:  99%/2130  
      🟩 MSVC14.43          Pass: 100%/2   | Total: 55m 26s | Avg: 27m 43s | Max: 28m 19s | Hits:  99%/2130  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 08s | Hits:  98%/2297  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 59m | Avg:  9m 25s | Max: 32m 09s | Hits:  99%/23368 
      🟩 GCC                Pass: 100%/22  | Total:  5h 27m | Avg: 14m 54s | Max: 37m 24s | Hits:  99%/27481 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 49m | Avg: 27m 20s | Max: 28m 19s | Hits:  99%/4260  
      🟩 NVHPC              Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 08s | Hits:  98%/2297  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 02m | Avg: 20m 48s | Max: 30m 56s | Hits:  99%/3750  
      🟩 rtx2080            Pass: 100%/36  | Total:  6h 17m | Avg: 10m 28s | Max: 28m 19s | Hits:  99%/43662 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 24m | Avg: 25m 34s | Max: 37m 24s | Hits:  99%/9994  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 41m | Avg: 10m 18s | Max: 28m 19s | Hits:  99%/47410 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 35m 09s | Avg: 35m 09s | Max: 35m 09s | Hits:  99%/1250  
      🟩 GraphCapture       Pass: 100%/1   | Total: 26m 35s | Avg: 26m 35s | Max: 26m 35s | Hits:  99%/1250  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 40m | Avg: 33m 29s | Max: 37m 24s | Hits:  99%/3748  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 20m | Avg: 26m 46s | Max: 28m 54s | Hits:  99%/3748  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 02m | Avg: 20m 48s | Max: 30m 56s | Hits:  99%/3750  
      🟩 90;90a;100         Pass: 100%/1   | Total:  9m 32s | Avg:  9m 32s | Max:  9m 32s | Hits:  99%/1250  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 53m | Avg: 11m 08s | Max: 27m 56s | Hits:  99%/25386 
      🟩 20                 Pass: 100%/26  | Total:  6h 50m | Avg: 15m 47s | Max: 37m 24s | Hits:  99%/32020 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 8h 30m | Avg: 10m 51s | Max: 31m 28s | Hits: 99%/89613

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 25m 16s | Avg: 12m 38s | Max: 17m 29s | Hits:  99%/3816  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  8h 17m | Avg: 11m 03s | Max: 31m 28s | Hits:  99%/85798 
      🟩 arm64              Pass: 100%/2   | Total: 12m 52s | Avg:  6m 26s | Max:  7m 08s | Hits:  99%/3815  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 52m 21s | Avg: 10m 28s | Max: 26m 37s | Hits:  99%/9530  
      🟩 12.9               Pass: 100%/42  | Total:  7h 38m | Avg: 10m 54s | Max: 31m 28s | Hits:  99%/80083 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 41s | Avg:  5m 50s | Max:  5m 56s | Hits: 100%/3814  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 52m 21s | Avg: 10m 28s | Max: 26m 37s | Hits:  99%/9530  
      🟩 nvcc12.9           Pass: 100%/40  | Total:  7h 26m | Avg: 11m 09s | Max: 31m 28s | Hits:  99%/76269 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 41s | Avg:  5m 50s | Max:  5m 56s | Hits: 100%/3814  
      🟩 nvcc               Pass: 100%/45  | Total:  8h 18m | Avg: 11m 05s | Max: 31m 28s | Hits:  99%/85799 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 24m 33s | Avg:  6m 08s | Max:  6m 53s | Hits: 100%/7628  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 21s | Hits: 100%/3814  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 11s | Avg:  6m 35s | Max:  6m 41s | Hits: 100%/3814  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 16s | Avg:  6m 38s | Max:  6m 46s | Hits: 100%/3814  
      🟩 Clang18            Pass: 100%/2   | Total: 12m 21s | Avg:  6m 10s | Max:  6m 22s | Hits: 100%/3814  
      🟩 Clang19            Pass: 100%/7   | Total: 49m 39s | Avg:  7m 05s | Max: 11m 04s | Hits: 100%/13349 
      🟩 GCC7               Pass: 100%/2   | Total: 14m 41s | Avg:  7m 20s | Max:  7m 45s | Hits:  99%/3816  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 05s | Avg:  7m 05s | Max:  7m 05s | Hits:  99%/1908  
      🟩 GCC9               Pass: 100%/2   | Total: 15m 08s | Avg:  7m 34s | Max:  7m 44s | Hits:  99%/3816  
      🟩 GCC10              Pass: 100%/2   | Total: 15m 27s | Avg:  7m 43s | Max:  7m 44s | Hits:  99%/3816  
      🟩 GCC11              Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max:  7m 30s | Hits:  99%/3816  
      🟩 GCC12              Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max:  8m 21s | Hits:  99%/3816  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 41m | Avg: 10m 08s | Max: 17m 29s | Hits:  99%/19080 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 54m 45s | Avg: 27m 22s | Max: 28m 08s | Hits:  99%/3800  
      🟩 MSVC14.43          Pass: 100%/3   | Total:  1h 25m | Avg: 28m 39s | Max: 31m 28s | Hits:  99%/5700  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 59m 14s | Avg: 29m 37s | Max: 29m 44s | Hits:  99%/3812  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 05m | Avg:  6m 36s | Max: 11m 04s | Hits: 100%/36233 
      🟩 GCC                Pass: 100%/21  | Total:  3h 04m | Avg:  8m 48s | Max: 17m 29s | Hits:  99%/40068 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 20m | Avg: 28m 08s | Max: 31m 28s | Hits:  99%/9500  
      🟩 NVHPC              Pass: 100%/2   | Total: 59m 14s | Avg: 29m 37s | Max: 29m 44s | Hits:  99%/3812  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 19m 25s | Avg:  9m 42s | Max: 13m 20s | Hits:  99%/3816  
      🟩 rtx2080            Pass: 100%/35  | Total:  5h 46m | Avg:  9m 54s | Max: 29m 44s | Hits:  99%/66736 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 24m | Avg: 14m 26s | Max: 31m 28s | Hits:  99%/19061 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  6h 42m | Avg: 10m 03s | Max: 29m 44s | Hits:  99%/76267 
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 44s | Avg: 16m 34s | Max: 31m 28s | Hits:  99%/5715  
      🟩 TestGPU            Pass: 100%/4   | Total: 58m 11s | Avg: 14m 32s | Max: 17m 29s | Hits:  99%/7631  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 19m 25s | Avg:  9m 42s | Max: 13m 20s | Hits:  99%/3816  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 54s | Avg:  7m 54s | Max:  7m 54s | Hits:  99%/1908  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 49m | Avg: 10m 56s | Max: 29m 44s | Hits:  99%/40034 
      🟩 20                 Pass: 100%/24  | Total:  4h 15m | Avg: 10m 38s | Max: 31m 28s | Hits:  99%/45763 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 8h 47m | Avg: 11m 42s | Max: 44m 33s | Hits: 90%/128701

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 37m | Avg: 12m 01s | Max: 44m 33s | Hits:  90%/122036
      🟩 arm64              Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 06s | Hits:  98%/6665  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 54m 22s | Avg: 10m 52s | Max: 27m 13s | Hits:  94%/16299 
      🟩 12.9               Pass: 100%/40  | Total:  7h 52m | Avg: 11m 49s | Max: 44m 33s | Hits:  90%/112402
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 50m 44s | Avg: 25m 22s | Max: 26m 04s | Hits:  26%/6629  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 54m 22s | Avg: 10m 52s | Max: 27m 13s | Hits:  94%/16299 
      🟩 nvcc12.9           Pass: 100%/38  | Total:  7h 02m | Avg: 11m 06s | Max: 44m 33s | Hits:  94%/105773
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 44s | Avg: 25m 22s | Max: 26m 04s | Hits:  26%/6629  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 56m | Avg: 11m 04s | Max: 44m 33s | Hits:  94%/122072
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 35m 22s | Avg:  8m 50s | Max: 12m 41s | Hits:  89%/13214 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  5m 15s | Hits:  98%/6625  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  5m 48s | Hits:  98%/6625  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 18s | Hits:  98%/6625  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 09s | Avg:  5m 34s | Max:  5m 46s | Hits:  98%/6625  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 18m | Avg: 13m 01s | Max: 26m 04s | Hits:  69%/16586 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 28s | Hits:  98%/6561  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s | Hits:  98%/3291  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 25s | Avg:  5m 12s | Max:  5m 21s | Hits:  98%/6573  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 32s | Avg:  5m 16s | Max:  5m 33s | Hits:  98%/6627  
      🟩 GCC11              Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max: 13m 44s | Hits:  89%/6623  
      🟩 GCC12              Pass: 100%/2   | Total: 19m 39s | Avg:  9m 49s | Max: 14m 38s | Hits:  89%/6627  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 58m | Avg: 11m 49s | Max: 24m 58s | Hits:  97%/16830 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 56m 47s | Avg: 28m 23s | Max: 29m 34s | Hits:  98%/6301  
      🟩 MSVC14.43          Pass: 100%/2   | Total: 56m 40s | Avg: 28m 20s | Max: 28m 42s | Hits:  98%/6353  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 43s | Max: 44m 33s | Hits:  57%/6615  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 37m | Avg:  8m 43s | Max: 26m 04s | Hits:  88%/56300 
      🟩 GCC                Pass: 100%/21  | Total:  3h 13m | Avg:  9m 12s | Max: 24m 58s | Hits:  96%/53132 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 53m | Avg: 28m 21s | Max: 29m 34s | Hits:  98%/12654 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 43s | Max: 44m 33s | Hits:  57%/6615  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 17s | Avg: 11m 38s | Max: 14m 48s | Hits:  96%/3415  
      🟩 rtx2080            Pass: 100%/43  | Total:  8h 23m | Avg: 11m 43s | Max: 44m 33s | Hits:  90%/125286
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  7h 20m | Avg: 11m 17s | Max: 44m 33s | Hits:  90%/128661
      🟩 NVRTC              Pass: 100%/2   | Total: 47m 03s | Avg: 23m 31s | Max: 24m 58s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 37m 11s | Avg: 12m 23s | Max: 14m 48s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 28s | Avg:  2m 28s | Max:  2m 28s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 47m 03s | Avg: 23m 31s | Max: 24m 58s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 23m 17s | Avg: 11m 38s | Max: 14m 48s | Hits:  96%/3415  
      🟩 90;90a;100         Pass: 100%/1   | Total: 18m 21s | Avg: 18m 21s | Max: 18m 21s | Hits:  96%/3415  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  4h 23m | Avg: 11m 58s | Max: 44m 33s | Hits:  91%/68693 
      🟩 20                 Pass: 100%/22  | Total:  4h 21m | Avg: 11m 52s | Max: 28m 42s | Hits:  89%/60008 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 25m | Avg: 5m 35s | Max: 12m 02s | Hits: 99%/14772

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 10m | Avg:  5m 56s | Max: 12m 02s | Hits:  99%/12408 
      🟩 arm64              Pass: 100%/4   | Total: 14m 37s | Avg:  3m 39s | Max:  3m 57s | Hits:  99%/2364  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 17m 24s | Avg:  5m 48s | Max: 10m 06s | Hits:  98%/1478  
      🟩 12.9               Pass: 100%/23  | Total:  2h 07m | Avg:  5m 33s | Max: 12m 02s | Hits:  99%/13294 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 17m 24s | Avg:  5m 48s | Max: 10m 06s | Hits:  98%/1478  
      🟩 nvcc12.9           Pass: 100%/23  | Total:  2h 07m | Avg:  5m 33s | Max: 12m 02s | Hits:  99%/13294 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 25m | Avg:  5m 35s | Max: 12m 02s | Hits:  99%/14772 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  4m 04s | Hits: 100%/1186  
      🟩 Clang15            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s | Hits: 100%/591   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s | Hits: 100%/591   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s | Hits: 100%/591   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 58s | Avg:  3m 58s | Max:  3m 58s | Hits: 100%/591   
      🟩 Clang19            Pass: 100%/4   | Total: 20m 00s | Avg:  5m 00s | Max:  9m 19s | Hits: 100%/2364  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 51s | Avg:  3m 55s | Max:  4m 04s | Hits:  99%/1186  
      🟩 GCC11              Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s | Hits:  99%/591   
      🟩 GCC12              Pass: 100%/1   | Total:  4m 24s | Avg:  4m 24s | Max:  4m 24s | Hits:  99%/591   
      🟩 GCC13              Pass: 100%/8   | Total: 45m 33s | Avg:  5m 41s | Max: 11m 47s | Hits:  99%/4728  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 06s | Avg: 10m 06s | Max: 10m 06s | Hits:  95%/292   
      🟩 MSVC14.43          Pass: 100%/1   | Total: 12m 02s | Avg: 12m 02s | Max: 12m 02s | Hits:  95%/292   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 17m 54s | Avg:  8m 57s | Max:  9m 07s | Hits:  97%/1178  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 43m 26s | Avg:  4m 20s | Max:  9m 19s | Hits: 100%/5914  
      🟩 GCC                Pass: 100%/12  | Total:  1h 01m | Avg:  5m 09s | Max: 11m 47s | Hits:  99%/7096  
      🟩 MSVC               Pass: 100%/2   | Total: 22m 08s | Avg: 11m 04s | Max: 12m 02s | Hits:  95%/584   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 54s | Avg:  8m 57s | Max:  9m 07s | Hits:  97%/1178  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 14m 01s | Avg:  7m 00s | Max: 10m 13s | Hits:  99%/1182  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 11m | Avg:  5m 28s | Max: 12m 02s | Hits:  99%/13590 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  1h 54m | Avg:  4m 57s | Max: 12m 02s | Hits:  99%/12999 
      🟩 Test               Pass: 100%/3   | Total: 31m 19s | Avg: 10m 26s | Max: 11m 47s | Hits:  99%/1773  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 17m 41s | Avg:  5m 53s | Max: 10m 13s | Hits:  99%/1773  
      🟩 90a                Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s | Hits:  99%/591   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 20m 05s | Avg:  5m 01s | Max:  9m 07s | Hits:  99%/2362  
      🟩 20                 Pass: 100%/22  | Total:  2h 05m | Avg:  5m 41s | Max: 12m 02s | Hits:  99%/12410 
    
  • 🟩 python: Pass: 100%/12 | Total: 1h 58m | Avg: 9m 54s | Max: 22m 45s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 ctk
      🟩 12.9               Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  1h 58m | Avg:  9m 54s | Max: 22m 45s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  7m 10s | Avg:  3m 35s | Max:  3m 37s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  7m 09s | Avg:  3m 34s | Max:  3m 44s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 17m 20s | Avg:  8m 40s | Max:  8m 40s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total: 12m 37s | Avg:  6m 18s | Max:  6m 55s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 40m 19s | Avg: 20m 09s | Max: 22m 45s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 34m 20s | Avg: 17m 10s | Max: 17m 14s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total:  1h 01m | Avg: 10m 11s | Max: 22m 45s
      🟩 3.13               Pass: 100%/6   | Total: 57m 44s | Avg:  9m 37s | Max: 17m 34s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 20m 56s | Avg: 5m 14s | Max: 6m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 05s | Avg:  6m 02s | Max:  6m 26s
      🟩 arm64              Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 30s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 20m 56s | Avg:  5m 14s | Max:  6m 26s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 39s
      🟩 20                 Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  6m 26s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 12s | Avg: 8m 36s | Max: 14m 29s | Hits: 98%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 ctk
      🟩 12.9               Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max: 14m 29s | Hits:  98%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 43s | Avg:  2m 43s | Max:  2m 43s | Hits:  98%/164   
      🟩 Test               Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s | Hits:  98%/164   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 183)

# Runner
129 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
7 linux-amd64-gpu-rtx2080-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco miscco requested a review from fbusato May 26, 2025 12:57
@github-project-automation github-project-automation bot moved this from In Progress to In Review in CCCL May 27, 2025
@miscco miscco merged commit b4f3405 into NVIDIA:main May 28, 2025
193 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL May 28, 2025