Skip to content

Split Optimize Warp Reduce PR - CUB part #4716

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 12 commits into from
May 20, 2025

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented May 15, 2025

Description

Split #4312.
List of changes:

  • Remove invoke_result_t and use cuda::std::invoke_result_t instead.
  • Simplify array_utils.cuh namespaces.
  • Move unsafe_bitcast() into an independent file.
  • Use more precise names for identification type traits for cuda::std:: operators
  • Add short2/half[2]/bfloat[2] identification type traits

@fbusato fbusato self-assigned this May 15, 2025
@fbusato fbusato requested a review from a team as a code owner May 15, 2025 23:03
@fbusato fbusato added this to CCCL May 15, 2025
@fbusato fbusato added the 3.1.0 Targeted for 3.1 release label May 15, 2025
@fbusato fbusato requested a review from alliepiper May 15, 2025 23:03
@github-project-automation github-project-automation bot moved this to Todo in CCCL May 15, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL May 15, 2025
@fbusato fbusato added the CI 100% All CI tests passed label May 16, 2025
Copy link
Contributor

🟩 CI finished in 1h 58m: Pass: 100%/138 | Total: 3d 02h | Avg: 32m 36s | Max: 1h 22m | Hits: 63%/156215
  • 🟩 cub: Pass: 100%/47 | Total: 1d 18h | Avg: 54m 20s | Max: 1h 22m | Hits: 34%/57173

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 16h | Avg: 54m 18s | Max:  1h 22m | Hits:  35%/54687 
      🟩 arm64              Pass: 100%/2   | Total:  1h 50m | Avg: 55m 16s | Max: 57m 52s | Hits:  21%/2486  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 31m | Avg:  1h 06m | Max:  1h 20m | Hits:  21%/6041  
      🟩 12.8               Pass: 100%/42  | Total:  1d 13h | Avg: 52m 55s | Max:  1h 22m | Hits:  36%/51132 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 09m | Avg: 34m 52s | Max: 35m 04s | Hits:  20%/2142  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 31m | Avg:  1h 06m | Max:  1h 20m | Hits:  21%/6041  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 11h | Avg: 53m 49s | Max:  1h 22m | Hits:  37%/48990 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 52s | Max: 35m 04s | Hits:  20%/2142  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 17h | Avg: 55m 12s | Max:  1h 22m | Hits:  35%/55031 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 44m | Avg: 56m 12s | Max: 58m 01s | Hits:  21%/4980  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 54m | Avg: 57m 03s | Max: 58m 19s | Hits:  21%/2486  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 49m | Avg: 54m 33s | Max: 55m 29s | Hits:  21%/2486  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 51s | Max: 58m 12s | Hits:  21%/2486  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 47m | Avg: 53m 36s | Max: 54m 12s | Hits:  21%/2486  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 40m | Avg: 40m 04s | Max: 54m 04s | Hits:  44%/8357  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 07m | Hits:  21%/2490  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m | Hits:  21%/1245  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 09m | Hits:  21%/2490  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  21%/2490  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 18s | Max: 59m 29s | Hits:  21%/2486  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits:  21%/2486  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 20m | Avg: 40m 05s | Max:  1h 01m | Hits:  64%/13673 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 21m | Hits:  19%/2122  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 22m | Hits:  19%/2122  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 11m | Hits:  18%/2288  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 15h 49m | Avg: 49m 58s | Max: 58m 19s | Hits:  30%/23281 
      🟩 GCC                Pass: 100%/22  | Total: 18h 58m | Avg: 51m 46s | Max:  1h 09m | Hits:  42%/27360 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 24m | Avg:  1h 21m | Max:  1h 22m | Hits:  19%/4244  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 11m | Hits:  18%/2288  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 23m | Avg: 27m 59s | Max: 30m 48s | Hits:  73%/3729  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 12h | Avg:  1h 00m | Max:  1h 22m | Hits:  21%/43500 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 43m | Avg: 35m 25s | Max:  1h 01m | Hits:  80%/9944  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 14h | Avg: 59m 48s | Max:  1h 22m | Hits:  21%/47229 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 32m 13s | Avg: 32m 13s | Max: 32m 13s | Hits:  99%/1243  
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 54s | Avg: 24m 54s | Max: 24m 54s | Hits:  99%/1243  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 27m | Avg: 29m 18s | Max: 31m 51s | Hits:  99%/3729  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 16m | Avg: 25m 34s | Max: 29m 01s | Hits:  99%/3729  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 23m | Avg: 27m 59s | Max: 30m 48s | Hits:  73%/3729  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  21%/1243  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 21h 53m | Avg:  1h 02m | Max:  1h 21m | Hits:  21%/25302 
      🟩 20                 Pass: 100%/26  | Total: 20h 40m | Avg: 47m 42s | Max:  1h 22m | Hits:  45%/31871 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 02h | Avg: 34m 22s | Max: 1h 14m | Hits: 78%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 43m 07s | Avg: 21m 33s | Max: 29m 43s | Hits:  88%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 01h | Avg: 34m 34s | Max:  1h 14m | Hits:  78%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 59m 49s | Avg: 29m 54s | Max: 32m 31s | Hits:  76%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 19m | Avg: 39m 54s | Max:  1h 02m | Hits:  74%/8941  
      🟩 12.8               Pass: 100%/42  | Total: 23h 36m | Avg: 33m 43s | Max:  1h 14m | Hits:  79%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 54m 01s | Avg: 27m 00s | Max: 27m 15s | Hits:  76%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 19m | Avg: 39m 54s | Max:  1h 02m | Hits:  74%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total: 22h 42m | Avg: 34m 03s | Max:  1h 14m | Hits:  79%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 54m 01s | Avg: 27m 00s | Max: 27m 15s | Hits:  76%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 02h | Avg: 34m 42s | Max:  1h 14m | Hits:  78%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 51s | Max: 34m 44s | Hits:  76%/7156  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 32s | Max: 32m 33s | Hits:  76%/3578  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 05s | Max: 31m 59s | Hits:  76%/3578  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 48s | Max: 36m 06s | Hits:  76%/3578  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 33m 20s | Hits:  76%/3578  
      🟩 Clang19            Pass: 100%/7   | Total:  2h 41m | Avg: 23m 06s | Max: 32m 13s | Hits:  83%/12523 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 44s | Max: 34m 33s | Hits:  76%/3580  
      🟩 GCC8               Pass: 100%/1   | Total: 32m 41s | Avg: 32m 41s | Max: 32m 41s | Hits:  76%/1790  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 13m | Avg: 36m 36s | Max: 37m 55s | Hits:  76%/3580  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 34s | Max: 37m 38s | Hits:  76%/3580  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 13s | Max: 35m 56s | Hits:  76%/3580  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 16m | Avg: 38m 07s | Max: 38m 26s | Hits:  76%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 00m | Avg: 24m 01s | Max: 36m 51s | Hits:  86%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 02m | Hits:  64%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  3h 06m | Avg:  1h 02m | Max:  1h 14m | Hits:  75%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 03m | Hits:  64%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 06m | Avg: 28m 47s | Max: 36m 06s | Hits:  79%/33991 
      🟩 GCC                Pass: 100%/21  | Total: 10h 31m | Avg: 30m 04s | Max: 38m 26s | Hits:  81%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  5h 11m | Avg:  1h 02m | Max:  1h 14m | Hits:  71%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 03m | Hits:  64%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 32m 16s | Avg: 16m 08s | Max: 19m 13s | Hits:  88%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total: 21h 57m | Avg: 37m 38s | Max:  1h 12m | Hits:  75%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 25m | Avg: 26m 35s | Max:  1h 14m | Hits:  89%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 01h | Avg: 37m 41s | Max:  1h 14m | Hits:  75%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 57m 15s | Avg: 19m 05s | Max: 38m 48s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total: 51m 03s | Avg: 12m 45s | Max: 13m 51s | Hits:  99%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 32m 16s | Avg: 16m 08s | Max: 19m 13s | Hits:  88%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total: 34m 40s | Avg: 34m 40s | Max: 34m 40s | Hits:  76%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 13h 59m | Avg: 39m 59s | Max:  1h 12m | Hits:  74%/37560 
      🟩 20                 Pass: 100%/24  | Total: 12h 12m | Avg: 30m 32s | Max:  1h 14m | Hits:  81%/42934 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 12m | Avg: 7m 23s | Max: 15m 07s | Hits: 91%/14642

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 50m | Avg:  7m 44s | Max: 15m 07s | Hits:  91%/12298 
      🟩 arm64              Pass: 100%/4   | Total: 21m 45s | Avg:  5m 26s | Max:  5m 56s | Hits:  90%/2344  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 27m 15s | Avg:  9m 05s | Max: 15m 07s | Hits:  88%/1463  
      🟩 12.8               Pass: 100%/23  | Total:  2h 44m | Avg:  7m 09s | Max: 15m 05s | Hits:  91%/13179 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 27m 15s | Avg:  9m 05s | Max: 15m 07s | Hits:  88%/1463  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 44m | Avg:  7m 09s | Max: 15m 05s | Hits:  91%/13179 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 12m | Avg:  7m 23s | Max: 15m 07s | Hits:  91%/14642 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  5m 58s | Hits:  90%/1176  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s | Hits:  90%/586   
      🟩 Clang16            Pass: 100%/1   | Total:  6m 09s | Avg:  6m 09s | Max:  6m 09s | Hits:  90%/586   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s | Hits:  90%/586   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s | Hits:  90%/586   
      🟩 Clang19            Pass: 100%/4   | Total: 24m 45s | Avg:  6m 11s | Max:  8m 47s | Hits:  93%/2344  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 10s | Hits:  90%/1176  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 13s | Avg:  6m 13s | Max:  6m 13s | Hits:  90%/586   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 55s | Avg:  6m 55s | Max:  6m 55s | Hits:  90%/586   
      🟩 GCC13              Pass: 100%/8   | Total: 53m 51s | Avg:  6m 43s | Max: 12m 03s | Hits:  92%/4688  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 07s | Avg: 15m 07s | Max: 15m 07s | Hits:  78%/287   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 15m 05s | Avg: 15m 05s | Max: 15m 05s | Hits:  78%/287   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 23m 20s | Avg: 11m 40s | Max: 11m 53s | Hits:  88%/1168  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 59m 18s | Avg:  5m 55s | Max:  8m 47s | Hits:  91%/5864  
      🟩 GCC                Pass: 100%/12  | Total:  1h 19m | Avg:  6m 36s | Max: 12m 03s | Hits:  91%/7036  
      🟩 MSVC               Pass: 100%/2   | Total: 30m 12s | Avg: 15m 06s | Max: 15m 07s | Hits:  78%/574   
      🟩 NVHPC              Pass: 100%/2   | Total: 23m 20s | Avg: 11m 40s | Max: 11m 53s | Hits:  88%/1168  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 14m 07s | Avg:  7m 03s | Max:  8m 19s | Hits:  95%/1172  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 57m | Avg:  7m 24s | Max: 15m 07s | Hits:  90%/13470 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 42m | Avg:  7m 04s | Max: 15m 07s | Hits:  89%/12884 
      🟩 Test               Pass: 100%/3   | Total: 29m 09s | Avg:  9m 43s | Max: 12m 03s | Hits:  99%/1758  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 18m 53s | Avg:  6m 17s | Max:  8m 19s | Hits:  93%/1758  
      🟩 90a                Pass: 100%/1   | Total:  4m 48s | Avg:  4m 48s | Max:  4m 48s | Hits:  90%/586   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 26m 56s | Avg:  6m 44s | Max: 11m 27s | Hits:  90%/2342  
      🟩 20                 Pass: 100%/22  | Total:  2h 45m | Avg:  7m 30s | Max: 15m 07s | Hits:  91%/12300 
    
  • 🟩 python: Pass: 100%/12 | Total: 1h 40m | Avg: 8m 20s | Max: 19m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 ctk
      🟩 12.8               Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  1h 40m | Avg:  8m 20s | Max: 19m 55s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  7m 03s | Avg:  3m 31s | Max:  3m 37s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 48s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 19m 44s | Avg:  9m 52s | Max: 10m 13s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 24s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 37m 36s | Avg: 18m 48s | Max: 19m 55s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 19m 52s | Avg:  9m 56s | Max:  9m 59s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total: 51m 56s | Avg:  8m 39s | Max: 19m 55s
      🟩 3.13               Pass: 100%/6   | Total: 48m 04s | Avg:  8m 00s | Max: 17m 41s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 21m 00s | Avg: 5m 15s | Max: 6m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 18s | Avg:  6m 09s | Max:  6m 11s
      🟩 arm64              Pass: 100%/2   | Total:  8m 42s | Avg:  4m 21s | Max:  4m 22s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 11s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  6m 07s
      🟩 20                 Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  6m 11s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 16s | Avg: 8m 08s | Max: 12m 17s | Hits: 93%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max: 12m 17s | Hits:  93%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s | Hits:  87%/163   
      🟩 Test               Pass: 100%/1   | Total: 12m 17s | Avg: 12m 17s | Max: 12m 17s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 138)

# Runner
95 linux-amd64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 windows-amd64-cpu16
10 linux-arm64-cpu16
4 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@fbusato fbusato requested a review from bernhardmgruber May 19, 2025 17:13
Copy link
Contributor

🟨 CI finished in 2h 34m: Pass: 99%/138 | Total: 3d 03h | Avg: 32m 44s | Max: 1h 21m | Hits: 63%/156215
  • 🟨 python: Pass: 91%/12 | Total: 1h 35m | Avg: 7m 56s | Max: 19m 09s

    🔍 jobs: Test cuda.cccl 🔍
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  6m 49s | Avg:  3m 24s | Max:  3m 25s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  6m 52s | Avg:  3m 26s | Max:  3m 29s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 19m 42s | Avg:  9m 51s | Max: 10m 00s
      🔍 Test cuda.cccl     Pass:  50%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  4m 34s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 36m 46s | Avg: 18m 23s | Max: 19m 09s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 17m 41s | Avg:  8m 50s | Max:  8m 52s
    🔍 py_version: 3.13 🔍
      🟩 3.10               Pass: 100%/6   | Total: 47m 30s | Avg:  7m 55s | Max: 17m 37s
      🔍 3.13               Pass:  83%/6   | Total: 47m 49s | Avg:  7m 58s | Max: 19m 09s
    🟨 cpu
      🟨 amd64              Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    🟨 ctk
      🟨 12.8               Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    🟨 cudacxx
      🟨 nvcc12.8           Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    🟨 cxx
      🟨 GCC13              Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    🟨 cxx_family
      🟨 GCC                Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    🟨 gpu
      🟨 rtxa6000           Pass:  91%/12  | Total:  1h 35m | Avg:  7m 56s | Max: 19m 09s
    
  • 🟩 cub: Pass: 100%/47 | Total: 1d 19h | Avg: 55m 00s | Max: 1h 21m | Hits: 34%/57173

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 17h | Avg: 54m 57s | Max:  1h 21m | Hits:  35%/54687 
      🟩 arm64              Pass: 100%/2   | Total:  1h 52m | Avg: 56m 16s | Max: 59m 43s | Hits:  21%/2486  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 37m | Avg:  1h 07m | Max:  1h 15m | Hits:  21%/6041  
      🟩 12.8               Pass: 100%/42  | Total:  1d 13h | Avg: 53m 32s | Max:  1h 21m | Hits:  36%/51132 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 12m | Avg: 36m 05s | Max: 37m 01s | Hits:  21%/2142  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 37m | Avg:  1h 07m | Max:  1h 15m | Hits:  21%/6041  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 12h | Avg: 54m 24s | Max:  1h 21m | Hits:  37%/48990 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 12m | Avg: 36m 05s | Max: 37m 01s | Hits:  21%/2142  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 17h | Avg: 55m 51s | Max:  1h 21m | Hits:  35%/55031 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 00m | Avg:  1h 00m | Max:  1h 05m | Hits:  21%/4980  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 55m | Avg: 57m 48s | Max: 59m 04s | Hits:  21%/2486  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 23s | Max: 59m 45s | Hits:  21%/2486  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 49m | Avg: 54m 49s | Max: 55m 53s | Hits:  21%/2486  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 42s | Max: 58m 51s | Hits:  21%/2486  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 51m | Avg: 41m 40s | Max: 55m 51s | Hits:  44%/8357  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 07m | Hits:  21%/2490  
      🟩 GCC8               Pass: 100%/1   | Total: 58m 49s | Avg: 58m 49s | Max: 58m 49s | Hits:  21%/1245  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m | Hits:  21%/2490  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 44s | Max:  1h 00m | Hits:  21%/2490  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 18s | Max: 59m 41s | Hits:  21%/2486  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 06m | Hits:  21%/2486  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 36m | Avg: 41m 31s | Max:  1h 03m | Hits:  64%/13673 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m | Hits:  19%/2122  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 35m | Avg:  1h 17m | Max:  1h 18m | Hits:  19%/2122  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 21m | Hits:  18%/2288  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 16h 27m | Avg: 51m 57s | Max:  1h 05m | Hits:  30%/23281 
      🟩 GCC                Pass: 100%/22  | Total: 18h 58m | Avg: 51m 44s | Max:  1h 07m | Hits:  42%/27360 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 07m | Avg:  1h 16m | Max:  1h 18m | Hits:  19%/4244  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 21m | Hits:  18%/2288  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 26m | Avg: 28m 56s | Max: 33m 31s | Hits:  73%/3729  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 12h | Avg:  1h 01m | Max:  1h 21m | Hits:  21%/43500 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 58m | Avg: 37m 21s | Max: 59m 52s | Hits:  80%/9944  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 15h | Avg:  1h 00m | Max:  1h 21m | Hits:  21%/47229 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 31m 34s | Avg: 31m 34s | Max: 31m 34s | Hits:  99%/1243  
      🟩 GraphCapture       Pass: 100%/1   | Total: 29m 56s | Avg: 29m 56s | Max: 29m 56s | Hits:  99%/1243  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 37m | Avg: 32m 31s | Max: 36m 15s | Hits:  99%/3729  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 02s | Max: 30m 26s | Hits:  99%/3729  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 26m | Avg: 28m 56s | Max: 33m 31s | Hits:  73%/3729  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:  21%/1243  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 21h 54m | Avg:  1h 02m | Max:  1h 21m | Hits:  21%/25302 
      🟩 20                 Pass: 100%/26  | Total: 21h 11m | Avg: 48m 54s | Max:  1h 18m | Hits:  45%/31871 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 02h | Avg: 34m 15s | Max: 1h 13m | Hits: 78%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 43m 07s | Avg: 21m 33s | Max: 29m 22s | Hits:  88%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 01h | Avg: 34m 27s | Max:  1h 13m | Hits:  78%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 59m 41s | Avg: 29m 50s | Max: 31m 56s | Hits:  76%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 17m | Avg: 39m 25s | Max:  1h 01m | Hits:  74%/8941  
      🟩 12.8               Pass: 100%/42  | Total: 23h 33m | Avg: 33m 38s | Max:  1h 13m | Hits:  79%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 55m 20s | Avg: 27m 40s | Max: 30m 02s | Hits:  76%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 17m | Avg: 39m 25s | Max:  1h 01m | Hits:  74%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total: 22h 37m | Avg: 33m 56s | Max:  1h 13m | Hits:  79%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 55m 20s | Avg: 27m 40s | Max: 30m 02s | Hits:  76%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 01h | Avg: 34m 33s | Max:  1h 13m | Hits:  78%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 06m | Avg: 31m 42s | Max: 32m 50s | Hits:  76%/7156  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 29s | Max: 34m 02s | Hits:  76%/3578  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 57s | Max: 31m 12s | Hits:  76%/3578  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 47s | Max: 34m 21s | Hits:  76%/3578  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 22s | Max: 30m 42s | Hits:  76%/3578  
      🟩 Clang19            Pass: 100%/7   | Total:  2h 43m | Avg: 23m 24s | Max: 31m 02s | Hits:  83%/12523 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 13m | Avg: 36m 39s | Max: 36m 44s | Hits:  76%/3580  
      🟩 GCC8               Pass: 100%/1   | Total: 33m 22s | Avg: 33m 22s | Max: 33m 22s | Hits:  76%/1790  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 39s | Max: 36m 05s | Hits:  76%/3580  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 26s | Max: 33m 56s | Hits:  76%/3580  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 12m | Avg: 36m 12s | Max: 37m 16s | Hits:  76%/3580  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 06s | Max: 34m 17s | Hits:  76%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 00m | Avg: 24m 02s | Max: 38m 11s | Hits:  86%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  64%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 56m | Avg: 58m 57s | Max:  1h 12m | Hits:  75%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 13m | Hits:  64%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 03m | Avg: 28m 37s | Max: 34m 21s | Hits:  79%/33991 
      🟩 GCC                Pass: 100%/21  | Total: 10h 25m | Avg: 29m 48s | Max: 38m 11s | Hits:  81%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  5h 04m | Avg:  1h 00m | Max:  1h 12m | Hits:  71%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 13m | Hits:  64%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 32m 29s | Avg: 16m 14s | Max: 19m 34s | Hits:  88%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total: 21h 52m | Avg: 37m 29s | Max:  1h 13m | Hits:  75%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 25m | Avg: 26m 33s | Max:  1h 12m | Hits:  89%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 01h | Avg: 37m 32s | Max:  1h 13m | Hits:  75%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 57m 05s | Avg: 19m 01s | Max: 38m 29s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total: 51m 19s | Avg: 12m 49s | Max: 13m 45s | Hits:  99%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 32m 29s | Avg: 16m 14s | Max: 19m 34s | Hits:  88%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total: 35m 50s | Avg: 35m 50s | Max: 35m 50s | Hits:  76%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 13h 53m | Avg: 39m 40s | Max:  1h 13m | Hits:  74%/37560 
      🟩 20                 Pass: 100%/24  | Total: 12h 14m | Avg: 30m 35s | Max:  1h 12m | Hits:  81%/42934 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 08m | Avg: 7m 15s | Max: 14m 57s | Hits: 91%/14642

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 46m | Avg:  7m 34s | Max: 14m 57s | Hits:  91%/12298 
      🟩 arm64              Pass: 100%/4   | Total: 21m 54s | Avg:  5m 28s | Max:  6m 02s | Hits:  90%/2344  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 25m 50s | Avg:  8m 36s | Max: 14m 46s | Hits:  88%/1463  
      🟩 12.8               Pass: 100%/23  | Total:  2h 42m | Avg:  7m 04s | Max: 14m 57s | Hits:  91%/13179 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 25m 50s | Avg:  8m 36s | Max: 14m 46s | Hits:  88%/1463  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 42m | Avg:  7m 04s | Max: 14m 57s | Hits:  91%/13179 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 08m | Avg:  7m 15s | Max: 14m 57s | Hits:  91%/14642 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 29s | Hits:  90%/1176  
      🟩 Clang15            Pass: 100%/1   | Total:  6m 10s | Avg:  6m 10s | Max:  6m 10s | Hits:  90%/586   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s | Hits:  90%/586   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 50s | Avg:  5m 50s | Max:  5m 50s | Hits:  90%/586   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s | Hits:  90%/586   
      🟩 Clang19            Pass: 100%/4   | Total: 24m 12s | Avg:  6m 03s | Max:  8m 13s | Hits:  93%/2344  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 25s | Avg:  6m 12s | Max:  6m 28s | Hits:  90%/1176  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 31s | Avg:  6m 31s | Max:  6m 31s | Hits:  90%/586   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 37s | Avg:  6m 37s | Max:  6m 37s | Hits:  90%/586   
      🟩 GCC13              Pass: 100%/8   | Total: 51m 22s | Avg:  6m 25s | Max:  9m 47s | Hits:  92%/4688  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 46s | Avg: 14m 46s | Max: 14m 46s | Hits:  78%/287   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 14m 57s | Avg: 14m 57s | Max: 14m 57s | Hits:  78%/287   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 23m 41s | Avg: 11m 50s | Max: 12m 05s | Hits:  88%/1168  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 58m 15s | Avg:  5m 49s | Max:  8m 13s | Hits:  91%/5864  
      🟩 GCC                Pass: 100%/12  | Total:  1h 16m | Avg:  6m 24s | Max:  9m 47s | Hits:  91%/7036  
      🟩 MSVC               Pass: 100%/2   | Total: 29m 43s | Avg: 14m 51s | Max: 14m 57s | Hits:  78%/574   
      🟩 NVHPC              Pass: 100%/2   | Total: 23m 41s | Avg: 11m 50s | Max: 12m 05s | Hits:  88%/1168  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 13m 02s | Avg:  6m 31s | Max:  8m 08s | Hits:  95%/1172  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 55m | Avg:  7m 18s | Max: 14m 57s | Hits:  90%/13470 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 42m | Avg:  7m 03s | Max: 14m 57s | Hits:  89%/12884 
      🟩 Test               Pass: 100%/3   | Total: 26m 08s | Avg:  8m 42s | Max:  9m 47s | Hits:  99%/1758  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 18m 05s | Avg:  6m 01s | Max:  8m 08s | Hits:  93%/1758  
      🟩 90a                Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s | Hits:  90%/586   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 27m 54s | Avg:  6m 58s | Max: 12m 05s | Hits:  90%/2342  
      🟩 20                 Pass: 100%/22  | Total:  2h 40m | Avg:  7m 18s | Max: 14m 57s | Hits:  91%/12300 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 22m 57s | Avg: 5m 44s | Max: 6m 20s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 37s | Avg:  6m 18s | Max:  6m 20s
      🟩 arm64              Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 16s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 22m 57s | Avg:  5m 44s | Max:  6m 20s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  6m 20s
      🟩 20                 Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  6m 17s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 21s | Avg: 8m 10s | Max: 12m 22s | Hits: 93%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 21s | Avg:  8m 10s | Max: 12m 22s | Hits:  93%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s | Hits:  87%/163   
      🟩 Test               Pass: 100%/1   | Total: 12m 22s | Avg: 12m 22s | Max: 12m 22s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 138)

# Runner
95 linux-amd64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 windows-amd64-cpu16
10 linux-arm64-cpu16
4 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@fbusato fbusato enabled auto-merge (squash) May 20, 2025 16:52
Copy link
Contributor

🟩 CI finished in 6h 55m: Pass: 100%/138 | Total: 3d 01h | Avg: 31m 56s | Max: 1h 21m | Hits: 71%/156215
  • 🟩 cub: Pass: 100%/47 | Total: 1d 17h | Avg: 53m 10s | Max: 1h 21m | Hits: 50%/57173

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 15h | Avg: 53m 00s | Max:  1h 21m | Hits:  51%/54687 
      🟩 arm64              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 55s | Max:  1h 00m | Hits:  40%/2486  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 17m | Hits:  40%/6041  
      🟩 12.8               Pass: 100%/42  | Total:  1d 12h | Avg: 52m 00s | Max:  1h 21m | Hits:  51%/51132 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 08m | Avg: 34m 14s | Max: 35m 24s | Hits:  42%/2142  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 17m | Hits:  40%/6041  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  1d 11h | Avg: 52m 53s | Max:  1h 21m | Hits:  52%/48990 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 14s | Max: 35m 24s | Hits:  42%/2142  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 16h | Avg: 54m 00s | Max:  1h 21m | Hits:  51%/55031 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 47m | Avg: 56m 48s | Max: 58m 33s | Hits:  40%/4980  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 05s | Max: 59m 35s | Hits:  40%/2486  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 49m | Avg: 54m 31s | Max: 55m 17s | Hits:  40%/2486  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 45m | Avg: 52m 51s | Max: 53m 56s | Hits:  40%/2486  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 44m | Avg: 52m 28s | Max: 53m 38s | Hits:  40%/2486  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 42m | Avg: 40m 17s | Max: 56m 04s | Hits:  58%/8357  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 57m | Avg: 58m 30s | Max:  1h 00m | Hits:  40%/2490  
      🟩 GCC8               Pass: 100%/1   | Total: 57m 32s | Avg: 57m 32s | Max: 57m 32s | Hits:  40%/1245  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  40%/2490  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m | Hits:  40%/2490  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 38s | Max: 58m 34s | Hits:  40%/2486  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 07s | Max: 59m 26s | Hits:  40%/2486  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 26m | Avg: 40m 34s | Max:  1h 05m | Hits:  72%/13673 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 17m | Hits:  41%/2122  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 36m | Avg:  1h 18m | Max:  1h 21m | Hits:  41%/2122  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 13m | Hits:  38%/2288  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 15h 45m | Avg: 49m 44s | Max: 59m 35s | Hits:  47%/23281 
      🟩 GCC                Pass: 100%/22  | Total: 18h 18m | Avg: 49m 54s | Max:  1h 05m | Hits:  56%/27360 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 08m | Avg:  1h 17m | Max:  1h 21m | Hits:  41%/4244  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 13m | Hits:  38%/2288  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 21m | Avg: 27m 14s | Max: 29m 05s | Hits:  79%/3729  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 11h | Avg: 59m 04s | Max:  1h 21m | Hits:  40%/43500 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 50m | Avg: 36m 20s | Max:  1h 05m | Hits:  84%/9944  
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 13h | Avg: 58m 17s | Max:  1h 21m | Hits:  40%/47229 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 32m 51s | Avg: 32m 51s | Max: 32m 51s | Hits:  99%/1243  
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 49s | Avg: 24m 49s | Max: 24m 49s | Hits:  99%/1243  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 28m | Avg: 29m 33s | Max: 32m 46s | Hits:  99%/3729  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 19s | Max: 30m 40s | Hits:  99%/3729  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 21m | Avg: 27m 14s | Max: 29m 05s | Hits:  79%/3729  
      🟩 90;90a;100         Pass: 100%/1   | Total: 57m 29s | Avg: 57m 29s | Max: 57m 29s | Hits:  40%/1243  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 20h 54m | Avg: 59m 44s | Max:  1h 17m | Hits:  40%/25302 
      🟩 20                 Pass: 100%/26  | Total: 20h 44m | Avg: 47m 51s | Max:  1h 21m | Hits:  58%/31871 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 02h | Avg: 33m 38s | Max: 1h 07m | Hits: 81%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 41m 46s | Avg: 20m 53s | Max: 28m 25s | Hits:  89%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 01h | Avg: 33m 49s | Max:  1h 07m | Hits:  81%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 58m 30s | Avg: 29m 15s | Max: 31m 25s | Hits:  78%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 23m | Avg: 40m 36s | Max:  1h 03m | Hits:  77%/8941  
      🟩 12.8               Pass: 100%/42  | Total: 22h 57m | Avg: 32m 48s | Max:  1h 07m | Hits:  81%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 26s | Avg: 26m 43s | Max: 27m 09s | Hits:  79%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 23m | Avg: 40m 36s | Max:  1h 03m | Hits:  77%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total: 22h 04m | Avg: 33m 06s | Max:  1h 07m | Hits:  81%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 26s | Avg: 26m 43s | Max: 27m 09s | Hits:  79%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 01h | Avg: 33m 56s | Max:  1h 07m | Hits:  81%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 51s | Max: 33m 49s | Hits:  79%/7156  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 03m | Avg: 31m 32s | Max: 32m 08s | Hits:  79%/3578  
      🟩 Clang16            Pass: 100%/2   | Total: 59m 58s | Avg: 29m 59s | Max: 31m 26s | Hits:  79%/3578  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 21s | Max: 33m 27s | Hits:  79%/3578  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 18s | Max: 31m 04s | Hits:  79%/3578  
      🟩 Clang19            Pass: 100%/7   | Total:  2h 42m | Avg: 23m 12s | Max: 33m 22s | Hits:  85%/12523 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 53s | Max: 37m 34s | Hits:  78%/3580  
      🟩 GCC8               Pass: 100%/1   | Total: 32m 01s | Avg: 32m 01s | Max: 32m 01s | Hits:  78%/1790  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 47s | Max: 36m 33s | Hits:  78%/3580  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 43s | Max: 33m 48s | Hits:  78%/3580  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 00s | Max: 33m 07s | Hits:  78%/3580  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 31s | Max: 34m 03s | Hits:  78%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 00m | Avg: 24m 01s | Max: 38m 18s | Hits:  87%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  72%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 47m | Avg: 55m 51s | Max:  1h 07m | Hits:  81%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 07m | Hits:  72%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 00m | Avg: 28m 26s | Max: 33m 49s | Hits:  81%/33991 
      🟩 GCC                Pass: 100%/21  | Total: 10h 12m | Avg: 29m 08s | Max: 38m 18s | Hits:  82%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 55m | Avg: 59m 08s | Max:  1h 07m | Hits:  77%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 07m | Hits:  72%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 31m 28s | Avg: 15m 44s | Max: 18m 23s | Hits:  89%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total: 21h 30m | Avg: 36m 52s | Max:  1h 07m | Hits:  78%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 18m | Avg: 25m 52s | Max:  1h 07m | Hits:  90%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 00h | Avg: 36m 43s | Max:  1h 07m | Hits:  77%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 55m 29s | Avg: 18m 29s | Max: 37m 44s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total: 56m 17s | Avg: 14m 04s | Max: 19m 04s | Hits:  99%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 31m 28s | Avg: 15m 44s | Max: 18m 23s | Hits:  89%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total: 38m 18s | Avg: 38m 18s | Max: 38m 18s | Hits:  78%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 13h 40m | Avg: 39m 04s | Max:  1h 07m | Hits:  77%/37560 
      🟩 20                 Pass: 100%/24  | Total: 11h 58m | Avg: 29m 56s | Max:  1h 07m | Hits:  83%/42934 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 11m | Avg: 7m 21s | Max: 16m 15s | Hits: 91%/14642

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 49m | Avg:  7m 41s | Max: 16m 15s | Hits:  91%/12298 
      🟩 arm64              Pass: 100%/4   | Total: 21m 54s | Avg:  5m 28s | Max:  6m 04s | Hits:  90%/2344  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 26m 28s | Avg:  8m 49s | Max: 15m 36s | Hits:  88%/1463  
      🟩 12.8               Pass: 100%/23  | Total:  2h 44m | Avg:  7m 09s | Max: 16m 15s | Hits:  91%/13179 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 26m 28s | Avg:  8m 49s | Max: 15m 36s | Hits:  88%/1463  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 44m | Avg:  7m 09s | Max: 16m 15s | Hits:  91%/13179 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 11m | Avg:  7m 21s | Max: 16m 15s | Hits:  91%/14642 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 32s | Hits:  90%/1176  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s | Hits:  90%/586   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s | Hits:  90%/586   
      🟩 Clang17            Pass: 100%/1   | Total:  6m 07s | Avg:  6m 07s | Max:  6m 07s | Hits:  90%/586   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s | Hits:  90%/586   
      🟩 Clang19            Pass: 100%/4   | Total: 25m 07s | Avg:  6m 16s | Max:  9m 06s | Hits:  93%/2344  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  5m 57s | Hits:  90%/1176  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 38s | Avg:  6m 38s | Max:  6m 38s | Hits:  90%/586   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 29s | Avg:  6m 29s | Max:  6m 29s | Hits:  90%/586   
      🟩 GCC13              Pass: 100%/8   | Total: 52m 36s | Avg:  6m 34s | Max: 11m 26s | Hits:  92%/4688  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 36s | Avg: 15m 36s | Max: 15m 36s | Hits:  78%/287   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 16m 15s | Avg: 16m 15s | Max: 16m 15s | Hits:  78%/287   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 23m 18s | Avg: 11m 39s | Max: 11m 56s | Hits:  88%/1168  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 58m 48s | Avg:  5m 52s | Max:  9m 06s | Hits:  91%/5864  
      🟩 GCC                Pass: 100%/12  | Total:  1h 17m | Avg:  6m 26s | Max: 11m 26s | Hits:  91%/7036  
      🟩 MSVC               Pass: 100%/2   | Total: 31m 51s | Avg: 15m 55s | Max: 16m 15s | Hits:  78%/574   
      🟩 NVHPC              Pass: 100%/2   | Total: 23m 18s | Avg: 11m 39s | Max: 11m 56s | Hits:  88%/1168  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 12m 55s | Avg:  6m 27s | Max:  8m 03s | Hits:  95%/1172  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 58m | Avg:  7m 25s | Max: 16m 15s | Hits:  90%/13470 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 42m | Avg:  7m 04s | Max: 16m 15s | Hits:  89%/12884 
      🟩 Test               Pass: 100%/3   | Total: 28m 35s | Avg:  9m 31s | Max: 11m 26s | Hits:  99%/1758  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 17m 42s | Avg:  5m 54s | Max:  8m 03s | Hits:  93%/1758  
      🟩 90a                Pass: 100%/1   | Total:  4m 55s | Avg:  4m 55s | Max:  4m 55s | Hits:  90%/586   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 27m 25s | Avg:  6m 51s | Max: 11m 56s | Hits:  90%/2342  
      🟩 20                 Pass: 100%/22  | Total:  2h 43m | Avg:  7m 26s | Max: 16m 15s | Hits:  91%/12300 
    
  • 🟩 python: Pass: 100%/12 | Total: 1h 38m | Avg: 8m 13s | Max: 20m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 ctk
      🟩 12.8               Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  1h 38m | Avg:  8m 13s | Max: 20m 32s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  6m 46s | Avg:  3m 23s | Max:  3m 28s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  6m 59s | Avg:  3m 29s | Max:  3m 30s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 18m 59s | Avg:  9m 29s | Max:  9m 30s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total:  9m 03s | Avg:  4m 31s | Max:  4m 40s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 39m 32s | Avg: 19m 46s | Max: 20m 32s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 17m 22s | Avg:  8m 41s | Max:  8m 45s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total: 50m 06s | Avg:  8m 21s | Max: 20m 32s
      🟩 3.13               Pass: 100%/6   | Total: 48m 35s | Avg:  8m 05s | Max: 19m 00s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 20m 49s | Avg: 5m 12s | Max: 6m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  6m 01s
      🟩 arm64              Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  4m 26s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 01s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 56s
      🟩 20                 Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  6m 01s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 49s | Avg: 8m 24s | Max: 13m 20s | Hits: 96%/326

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 13m 20s | Hits:  96%/326   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits:  94%/163   
      🟩 Test               Pass: 100%/1   | Total: 13m 20s | Avg: 13m 20s | Max: 13m 20s | Hits:  98%/163   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 138)

# Runner
95 linux-amd64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 windows-amd64-cpu16
10 linux-arm64-cpu16
4 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@fbusato fbusato merged commit 14428a1 into NVIDIA:main May 20, 2025
148 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL May 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.1.0 Targeted for 3.1 release CI 100% All CI tests passed
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

2 participants