Skip to content

Add Histogram implementation for c.parallel #4689

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 36 commits into
base: main
Choose a base branch
from

Conversation

NaderAlAwar
Copy link
Contributor

Description

closes #4600
should only be merged after #4636

Note that this currently only implements evenly spaced bins for 1 channel. The other features will be added later.

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

NaderAlAwar added 28 commits May 1, 2025 22:21
… separate class in the kernels file since it will be needed to instantiate the kernels
@NaderAlAwar NaderAlAwar requested review from a team as code owners May 13, 2025 18:32
@github-project-automation github-project-automation bot moved this to Todo in CCCL May 13, 2025
@NaderAlAwar NaderAlAwar marked this pull request as draft May 13, 2025 18:32
Copy link

copy-pr-bot bot commented May 13, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Progress in CCCL May 13, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from In Progress to In Review in CCCL May 13, 2025
Copy link
Contributor

🟨 CI finished in 56m 12s: Pass: 99%/129 | Total: 20h 46m | Avg: 9m 39s | Max: 31m 54s | Hits: 98%/155107
  • 🟨 cub: Pass: 97%/47 | Total: 9h 16m | Avg: 11m 50s | Max: 26m 22s | Hits: 98%/56033

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/45  | Total:  9h 00m | Avg: 12m 00s | Max: 26m 22s | Hits:  98%/53547 
      🟩 arm64              Pass: 100%/2   | Total: 15m 40s | Avg:  7m 50s | Max:  8m 33s | Hits:  97%/2486  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total: 50m 40s | Avg: 10m 08s | Max: 20m 34s | Hits:  97%/6041  
      🔍 12.8               Pass:  97%/42  | Total:  8h 25m | Avg: 12m 02s | Max: 26m 22s | Hits:  98%/49992 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 12m 43s | Avg:  6m 21s | Max:  6m 34s | Hits:  98%/2142  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 50m 40s | Avg: 10m 08s | Max: 20m 34s | Hits:  97%/6041  
      🔍 nvcc12.8           Pass:  97%/40  | Total:  8h 13m | Avg: 12m 19s | Max: 26m 22s | Hits:  98%/47850 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 12m 43s | Avg:  6m 21s | Max:  6m 34s | Hits:  98%/2142  
      🔍 nvcc               Pass:  97%/45  | Total:  9h 03m | Avg: 12m 04s | Max: 26m 22s | Hits:  98%/53891 
    🔍 cxx: NVHPC25.3 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 30m 00s | Avg:  7m 30s | Max:  8m 14s | Hits:  97%/4980  
      🟩 Clang15            Pass: 100%/2   | Total: 16m 35s | Avg:  8m 17s | Max:  8m 39s | Hits:  97%/2486  
      🟩 Clang16            Pass: 100%/2   | Total: 16m 00s | Avg:  8m 00s | Max:  8m 17s | Hits:  97%/2486  
      🟩 Clang17            Pass: 100%/2   | Total: 16m 52s | Avg:  8m 26s | Max:  8m 30s | Hits:  97%/2486  
      🟩 Clang18            Pass: 100%/2   | Total: 15m 26s | Avg:  7m 43s | Max:  7m 47s | Hits:  97%/2486  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 16m | Avg: 10m 54s | Max: 23m 20s | Hits:  98%/8357  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 39s | Avg:  8m 19s | Max:  8m 21s | Hits:  97%/2490  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 55s | Avg:  7m 55s | Max:  7m 55s | Hits:  97%/1245  
      🟩 GCC9               Pass: 100%/2   | Total: 16m 05s | Avg:  8m 02s | Max:  8m 26s | Hits:  97%/2490  
      🟩 GCC10              Pass: 100%/2   | Total: 17m 23s | Avg:  8m 41s | Max:  8m 42s | Hits:  97%/2490  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 24s | Avg:  8m 42s | Max:  8m 54s | Hits:  97%/2486  
      🟩 GCC12              Pass: 100%/2   | Total: 17m 14s | Avg:  8m 37s | Max:  8m 54s | Hits:  97%/2486  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 57m | Avg: 16m 09s | Max: 26m 22s | Hits:  98%/13673 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 42m 23s | Avg: 21m 11s | Max: 21m 49s | Hits:  96%/2122  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 46m 31s | Avg: 23m 15s | Max: 23m 47s | Hits:  97%/2122  
      🔍 NVHPC25.3          Pass:  50%/2   | Total: 25m 58s | Avg: 12m 59s | Max: 13m 01s | Hits:  96%/1148  
    🔍 cxx_family: NVHPC 🔍
      🟩 Clang              Pass: 100%/19  | Total:  2h 51m | Avg:  9m 00s | Max: 23m 20s | Hits:  98%/23281 
      🟩 GCC                Pass: 100%/22  | Total:  4h 30m | Avg: 12m 17s | Max: 26m 22s | Hits:  98%/27360 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 28m | Avg: 22m 13s | Max: 23m 47s | Hits:  96%/4244  
      🔍 NVHPC              Pass:  50%/2   | Total: 25m 58s | Avg: 12m 59s | Max: 13m 01s | Hits:  96%/1148  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/3   | Total: 55m 20s | Avg: 18m 26s | Max: 26m 22s | Hits:  99%/3729  
      🔍 rtx2080            Pass:  97%/36  | Total:  5h 57m | Avg:  9m 56s | Max: 23m 47s | Hits:  97%/42360 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 23m | Avg: 17m 55s | Max: 24m 10s | Hits:  99%/9944  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  97%/39  | Total:  6h 20m | Avg:  9m 45s | Max: 23m 47s | Hits:  97%/46089 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 10s | Avg: 24m 10s | Max: 24m 10s | Hits:  99%/1243  
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 02s | Avg: 18m 02s | Max: 18m 02s | Hits:  99%/1243  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 26s | Max: 26m 22s | Hits:  99%/3729  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 00m | Avg: 20m 06s | Max: 22m 28s | Hits:  99%/3729  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/21  | Total:  3h 36m | Avg: 10m 19s | Max: 22m 44s | Hits:  97%/25306 
      🔍 20                 Pass:  96%/26  | Total:  5h 39m | Avg: 13m 03s | Max: 26m 22s | Hits:  98%/30727 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 55m 20s | Avg: 18m 26s | Max: 26m 22s | Hits:  99%/3729  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 59s | Avg:  8m 59s | Max:  8m 59s | Hits:  97%/1243  
    
  • 🟩 thrust: Pass: 100%/47 | Total: 7h 44m | Avg: 9m 53s | Max: 31m 54s | Hits: 98%/84074

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 39s | Avg:  8m 49s | Max: 10m 58s | Hits:  99%/3580  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  7h 34m | Avg: 10m 05s | Max: 31m 54s | Hits:  98%/80495 
      🟩 arm64              Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  5m 51s | Hits:  99%/3579  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 42m 54s | Avg:  8m 34s | Max: 21m 41s | Hits:  99%/8941  
      🟩 12.8               Pass: 100%/42  | Total:  7h 01m | Avg: 10m 02s | Max: 31m 54s | Hits:  98%/75133 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  6m 11s | Hits: 100%/3578  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 42m 54s | Avg:  8m 34s | Max: 21m 41s | Hits:  99%/8941  
      🟩 nvcc12.8           Pass: 100%/40  | Total:  6h 50m | Avg: 10m 15s | Max: 31m 54s | Hits:  98%/71555 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  6m 11s | Hits: 100%/3578  
      🟩 nvcc               Pass: 100%/45  | Total:  7h 33m | Avg: 10m 04s | Max: 31m 54s | Hits:  98%/80496 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 06s | Avg:  5m 31s | Max:  6m 14s | Hits: 100%/7156  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 01s | Hits: 100%/3578  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 13s | Avg:  6m 06s | Max:  6m 13s | Hits: 100%/3578  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max:  6m 21s | Hits: 100%/3578  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 48s | Avg:  5m 54s | Max:  6m 03s | Hits: 100%/3578  
      🟩 Clang19            Pass: 100%/7   | Total: 46m 06s | Avg:  6m 35s | Max: 10m 02s | Hits: 100%/12523 
      🟩 GCC7               Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 28s | Hits:  99%/3580  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 14s | Avg:  6m 14s | Max:  6m 14s | Hits:  99%/1790  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  5m 59s | Hits:  99%/3580  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 06s | Avg:  6m 33s | Max:  6m 38s | Hits:  99%/3580  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 07s | Avg:  6m 33s | Max:  6m 38s | Hits:  99%/3580  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 02s | Avg:  7m 01s | Max:  7m 11s | Hits:  99%/3580  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 39m | Avg:  9m 54s | Max: 31m 01s | Hits:  94%/17900 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 43m 32s | Avg: 21m 46s | Max: 21m 51s | Hits:  99%/3566  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 13m | Avg: 24m 39s | Max: 26m 54s | Hits:  99%/5349  
      🟩 NVHPC25.3          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 48s | Max: 31m 54s | Hits:  99%/3578  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 56m | Avg:  6m 08s | Max: 10m 02s | Hits: 100%/33991 
      🟩 GCC                Pass: 100%/21  | Total:  2h 49m | Avg:  8m 02s | Max: 31m 01s | Hits:  97%/37590 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 57m | Avg: 23m 30s | Max: 26m 54s | Hits:  99%/8915  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 48s | Max: 31m 54s | Hits:  99%/3578  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 36m 12s | Avg: 18m 06s | Max: 31m 01s | Hits:  75%/3580  
      🟩 rtx2080            Pass: 100%/35  | Total:  5h 10m | Avg:  8m 52s | Max: 31m 54s | Hits:  99%/62611 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 58m | Avg: 11m 49s | Max: 26m 54s | Hits:  99%/17883 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  5h 58m | Avg:  8m 57s | Max: 31m 54s | Hits:  99%/71553 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 04s | Avg: 14m 21s | Max: 26m 54s | Hits:  99%/5362  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 03m | Avg: 15m 57s | Max: 31m 01s | Hits:  87%/7159  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 36m 12s | Avg: 18m 06s | Max: 31m 01s | Hits:  75%/3580  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 26s | Avg:  6m 26s | Max:  6m 26s | Hits:  99%/1790  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 22m | Avg:  9m 38s | Max: 31m 54s | Hits:  99%/37560 
      🟩 20                 Pass: 100%/24  | Total:  4h 04m | Avg: 10m 12s | Max: 31m 01s | Hits:  97%/42934 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 19m | Avg: 5m 21s | Max: 11m 29s | Hits: 99%/14668

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 05m | Avg:  5m 41s | Max: 11m 29s | Hits:  98%/12320 
      🟩 arm64              Pass: 100%/4   | Total: 14m 01s | Avg:  3m 30s | Max:  3m 48s | Hits:  99%/2348  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 17m 47s | Avg:  5m 55s | Max: 10m 13s | Hits:  98%/1466  
      🟩 12.8               Pass: 100%/23  | Total:  2h 01m | Avg:  5m 16s | Max: 11m 29s | Hits:  99%/13202 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 17m 47s | Avg:  5m 55s | Max: 10m 13s | Hits:  98%/1466  
      🟩 nvcc12.8           Pass: 100%/23  | Total:  2h 01m | Avg:  5m 16s | Max: 11m 29s | Hits:  99%/13202 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 19m | Avg:  5m 21s | Max: 11m 29s | Hits:  99%/14668 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  8m 00s | Avg:  4m 00s | Max:  4m 22s | Hits:  99%/1178  
      🟩 Clang15            Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s | Hits:  99%/587   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s | Hits:  99%/587   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 18s | Avg:  4m 18s | Max:  4m 18s | Hits:  99%/587   
      🟩 Clang18            Pass: 100%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s | Hits:  99%/587   
      🟩 Clang19            Pass: 100%/4   | Total: 18m 15s | Avg:  4m 33s | Max:  7m 31s | Hits:  99%/2348  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 51s | Avg:  3m 55s | Max:  3m 56s | Hits:  99%/1178  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s | Hits:  99%/587   
      🟩 GCC12              Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s | Hits:  99%/587   
      🟩 GCC13              Pass: 100%/8   | Total: 40m 34s | Avg:  5m 04s | Max:  9m 39s | Hits:  99%/4696  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 13s | Avg: 10m 13s | Max: 10m 13s | Hits:  95%/288   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s | Hits:  95%/288   
      🟩 NVHPC25.3          Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 25s | Hits:  97%/1170  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 43m 56s | Avg:  4m 23s | Max:  7m 31s | Hits:  99%/5874  
      🟩 GCC                Pass: 100%/12  | Total: 56m 58s | Avg:  4m 44s | Max:  9m 39s | Hits:  99%/7048  
      🟩 MSVC               Pass: 100%/2   | Total: 21m 42s | Avg: 10m 51s | Max: 11m 29s | Hits:  95%/576   
      🟩 NVHPC              Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 25s | Hits:  97%/1170  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  8m 12s | Hits:  99%/1174  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 07m | Avg:  5m 18s | Max: 11m 29s | Hits:  99%/13494 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  1h 53m | Avg:  4m 56s | Max: 11m 29s | Hits:  98%/12907 
      🟩 Test               Pass: 100%/3   | Total: 25m 22s | Avg:  8m 27s | Max:  9m 39s | Hits:  99%/1761  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 15m 37s | Avg:  5m 12s | Max:  8m 12s | Hits:  99%/1761  
      🟩 90a                Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s | Hits:  99%/587   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 58s | Avg:  4m 44s | Max:  8m 25s | Hits:  98%/2346  
      🟩 20                 Pass: 100%/22  | Total:  2h 00m | Avg:  5m 27s | Max: 11m 29s | Hits:  99%/12322 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 19m 53s | Avg: 4m 58s | Max: 5m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  5m 10s
      🟩 arm64              Pass: 100%/2   | Total:  9m 36s | Avg:  4m 48s | Max:  5m 56s
    🟩 ctk
      🟩 12.8               Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 cxx
      🟩 NVHPC25.3          Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 56s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 56s
      🟩 20                 Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  5m 10s
    
  • 🟩 python: Pass: 100%/3 | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 ctk
      🟩 12.8               Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/3   | Total: 33m 10s | Avg: 11m 03s | Max: 19m 05s
    🟩 jobs
      🟩 cuda.cccl          Pass: 100%/1   | Total:  7m 24s | Avg:  7m 24s | Max:  7m 24s
      🟩 cuda.cooperative   Pass: 100%/1   | Total: 19m 05s | Avg: 19m 05s | Max: 19m 05s
      🟩 cuda.parallel      Pass: 100%/1   | Total:  6m 41s | Avg:  6m 41s | Max:  6m 41s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits: 97%/332

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 30m 02s | Hits:  97%/332   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 44s | Avg:  2m 44s | Max:  2m 44s | Hits:  96%/166   
      🟩 Test               Pass: 100%/1   | Total: 30m 02s | Avg: 30m 02s | Max: 30m 02s | Hits:  98%/166   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
stdpar
python
+/- CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 129)

# Runner
89 linux-amd64-cpu16
11 windows-amd64-cpu16
10 linux-arm64-cpu16
6 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

[FEA]: Implement cccl.c.parallel version of histogram
2 participants