Add --extended-lambda to the list of removed clangd flags #3432

Merged

fbusato commented Jan 16, 2025

Description

clangd doesn't support the --extended-lambda flag (nvcc: "Allow __host__, __device__ annotations in lambda declaration").
This PR adds it to the list of removed flags in .clangd, next to --expt-extended-lambda, which is already listed there.
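
For illustration, the relevant part of a .clangd file could look roughly like the fragment below. This is a minimal sketch, not the repository's actual file: it assumes the flags sit under clangd's CompileFlags / Remove key and leaves out any other entries the real file contains.

```yaml
# Sketch of a .clangd fragment (assumed layout; the real file lists more flags).
CompileFlags:
  Remove:
    # Already present before this PR, per the description above.
    - --expt-extended-lambda
    # Added by this PR: nvcc-only flag that clangd does not understand.
    - --extended-lambda
```

Entries under Remove are stripped from the compile command before clangd parses a file, so nvcc-only options recorded in compile_commands.json do not surface as unknown-argument errors in the editor.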

fbusato added the "2.8.0" label (target for 2.8.0 release) Jan 16, 2025
fbusato self-assigned this Jan 16, 2025
fbusato requested a review from a team as a code owner January 16, 2025 23:37
fbusato requested a review from jrhemstad January 16, 2025 23:37

🟨 CI finished in 2h 01m: Pass: 99%/148 | Total: 1d 05h | Avg: 12m 04s | Max: 1h 48m | Hits: 536%/25754
  • 🟨 cub: Pass: 97%/38 | Total: 10h 15m | Avg: 16m 11s | Max: 1h 48m | Hits: 539%/3540

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/36  | Total: 10h 05m | Avg: 16m 49s | Max:  1h 48m | Hits: 539%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  5m 00s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total: 49m 56s | Avg:  9m 59s | Max: 28m 42s | Hits: 539%/885   
      🟩 12.5               Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
      🔍 12.6               Pass:  96%/31  | Total:  9h 07m | Avg: 17m 38s | Max:  1h 48m | Hits: 539%/2655  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 32s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 49m 56s | Avg:  9m 59s | Max: 28m 42s | Hits: 539%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
      🔍 nvcc12.6           Pass:  96%/29  | Total:  8h 58m | Avg: 18m 33s | Max:  1h 48m | Hits: 539%/2655  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 32s
      🔍 nvcc               Pass:  97%/36  | Total: 10h 06m | Avg: 16m 50s | Max:  1h 48m | Hits: 539%/3540  
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 21m 44s | Avg:  5m 26s | Max:  5m 46s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s
      🔍 Clang18            Pass:  85%/7   | Total:  1h 08m | Avg:  9m 51s | Max: 40m 07s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 25s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 47s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 GCC12              Pass: 100%/3   | Total: 29m 43s | Avg:  9m 54s | Max: 19m 15s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 03m | Avg: 37m 57s | Max:  1h 48m
      🟩 MSVC14.29          Pass: 100%/2   | Total: 57m 24s | Avg: 28m 42s | Max: 28m 42s | Hits: 539%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 50s | Avg: 29m 55s | Max: 30m 16s | Hits: 539%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  92%/14  | Total:  1h 47m | Avg:  7m 41s | Max: 40m 07s
      🟩 GCC                Pass: 100%/18  | Total:  6h 12m | Avg: 20m 40s | Max:  1h 48m
      🟩 MSVC               Pass: 100%/4   | Total:  1h 57m | Avg: 29m 18s | Max: 30m 16s | Hits: 539%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 23m 34s | Avg: 11m 47s | Max: 19m 15s
      🔍 v100               Pass:  97%/36  | Total:  9h 51m | Avg: 16m 26s | Max:  1h 48m | Hits: 539%/3540  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/31  | Total:  4h 29m | Avg:  8m 41s | Max: 30m 16s | Hits: 539%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 40m | Avg:  1h 40m | Max:  1h 40m
      🟩 GraphCapture       Pass: 100%/1   | Total:  1h 48m | Avg:  1h 48m | Max:  1h 48m
      🔍 HostLaunch         Pass:  66%/3   | Total: 53m 10s | Avg: 17m 43s | Max: 30m 02s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 22m | Avg: 41m 24s | Max: 42m 42s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/14  | Total:  2h 30m | Avg: 10m 43s | Max: 29m 34s | Hits: 539%/2655  
      🔍 20                 Pass:  95%/24  | Total:  7h 45m | Avg: 19m 22s | Max:  1h 48m | Hits: 539%/885   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 34s | Avg: 11m 47s | Max: 19m 15s
      🟩 90a                Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
    
  • 🟩 libcudacxx: Pass: 100%/46 | Total: 9h 52m | Avg: 12m 52s | Max: 36m 11s | Hits: 668%/12472

    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  9h 45m | Avg: 13m 18s | Max: 36m 11s | Hits: 668%/12472 
      🟩 arm64              Pass: 100%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/8   | Total:  1h 19m | Avg:  9m 53s | Max: 26m 14s | Hits: 649%/4869  
      🟩 12.5               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
      🟩 12.6               Pass: 100%/36  | Total:  7h 31m | Avg: 12m 31s | Max: 36m 11s | Hits: 681%/7603  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 04m | Avg: 16m 13s | Max: 20m 04s
      🟩 nvcc12.0           Pass: 100%/8   | Total:  1h 19m | Avg:  9m 53s | Max: 26m 14s | Hits: 649%/4869  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  6h 26m | Avg: 12m 04s | Max: 36m 11s | Hits: 681%/7603  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 04m | Avg: 16m 13s | Max: 20m 04s
      🟩 nvcc               Pass: 100%/42  | Total:  8h 47m | Avg: 12m 33s | Max: 36m 11s | Hits: 668%/12472 
    🟩 cxx
      🟩 Clang14            Pass: 100%/6   | Total: 35m 17s | Avg:  5m 52s | Max: 15m 29s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 40m | Avg: 12m 36s | Max: 23m 04s
      🟩 GCC7               Pass: 100%/5   | Total: 29m 55s | Avg:  5m 59s | Max: 15m 25s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
      🟩 GCC9               Pass: 100%/3   | Total: 21m 45s | Avg:  7m 15s | Max: 14m 12s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 GCC13              Pass: 100%/10  | Total:  3h 00m | Avg: 18m 03s | Max: 36m 11s
      🟩 MSVC14.29          Pass: 100%/3   | Total:  1h 12m | Avg: 24m 03s | Max: 26m 14s | Hits: 660%/7354  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 02s | Avg: 29m 31s | Max: 31m 11s | Hits: 680%/5118  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 29m | Avg:  8m 49s | Max: 23m 04s
      🟩 GCC                Pass: 100%/22  | Total:  4h 09m | Avg: 11m 19s | Max: 36m 11s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 11m | Avg: 26m 14s | Max: 31m 11s | Hits: 668%/12472 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  9h 52m | Avg: 12m 52s | Max: 36m 11s | Hits: 668%/12472 
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 55m | Avg: 10m 39s | Max: 31m 19s | Hits: 668%/12472 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 59m | Avg: 29m 55s | Max: 36m 11s
      🟩 Test               Pass: 100%/2   | Total: 55m 21s | Avg: 27m 40s | Max: 32m 17s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
      🟩 90a                Pass: 100%/2   | Total: 16m 04s | Avg:  8m 02s | Max: 12m 29s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 23m | Avg: 13m 51s | Max: 31m 30s
      🟩 14                 Pass: 100%/4   | Total:  1h 04m | Avg: 16m 10s | Max: 36m 11s | Hits: 680%/2394  
      🟩 17                 Pass: 100%/14  | Total:  3h 19m | Avg: 14m 16s | Max: 31m 19s | Hits: 661%/7445  
      🟩 20                 Pass: 100%/21  | Total:  4h 02m | Avg: 11m 33s | Max: 32m 17s | Hits: 678%/2633  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 6h 40m | Avg: 10m 49s | Max: 35m 15s | Hits: 365%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 29m 01s | Avg: 14m 30s | Max: 22m 37s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  6h 30m | Avg: 11m 10s | Max: 35m 15s | Hits: 365%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 30s | Avg:  4m 45s | Max:  4m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 48s | Avg:  9m 33s | Max: 27m 14s | Hits: 365%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
      🟩 12.6               Pass: 100%/30  | Total:  5h 23m | Avg: 10m 46s | Max: 35m 15s | Hits: 365%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 10s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 48s | Avg:  9m 33s | Max: 27m 14s | Hits: 365%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
      🟩 nvcc12.6           Pass: 100%/28  | Total:  5h 12m | Avg: 11m 10s | Max: 35m 15s | Hits: 365%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 10s
      🟩 nvcc               Pass: 100%/35  | Total:  6h 30m | Avg: 11m 08s | Max: 35m 15s | Hits: 365%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 49s | Avg:  5m 27s | Max:  5m 42s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 36s | Avg:  5m 36s | Max:  5m 36s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang18            Pass: 100%/7   | Total: 46m 17s | Avg:  6m 36s | Max: 12m 24s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 11s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 55s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 33m | Avg: 11m 41s | Max: 35m 15s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 56m 18s | Avg: 28m 09s | Max: 29m 04s | Hits: 365%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 33m | Avg: 31m 07s | Max: 34m 04s | Hits: 365%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 24m | Avg:  6m 02s | Max: 12m 24s
      🟩 GCC                Pass: 100%/16  | Total:  2h 16m | Avg:  8m 33s | Max: 35m 15s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 29m | Avg: 29m 56s | Max: 34m 04s | Hits: 365%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  6h 40m | Avg: 10m 49s | Max: 35m 15s | Hits: 365%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  4h 40m | Avg:  9m 02s | Max: 30m 29s | Hits: 365%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 48s | Avg: 16m 36s | Max: 34m 04s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 25s | Max: 35m 15s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  2h 33m | Avg: 10m 58s | Max: 29m 04s | Hits: 365%/5532  
      🟩 20                 Pass: 100%/21  | Total:  3h 37m | Avg: 10m 22s | Max: 35m 15s | Hits: 365%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 49m | Avg: 5m 28s | Max: 16m 29s | Hits: 388%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 39m | Avg:  6m 11s | Max: 16m 29s | Hits: 388%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 23s | Avg:  2m 35s | Max:  2m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
      🟩 12.6               Pass: 100%/17  | Total:  1h 27m | Avg:  5m 07s | Max: 16m 29s | Hits: 388%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 27m | Avg:  5m 07s | Max: 16m 29s | Hits: 388%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 49m | Avg:  5m 28s | Max: 16m 29s | Hits: 388%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s
      🟩 Clang18            Pass: 100%/4   | Total: 24m 56s | Avg:  6m 14s | Max: 16m 29s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 01s | Avg:  3m 01s | Max:  3m 01s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 51s | Avg:  9m 55s | Max: 16m 29s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 42s | Avg:  2m 40s | Max:  2m 47s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 12s | Avg: 12m 12s | Max: 12m 12s | Hits: 388%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 38m 19s | Avg:  4m 47s | Max: 16m 29s
      🟩 GCC                Pass: 100%/8   | Total: 36m 38s | Avg:  4m 34s | Max: 16m 29s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 50s | Avg: 11m 55s | Max: 12m 12s | Hits: 388%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 49m | Avg:  5m 28s | Max: 16m 29s | Hits: 388%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 16m | Avg:  4m 14s | Max: 12m 12s | Hits: 388%/522   
      🟩 Test               Pass: 100%/2   | Total: 32m 58s | Avg: 16m 29s | Max: 16m 29s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
      🟩 90a                Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 03s | Avg:  3m 15s | Max:  5m 04s
      🟩 20                 Pass: 100%/16  | Total:  1h 36m | Avg:  6m 01s | Max: 16m 29s | Hits: 388%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 25m 11s | Avg: 6m 17s | Max: 11m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 16m 12s | Avg:  8m 06s | Max: 11m 51s
      🟩 12.6               Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 37s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 16m 12s | Avg:  8m 06s | Max: 11m 51s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 11m 51s | Avg: 11m 51s | Max: 11m 51s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 11m 51s
      🟩 GCC                Pass: 100%/2   | Total:  8m 43s | Avg:  4m 21s | Max:  4m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 18s | Avg: 5m 39s | Max: 9m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟩 Test               Pass: 100%/1   | Total:  9m 19s | Avg:  9m 19s | Max:  9m 19s
    
  • 🟩 python: Pass: 100%/1 | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 148)

# Runner
98 linux-amd64-cpu16
23 linux-amd64-gpu-v100-latest-1
16 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing


🟩 CI finished in 18h 29m: Pass: 100%/148 | Total: 1d 06h | Avg: 12m 11s | Max: 1h 48m | Hits: 536%/25754
  • 🟩 libcudacxx: Pass: 100%/46 | Total: 9h 52m | Avg: 12m 52s | Max: 36m 11s | Hits: 668%/12472

    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  9h 45m | Avg: 13m 18s | Max: 36m 11s | Hits: 668%/12472 
      🟩 arm64              Pass: 100%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/8   | Total:  1h 19m | Avg:  9m 53s | Max: 26m 14s | Hits: 649%/4869  
      🟩 12.5               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
      🟩 12.6               Pass: 100%/36  | Total:  7h 31m | Avg: 12m 31s | Max: 36m 11s | Hits: 681%/7603  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 04m | Avg: 16m 13s | Max: 20m 04s
      🟩 nvcc12.0           Pass: 100%/8   | Total:  1h 19m | Avg:  9m 53s | Max: 26m 14s | Hits: 649%/4869  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  6h 26m | Avg: 12m 04s | Max: 36m 11s | Hits: 681%/7603  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 04m | Avg: 16m 13s | Max: 20m 04s
      🟩 nvcc               Pass: 100%/42  | Total:  8h 47m | Avg: 12m 33s | Max: 36m 11s | Hits: 668%/12472 
    🟩 cxx
      🟩 Clang14            Pass: 100%/6   | Total: 35m 17s | Avg:  5m 52s | Max: 15m 29s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 52s | Avg:  4m 52s | Max:  4m 52s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 40m | Avg: 12m 36s | Max: 23m 04s
      🟩 GCC7               Pass: 100%/5   | Total: 29m 55s | Avg:  5m 59s | Max: 15m 25s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
      🟩 GCC9               Pass: 100%/3   | Total: 21m 45s | Avg:  7m 15s | Max: 14m 12s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 GCC13              Pass: 100%/10  | Total:  3h 00m | Avg: 18m 03s | Max: 36m 11s
      🟩 MSVC14.29          Pass: 100%/3   | Total:  1h 12m | Avg: 24m 03s | Max: 26m 14s | Hits: 660%/7354  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 02s | Avg: 29m 31s | Max: 31m 11s | Hits: 680%/5118  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 29m | Avg:  8m 49s | Max: 23m 04s
      🟩 GCC                Pass: 100%/22  | Total:  4h 09m | Avg: 11m 19s | Max: 36m 11s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 11m | Avg: 26m 14s | Max: 31m 11s | Hits: 668%/12472 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  9h 52m | Avg: 12m 52s | Max: 36m 11s | Hits: 668%/12472 
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 55m | Avg: 10m 39s | Max: 31m 19s | Hits: 668%/12472 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 59m | Avg: 29m 55s | Max: 36m 11s
      🟩 Test               Pass: 100%/2   | Total: 55m 21s | Avg: 27m 40s | Max: 32m 17s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
      🟩 90a                Pass: 100%/2   | Total: 16m 04s | Avg:  8m 02s | Max: 12m 29s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 23m | Avg: 13m 51s | Max: 31m 30s
      🟩 14                 Pass: 100%/4   | Total:  1h 04m | Avg: 16m 10s | Max: 36m 11s | Hits: 680%/2394  
      🟩 17                 Pass: 100%/14  | Total:  3h 19m | Avg: 14m 16s | Max: 31m 19s | Hits: 661%/7445  
      🟩 20                 Pass: 100%/21  | Total:  4h 02m | Avg: 11m 33s | Max: 32m 17s | Hits: 678%/2633  
    
  • 🟩 cub: Pass: 100%/38 | Total: 10h 33m | Avg: 16m 40s | Max: 1h 48m | Hits: 539%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total: 10h 23m | Avg: 17m 19s | Max:  1h 48m | Hits: 539%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  5m 00s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 49m 56s | Avg:  9m 59s | Max: 28m 42s | Hits: 539%/885   
      🟩 12.5               Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
      🟩 12.6               Pass: 100%/31  | Total:  9h 25m | Avg: 18m 13s | Max:  1h 48m | Hits: 539%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 32s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 49m 56s | Avg:  9m 59s | Max: 28m 42s | Hits: 539%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
      🟩 nvcc12.6           Pass: 100%/29  | Total:  9h 16m | Avg: 19m 10s | Max:  1h 48m | Hits: 539%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 32s
      🟩 nvcc               Pass: 100%/36  | Total: 10h 24m | Avg: 17m 20s | Max:  1h 48m | Hits: 539%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 44s | Avg:  5m 26s | Max:  5m 46s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 27m | Avg: 12m 25s | Max: 40m 07s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 25s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 47s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 GCC12              Pass: 100%/3   | Total: 29m 43s | Avg:  9m 54s | Max: 19m 15s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 03m | Avg: 37m 57s | Max:  1h 48m
      🟩 MSVC14.29          Pass: 100%/2   | Total: 57m 24s | Avg: 28m 42s | Max: 28m 42s | Hits: 539%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 50s | Avg: 29m 55s | Max: 30m 16s | Hits: 539%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  2h 05m | Avg:  8m 59s | Max: 40m 07s
      🟩 GCC                Pass: 100%/18  | Total:  6h 12m | Avg: 20m 40s | Max:  1h 48m
      🟩 MSVC               Pass: 100%/4   | Total:  1h 57m | Avg: 29m 18s | Max: 30m 16s | Hits: 539%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 21s | Avg:  9m 10s | Max:  9m 16s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 34s | Avg: 11m 47s | Max: 19m 15s
      🟩 v100               Pass: 100%/36  | Total: 10h 09m | Avg: 16m 56s | Max:  1h 48m | Hits: 539%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  4h 29m | Avg:  8m 41s | Max: 30m 16s | Hits: 539%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 40m | Avg:  1h 40m | Max:  1h 40m
      🟩 GraphCapture       Pass: 100%/1   | Total:  1h 48m | Avg:  1h 48m | Max:  1h 48m
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 44s | Max: 30m 02s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 22m | Avg: 41m 24s | Max: 42m 42s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 34s | Avg: 11m 47s | Max: 19m 15s
      🟩 90a                Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  2h 30m | Avg: 10m 43s | Max: 29m 34s | Hits: 539%/2655  
      🟩 20                 Pass: 100%/24  | Total:  8h 03m | Avg: 20m 08s | Max:  1h 48m | Hits: 539%/885   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 6h 40m | Avg: 10m 49s | Max: 35m 15s | Hits: 365%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 29m 01s | Avg: 14m 30s | Max: 22m 37s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  6h 30m | Avg: 11m 10s | Max: 35m 15s | Hits: 365%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 30s | Avg:  4m 45s | Max:  4m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 48s | Avg:  9m 33s | Max: 27m 14s | Hits: 365%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
      🟩 12.6               Pass: 100%/30  | Total:  5h 23m | Avg: 10m 46s | Max: 35m 15s | Hits: 365%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 10s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 48s | Avg:  9m 33s | Max: 27m 14s | Hits: 365%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
      🟩 nvcc12.6           Pass: 100%/28  | Total:  5h 12m | Avg: 11m 10s | Max: 35m 15s | Hits: 365%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 10s
      🟩 nvcc               Pass: 100%/35  | Total:  6h 30m | Avg: 11m 08s | Max: 35m 15s | Hits: 365%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 49s | Avg:  5m 27s | Max:  5m 42s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 36s | Avg:  5m 36s | Max:  5m 36s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang18            Pass: 100%/7   | Total: 46m 17s | Avg:  6m 36s | Max: 12m 24s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 11s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 55s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 33m | Avg: 11m 41s | Max: 35m 15s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 56m 18s | Avg: 28m 09s | Max: 29m 04s | Hits: 365%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 33m | Avg: 31m 07s | Max: 34m 04s | Hits: 365%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 24m | Avg:  6m 02s | Max: 12m 24s
      🟩 GCC                Pass: 100%/16  | Total:  2h 16m | Avg:  8m 33s | Max: 35m 15s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 29m | Avg: 29m 56s | Max: 34m 04s | Hits: 365%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 28s | Avg: 14m 44s | Max: 15m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  6h 40m | Avg: 10m 49s | Max: 35m 15s | Hits: 365%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  4h 40m | Avg:  9m 02s | Max: 30m 29s | Hits: 365%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 48s | Avg: 16m 36s | Max: 34m 04s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 25s | Max: 35m 15s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  2h 33m | Avg: 10m 58s | Max: 29m 04s | Hits: 365%/5532  
      🟩 20                 Pass: 100%/21  | Total:  3h 37m | Avg: 10m 22s | Max: 35m 15s | Hits: 365%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 49m | Avg: 5m 28s | Max: 16m 29s | Hits: 388%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 39m | Avg:  6m 11s | Max: 16m 29s | Hits: 388%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 23s | Avg:  2m 35s | Max:  2m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
      🟩 12.6               Pass: 100%/17  | Total:  1h 27m | Avg:  5m 07s | Max: 16m 29s | Hits: 388%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 27m | Avg:  5m 07s | Max: 16m 29s | Hits: 388%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 49m | Avg:  5m 28s | Max: 16m 29s | Hits: 388%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s
      🟩 Clang18            Pass: 100%/4   | Total: 24m 56s | Avg:  6m 14s | Max: 16m 29s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 01s | Avg:  3m 01s | Max:  3m 01s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 51s | Avg:  9m 55s | Max: 16m 29s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 42s | Avg:  2m 40s | Max:  2m 47s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 38s | Avg: 11m 38s | Max: 11m 38s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 12s | Avg: 12m 12s | Max: 12m 12s | Hits: 388%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 38m 19s | Avg:  4m 47s | Max: 16m 29s
      🟩 GCC                Pass: 100%/8   | Total: 36m 38s | Avg:  4m 34s | Max: 16m 29s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 50s | Avg: 11m 55s | Max: 12m 12s | Hits: 388%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 49m | Avg:  5m 28s | Max: 16m 29s | Hits: 388%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 16m | Avg:  4m 14s | Max: 12m 12s | Hits: 388%/522   
      🟩 Test               Pass: 100%/2   | Total: 32m 58s | Avg: 16m 29s | Max: 16m 29s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
      🟩 90a                Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 03s | Avg:  3m 15s | Max:  5m 04s
      🟩 20                 Pass: 100%/16  | Total:  1h 36m | Avg:  6m 01s | Max: 16m 29s | Hits: 388%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 25m 11s | Avg: 6m 17s | Max: 11m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 16m 12s | Avg:  8m 06s | Max: 11m 51s
      🟩 12.6               Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 37s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 16m 12s | Avg:  8m 06s | Max: 11m 51s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 11m 51s | Avg: 11m 51s | Max: 11m 51s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 11m 51s
      🟩 GCC                Pass: 100%/2   | Total:  8m 43s | Avg:  4m 21s | Max:  4m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 25m 11s | Avg:  6m 17s | Max: 11m 51s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 18s | Avg: 5m 39s | Max: 9m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  9m 19s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟩 Test               Pass: 100%/1   | Total:  9m 19s | Avg:  9m 19s | Max:  9m 19s
    
  • 🟩 python: Pass: 100%/1 | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 32m 11s | Avg: 32m 11s | Max: 32m 11s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 148)

# Runner
98 linux-amd64-cpu16
23 linux-amd64-gpu-v100-latest-1
16 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato enabled auto-merge (squash) January 21, 2025 19:55
@fbusato fbusato merged commit d507a78 into NVIDIA:main Jan 21, 2025
161 of 164 checks passed
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 22, 2025
davebayer added a commit to davebayer/cccl that referenced this pull request Jan 22, 2025
update docs

update docs

add `memcmp`, `memmove` and `memchr` implementations

implement tests

Use cuda::std::min/max in Thrust (NVIDIA#3364)

Implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16` (NVIDIA#3361)

* implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16`

Cleanup util_arch (NVIDIA#2773)

Deprecate thrust::null_type (NVIDIA#3367)

Deprecate cub::DeviceSpmv (NVIDIA#3320)

Fixes: NVIDIA#896

Improves `DeviceSegmentedSort` test run time for large number of items and segments (NVIDIA#3246)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* fixes spelling

* adds tests for large number of segments

* fixes narrowing conversion in tests

* addresses review comments

* fixes includes

Compile basic infra test with C++17 (NVIDIA#3377)

Adds support for large number of items and large number of segments to `DeviceSegmentedSort` (NVIDIA#3308)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* addresses review comments

* introduces segment offset type

* adds tests for large number of segments

* adds support for large number of segments

* drops segment offset type

* fixes thrust namespace

* removes about-to-be-deprecated cub iterators

* no exec specifier on defaulted ctor

* fixes gcc7 linker error

* uses local_segment_index_t throughout

* determine offset type based on type returned by segment iterator begin/end iterators

* minor style improvements

Exit with error when RAPIDS CI fails. (NVIDIA#3385)

cuda.parallel: Support structured types as algorithm inputs (NVIDIA#3218)

* Introduce gpu_struct decorator and typing

* Enable `reduce` to accept arrays of structs as inputs

* Add test for reducing arrays-of-struct

* Update documentation

* Use a numpy array rather than ctypes object

* Change zeros -> empty for output array and temp storage

* Add a TODO for typing GpuStruct

* Documentation updates

* Remove test_reduce_struct_type from test_reduce.py

* Revert to `to_cccl_value()` accepting ndarray + GpuStruct

* Bump copyrights

---------

Co-authored-by: Ashwin Srinath <[email protected]>

Deprecate thrust::async (NVIDIA#3324)

Fixes: NVIDIA#100

Review/Deprecate CUB `util.ptx` for CCCL 2.x (NVIDIA#3342)

Fix broken `_CCCL_BUILTIN_ASSUME` macro (NVIDIA#3314)

* add compiler-specific path
* fix device code path
* add _CCCL_ASSUME

Deprecate thrust::numeric_limits (NVIDIA#3366)

Replace `typedef` with `using` in libcu++ (NVIDIA#3368)

Deprecate thrust::optional (NVIDIA#3307)

Fixes: NVIDIA#3306

Upgrade to Catch2 3.8  (NVIDIA#3310)

Fixes: NVIDIA#1724

refactor `<cuda/std/cstdint>` (NVIDIA#3325)

Co-authored-by: Bernhard Manfred Gruber <[email protected]>

Update CODEOWNERS (NVIDIA#3331)

* Update CODEOWNERS

* Update CODEOWNERS

* Update CODEOWNERS

* [pre-commit.ci] auto code formatting

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Fix sign-compare warning (NVIDIA#3408)

Implement more cmath functions to be usable on host and device (NVIDIA#3382)

* Implement more cmath functions to be usable on host and device

* Implement math roots functions

* Implement exponential functions

Redefine and deprecate thrust::remove_cvref (NVIDIA#3394)

* Redefine and deprecate thrust::remove_cvref

Co-authored-by: Michael Schellenberger Costa <[email protected]>

Fix assert definition for NVHPC due to constexpr issues (NVIDIA#3418)

NVHPC cannot decide at compile time where the code would run so _CCCL_ASSERT within a constexpr function breaks it.

Fix this by always using the host definition which should also work on device.

Fixes NVIDIA#3411

Extend CUB reduce benchmarks (NVIDIA#3401)

* Rename max.cu to custom.cu, since it uses a custom operator
* Extend types covered by min.cu to all fundamental types
* Add some notes on how to collect tuning parameters

Fixes: NVIDIA#3283

Update upload-pages-artifact to v3 (NVIDIA#3423)

* Update upload-pages-artifact to v3

* Empty commit

---------

Co-authored-by: Ashwin Srinath <[email protected]>

Replace and deprecate thrust::cuda_cub::terminate (NVIDIA#3421)

`std::linalg` accessors and `transposed_layout` (NVIDIA#2962)

Add round up/down to multiple (NVIDIA#3234)
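
For readers unfamiliar with the operation named in the commit title above, the arithmetic of rounding to a multiple can be sketched as follows. This is an illustrative Python sketch only, not the libcu++ implementation; the function names `round_up` and `round_down` and the restriction to non-negative integers are assumptions made for the example.

```python
def round_up(value: int, multiple: int) -> int:
    """Return the smallest multiple of `multiple` that is >= `value`.

    Hypothetical helper for illustration; assumes value >= 0 and multiple > 0.
    """
    if value < 0 or multiple <= 0:
        raise ValueError("expected value >= 0 and multiple > 0")
    # Adding (multiple - 1) before the floor division pushes any
    # non-multiple value past the next boundary.
    return ((value + multiple - 1) // multiple) * multiple


def round_down(value: int, multiple: int) -> int:
    """Return the largest multiple of `multiple` that is <= `value`.

    Hypothetical helper for illustration; assumes value >= 0 and multiple > 0.
    """
    if value < 0 or multiple <= 0:
        raise ValueError("expected value >= 0 and multiple > 0")
    # Floor division discards the remainder.
    return (value // multiple) * multiple
```

A typical use is padding a size to an alignment boundary, for example `round_up(num_bytes, 256)`.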

[FEA]: Introduce Python module with CCCL headers (NVIDIA#3201)

* Add cccl/python/cuda_cccl directory and use from cuda_parallel, cuda_cooperative

* Run `copy_cccl_headers_to_aude_include()` before `setup()`

* Create python/cuda_cccl/cuda/_include/__init__.py, then simply import cuda._include to find the include path.

* Add cuda.cccl._version exactly as for cuda.cooperative and cuda.parallel

* Bug fix: cuda/_include only exists after shutil.copytree() ran.

* Use `f"cuda-cccl @ file://{cccl_path}/python/cuda_cccl"` in setup.py

* Remove CustomBuildCommand, CustomWheelBuild in cuda_parallel/setup.py (they are equivalent to the default functions)

* Replace := operator (needs Python 3.8+)

* Fix oversights: remove `pip3 install ./cuda_cccl` lines from README.md

* Restore original README.md: `pip3 install -e` now works on first pass.

* cuda_cccl/README.md: FOR INTERNAL USE ONLY

* Remove `$pymajor.$pyminor.` prefix in cuda_cccl _version.py (as suggested under NVIDIA#3201 (comment))

Command used: ci/update_version.sh 2 8 0

* Modernize pyproject.toml, setup.py

Trigger for this change:

* NVIDIA#3201 (comment)

* NVIDIA#3201 (comment)

* Install CCCL headers under cuda.cccl.include

Trigger for this change:

* NVIDIA#3201 (comment)

Unexpected accidental discovery: cuda.cooperative unit tests pass without CCCL headers entirely.

* Factor out cuda_cccl/cuda/cccl/include_paths.py

* Reuse cuda_cccl/cuda/cccl/include_paths.py from cuda_cooperative

* Add missing Copyright notice.

* Add missing __init__.py (cuda.cccl)

* Add `"cuda.cccl"` to `autodoc.mock_imports`

* Move cuda.cccl.include_paths into function where it is used. (Attempt to resolve Build and Verify Docs failure.)

* Add # TODO: move this to a module-level import

* Modernize cuda_cooperative/pyproject.toml, setup.py

* Convert cuda_cooperative to use hatchling as build backend.

* Revert "Convert cuda_cooperative to use hatchling as build backend."

This reverts commit 61637d6.

* Move numpy from [build-system] requires -> [project] dependencies

* Move pyproject.toml [project] dependencies -> setup.py install_requires, to be able to use CCCL_PATH

* Remove copy_license() and use license_files=["../../LICENSE"] instead.

* Further modernize cuda_cccl/setup.py to use pathlib

* Trivial simplifications in cuda_cccl/pyproject.toml

* Further simplify cuda_cccl/pyproject.toml, setup.py: remove inconsequential code

* Make cuda_cooperative/pyproject.toml more similar to cuda_cccl/pyproject.toml

* Add taplo-pre-commit to .pre-commit-config.yaml

* taplo-pre-commit auto-fixes

* Use pathlib in cuda_cooperative/setup.py

* CCCL_PYTHON_PATH in cuda_cooperative/setup.py

* Modernize cuda_parallel/pyproject.toml, setup.py

* Use pathlib in cuda_parallel/setup.py

* Add `# TOML lint & format` comment.

* Replace MANIFEST.in with `[tool.setuptools.package-data]` section in pyproject.toml

* Use pathlib in cuda/cccl/include_paths.py

* pre-commit autoupdate (EXCEPT clang-format, which was manually restored)

* Fixes after git merge main

* Resolve warning: AttributeError: '_Reduce' object has no attribute 'build_result'

```
=========================================================================== warnings summary ===========================================================================
tests/test_reduce.py::test_reduce_non_contiguous
  /home/coder/cccl/python/devenv/lib/python3.12/site-packages/_pytest/unraisableexception.py:85: PytestUnraisableExceptionWarning: Exception ignored in: <function _Reduce.__del__ at 0x7bf123139080>

  Traceback (most recent call last):
    File "/home/coder/cccl/python/cuda_parallel/cuda/parallel/experimental/algorithms/reduce.py", line 132, in __del__
      bindings.cccl_device_reduce_cleanup(ctypes.byref(self.build_result))
                                                       ^^^^^^^^^^^^^^^^^
  AttributeError: '_Reduce' object has no attribute 'build_result'

    warnings.warn(pytest.PytestUnraisableExceptionWarning(msg))

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================================================= 1 passed, 93 deselected, 1 warning in 0.44s ==============================================================
```

* Move `copy_cccl_headers_to_cuda_cccl_include()` functionality to `class CustomBuildPy`

* Introduce cuda_cooperative/constraints.txt

* Also add cuda_parallel/constraints.txt

* Add `--constraint constraints.txt` in ci/test_python.sh

* Update Copyright dates

* Switch to https://github.com/ComPWA/taplo-pre-commit (the other repo has been archived by the owner on Jul 1, 2024)

For completeness: The other repo took a long time to install into the pre-commit cache; so long that it led to timeouts in the CCCL CI.

* Remove unused cuda_parallel jinja2 dependency (noticed by chance).

* Remove constraints.txt files, advertise running `pip install cuda-cccl` first instead.

* Make cuda_cooperative, cuda_parallel testing completely independent.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Fix sign-compare warning (NVIDIA#3408) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]"

This reverts commit ea33a21.

Error message: NVIDIA#3201 (comment)

* Try using A100 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Also show cuda-cooperative site-packages, cuda-parallel site-packages (after pip install) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using l4 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Restore original ci/matrix.yaml [skip-rapids]

* Use for loop in test_python.sh to avoid code duplication.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]

* Comment out taplo-lint in pre-commit config [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]"

This reverts commit ec206fd.

* Implement suggestion by @shwina (NVIDIA#3201 (review))

* Address feedback by @leofang

---------

Co-authored-by: Bernhard Manfred Gruber <[email protected]>
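
The `AttributeError: '_Reduce' object has no attribute 'build_result'` warning quoted in the list above is typical of a `__del__` method that runs on an object whose `__init__` raised before the attribute was assigned. The sketch below shows one common way to guard against that. It is not the change made in the PR: the class `Resource`, the `release` function, and the `released` list are hypothetical stand-ins for an object that owns a native handle.

```python
released = []  # hypothetical record of handles that were cleaned up


def release(handle) -> None:
    """Hypothetical stand-in for a native cleanup call."""
    released.append(handle)


class Resource:
    """Hypothetical object that owns a handle needing explicit cleanup."""

    def __init__(self, fail: bool = False):
        # Assign a sentinel before any step that can raise, so that
        # __del__ always finds the attribute.
        self.build_result = None
        if fail:
            raise RuntimeError("build failed")
        self.build_result = object()  # placeholder for a real handle

    def __del__(self):
        # getattr with a default also covers the case where __init__
        # raised before the sentinel assignment was reached.
        handle = getattr(self, "build_result", None)
        if handle is None:
            return  # nothing was built, so nothing to clean up
        release(handle)
```

With this guard, constructing `Resource(fail=True)` raises `RuntimeError` as before, but the finalizer no longer emits an unraisable `AttributeError` when the half-constructed object is collected.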

cuda.parallel: Add optional stream argument to reduce_into() (NVIDIA#3348)

* Add optional stream argument to reduce_into()

* Add tests to check for reduce_into() stream behavior

* Move protocol related utils to separate file and rework __cuda_stream__ error messages

* Fix synchronization issue in stream test and add one more invalid stream test case

* Rename cuda stream validation function after removing leading underscore

* Unpack values from __cuda_stream__ instead of indexing

* Fix linting errors

* Handle TypeError when unpacking invalid __cuda_stream__ return

* Use stream to allocate cupy memory in new stream test
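
The stream handling described in the list above (unpacking the `__cuda_stream__` value rather than indexing it, and handling the `TypeError` raised by an invalid return) can be sketched roughly as follows. This is a minimal sketch, not the code from the PR: the helper name `get_stream_handle` is hypothetical, and it assumes that `__cuda_stream__` is an attribute yielding a `(version, handle)` pair and that `0` denotes the default stream.

```python
def get_stream_handle(stream) -> int:
    """Hypothetical helper: return the raw stream handle as an int.

    Assumes `stream.__cuda_stream__` yields a (version, handle) pair.
    """
    if stream is None:
        return 0  # assumption: 0 stands for the default stream

    try:
        stream_property = stream.__cuda_stream__
    except AttributeError as e:
        raise TypeError(
            f"stream argument {stream!r} does not implement __cuda_stream__"
        ) from e

    try:
        # Unpacking fails loudly on a non-iterable (TypeError) or on a
        # sequence of the wrong length (ValueError); indexing would let
        # some malformed values slip through.
        version, handle = stream_property
    except (TypeError, ValueError) as e:
        raise TypeError(
            f"could not unpack a (version, handle) pair from {stream_property!r}"
        ) from e

    if not isinstance(handle, int):
        raise TypeError(f"invalid stream handle {handle!r}")
    return handle
```

Converting every failure into a single `TypeError` with a descriptive message keeps the error surface small for callers that pass an arbitrary object as the stream argument.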

Upgrade to actions/deploy-pages@v4 (from v2), as suggested by @leofang (NVIDIA#3434)

Deprecate `cub::{min, max}` and replace internal uses with those from libcu++ (NVIDIA#3419)

* Deprecate `cub::{min, max}` and replace internal uses with those from libcu++

Fixes NVIDIA#3404

Fix CI issues (NVIDIA#3443)

Remove deprecated `cub::min` (NVIDIA#3450)

* Remove deprecated `cuda::{min,max}`

* Drop unused `thrust::remove_cvref` file

Fix typo in builtin (NVIDIA#3451)

Moves agents to `detail::<algorithm_name>` namespace (NVIDIA#3435)

uses unsigned offset types in thrust's scan dispatch (NVIDIA#3436)

Default transform_iterator's copy ctor (NVIDIA#3395)

Fixes: NVIDIA#2393

Turn C++ dialect warning into error (NVIDIA#3453)

Uses unsigned offset types in thrust's sort algorithm calling into `DispatchMergeSort` (NVIDIA#3437)

* uses thrust's dynamic dispatch for merge_sort

* [pre-commit.ci] auto code formatting

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Refactor allocator handling of contiguous_storage (NVIDIA#3050)

Co-authored-by: Michael Schellenberger Costa <[email protected]>

Drop thrust::detail::integer_traits (NVIDIA#3391)

Add cuda::is_floating_point supporting half and bfloat (NVIDIA#3379)

Co-authored-by: Michael Schellenberger Costa <[email protected]>

Improve docs of std headers (NVIDIA#3416)

Drop C++11 and C++14 support for all of cccl (NVIDIA#3417)

* Drop C++11 and C++14 support for all of cccl

---------

Co-authored-by: Bernhard Manfred Gruber <[email protected]>

Deprecate a few CUB macros (NVIDIA#3456)

Deprecate thrust universal iterator categories (NVIDIA#3461)

Fix launch args order (NVIDIA#3465)

Add `--extended-lambda` to the list of removed clangd flags (NVIDIA#3432)

add `_CCCL_HAS_NVFP8` macro (NVIDIA#3429)

Add `_CCCL_BUILTIN_PREFETCH` (NVIDIA#3433)

Drop universal iterator categories (NVIDIA#3474)

Ensure that headers in `<cuda/*>` can be built with a C++ only compiler (NVIDIA#3472)

Specialize __is_extended_floating_point for FP8 types (NVIDIA#3470)

Also ensure that we actually can enable FP8 due to FP16 and BF16 requirements

Co-authored-by: Michael Schellenberger Costa <[email protected]>

Moves CUB kernel entry points to a detail namespace (NVIDIA#3468)

* moves emptykernel to detail ns

* second batch

* third batch

* fourth batch

* fixes cuda parallel

* concatenates nested namespaces

Deprecate block/warp algo specializations (NVIDIA#3455)

Fixes: NVIDIA#3409

Refactor CUB's util_debug (NVIDIA#3345)
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 29, 2025
Labels
2.8.0 target for 2.8.0 release
Projects
Status: Done

3 participants