Cleanup and modularize <cuda/std/barrier> #2443

Merged
merged 28 commits into from
Oct 5, 2024

@miscco miscco commented Sep 20, 2024

This cleans up one of the more complicated headers we had.

It essentially had two different contents: barrier and memcpy_async.

Disentangling them helps a lot there.

@miscco miscco requested review from a team as code owners September 20, 2024 15:56
@miscco miscco added the "feature request" and "libcu++" labels Sep 20, 2024
@miscco miscco force-pushed the modularize_barrier branch 2 times, most recently from 831483f to 36f2a0e Compare September 23, 2024 09:05
Comment on lines 150 to 156
NV_PROVIDES_SM_70,
(
if (!__isShared(&__barrier)) { return __barrier.arrive(__update); }

unsigned int __mask = __activemask();
unsigned int __activeA = __match_any_sync(__mask, __update);
unsigned int __activeB = __match_any_sync(__mask, reinterpret_cast<_CUDA_VSTD::uintptr_t>(&__barrier));
Collaborator Author

@wmaxey I believe this is kind of a fix, because before SM70 there is no __match_any_sync.
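For readers unfamiliar with the intrinsic, here is a rough host-side sketch of what the two __match_any_sync calls in the snippet under review compute. It is a plain C++ emulation, not the CUDA intrinsic and not the code from this PR. The names match_any and same_barrier_and_update are hypothetical, and intersecting the two masks is an assumption about how __activeA and __activeB are used afterwards, since the quoted lines stop before that point.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Host-side emulation of the semantics of __match_any_sync for a single warp.
// values[i] is the value held by lane i; mask selects the participating lanes.
// Returns the bitmask of participating lanes that hold the same value as `lane`.
// Hypothetical helper for illustration only.
inline std::uint32_t match_any(std::uint32_t mask,
                               const std::vector<std::uint64_t>& values,
                               unsigned lane)
{
  std::uint32_t result = 0;
  for (unsigned other = 0; other < values.size() && other < 32; ++other)
  {
    const bool participates = ((mask >> other) & 1u) != 0;
    if (participates && values[other] == values[lane])
    {
      result |= (1u << other);
    }
  }
  return result;
}

// Lanes that agree with `lane` on both the update count and the barrier address.
// Assumption: the two masks from the reviewed snippet are combined with a bitwise AND.
inline std::uint32_t same_barrier_and_update(std::uint32_t mask,
                                             const std::vector<std::uint64_t>& updates,
                                             const std::vector<std::uint64_t>& barrier_addrs,
                                             unsigned lane)
{
  return match_any(mask, updates, lane) & match_any(mask, barrier_addrs, lane);
}
```

With four active lanes holding updates {1, 1, 2, 1} and barrier addresses {100, 100, 100, 200}, lane 0 matches lanes 0, 1, 3 on the update and lanes 0, 1, 2 on the address, so the intersection for lane 0 is 0x3 (lanes 0 and 1).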

@miscco miscco force-pushed the modularize_barrier branch 3 times, most recently from 4e34f08 to 7656ef4 Compare September 23, 2024 10:56
Contributor

🟩 CI finished in 53m 14s: Pass: 100%/364 | Total: 2d 02h | Avg: 8m 18s | Max: 45m 45s | Hits: 75%/25855
  • 🟩 cub: Pass: 100%/104 | Total: 12h 04m | Avg: 6m 57s | Max: 45m 45s | Hits: 98%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 29m | Avg:  7m 10s | Max: 45m 45s | Hits:  98%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 35m 21s | Avg:  4m 25s | Max:  4m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 07m | Avg:  4m 29s | Max: 15m 39s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 13m 16s | Avg:  4m 25s | Max:  4m 29s
      🟩 12.6               Pass: 100%/86  | Total: 10h 43m | Avg:  7m 29s | Max: 45m 45s | Hits:  98%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 12s | Avg:  3m 36s | Max:  3m 42s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 07m | Avg:  4m 29s | Max: 15m 39s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 13m 16s | Avg:  4m 25s | Max:  4m 29s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 36m | Avg:  7m 34s | Max: 45m 45s | Hits:  98%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 12s | Avg:  3m 36s | Max:  3m 42s
      🟩 nvcc               Pass: 100%/102 | Total: 11h 57m | Avg:  7m 01s | Max: 45m 45s | Hits:  98%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 57s | Avg:  4m 39s | Max:  5m 58s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 48s | Avg:  5m 36s | Max:  6m 18s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 18s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 55s | Avg:  4m 43s | Max:  5m 03s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 39s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 47s | Avg:  4m 41s | Max:  5m 28s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 31s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 27s | Avg:  4m 51s | Max:  5m 21s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 00s | Avg:  5m 00s | Max:  5m 12s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 17m | Avg:  8m 35s | Max: 25m 57s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 52s | Avg:  3m 26s | Max:  3m 35s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 03s | Avg:  4m 00s | Max:  4m 47s
      🟩 GCC8               Pass: 100%/6   | Total: 24m 58s | Avg:  4m 09s | Max:  4m 46s
      🟩 GCC9               Pass: 100%/6   | Total: 23m 59s | Avg:  3m 59s | Max:  4m 23s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 31s | Avg:  4m 37s | Max:  4m 43s
      🟩 GCC11              Pass: 100%/7   | Total: 31m 50s | Avg:  4m 32s | Max:  5m 00s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 45s | Avg:  4m 41s | Max:  4m 53s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 05m | Avg: 11m 34s | Max: 41m 37s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 40s | Avg:  5m 13s | Max:  5m 22s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 39s | Avg: 15m 39s | Max: 15m 39s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 03s | Avg: 29m 01s | Max: 44m 59s | Hits:  98%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 45m 45s | Avg: 45m 45s | Max: 45m 45s | Hits:  98%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 15m | Avg:  5m 33s | Max: 25m 57s
      🟩 GCC                Pass: 100%/51  | Total:  5h 34m | Avg:  6m 32s | Max: 41m 37s
      🟩 Intel              Pass: 100%/3   | Total: 15m 40s | Avg:  5m 13s | Max:  5m 22s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 59m | Avg: 29m 51s | Max: 45m 45s | Hits:  98%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 04m | Avg:  6m 57s | Max: 45m 45s | Hits:  98%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 54m | Avg:  5m 34s | Max: 45m 45s | Hits:  98%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 20s | Avg: 18m 20s | Max: 18m 20s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 51s | Avg: 15m 51s | Max: 15m 51s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 00m | Avg: 20m 08s | Max: 21m 33s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 41s | Max: 41m 37s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 13m 16s | Avg:  4m 25s | Max:  4m 29s
      🟩 90a                Pass: 100%/4   | Total: 15m 55s | Avg:  3m 58s | Max:  4m 22s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 38m | Avg:  5m 39s | Max: 27m 31s
      🟩 14                 Pass: 100%/27  | Total:  2h 19m | Avg:  5m 11s | Max: 15m 39s | Hits:  99%/1454  
      🟩 17                 Pass: 100%/26  | Total:  2h 41m | Avg:  6m 13s | Max: 44m 59s | Hits:  98%/727   
      🟩 20                 Pass: 100%/23  | Total:  4h 24m | Avg: 11m 29s | Max: 45m 45s | Hits:  98%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 1d 01h | Avg: 14m 42s | Max: 43m 16s | Hits: 49%/11583

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 23h 39m | Avg: 14m 46s | Max: 43m 16s | Hits:  49%/11583 
      🟩 arm64              Pass: 100%/8   | Total:  1h 50m | Avg: 13m 46s | Max: 23m 36s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 28m | Avg:  9m 54s | Max: 28m 01s | Hits:  68%/2698  
      🟩 11.8               Pass: 100%/3   | Total: 53m 27s | Avg: 17m 49s | Max: 27m 26s
      🟩 12.6               Pass: 100%/86  | Total: 22h 07m | Avg: 15m 25s | Max: 43m 16s | Hits:  43%/8885  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 36m 26s | Avg: 18m 13s | Max: 19m 02s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 28m | Avg:  9m 54s | Max: 28m 01s | Hits:  68%/2698  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 53m 27s | Avg: 17m 49s | Max: 27m 26s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 21h 30m | Avg: 15m 21s | Max: 43m 16s | Hits:  43%/8885  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 36m 26s | Avg: 18m 13s | Max: 19m 02s
      🟩 nvcc               Pass: 100%/102 | Total:  1d 00h | Avg: 14m 38s | Max: 43m 16s | Hits:  49%/11583 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 17m | Avg: 12m 51s | Max: 22m 39s
      🟩 Clang10            Pass: 100%/3   | Total: 29m 47s | Avg:  9m 55s | Max: 19m 37s
      🟩 Clang11            Pass: 100%/4   | Total: 58m 50s | Avg: 14m 42s | Max: 25m 38s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 58s | Avg:  4m 29s | Max:  5m 13s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 01m | Avg: 15m 24s | Max: 19m 25s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 07m | Avg: 16m 57s | Max: 25m 03s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 10m | Avg: 17m 33s | Max: 24m 51s
      🟩 Clang16            Pass: 100%/4   | Total: 51m 24s | Avg: 12m 51s | Max: 24m 52s
      🟩 Clang17            Pass: 100%/4   | Total: 51m 00s | Avg: 12m 45s | Max: 23m 28s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 55m | Avg: 14m 23s | Max: 26m 40s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 33s | Avg:  3m 16s | Max:  3m 43s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 22m | Avg: 13m 43s | Max: 22m 08s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 01m | Avg: 10m 18s | Max: 20m 41s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 15m | Avg: 12m 32s | Max: 24m 52s
      🟩 GCC10              Pass: 100%/4   | Total: 36m 40s | Avg:  9m 10s | Max: 24m 13s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 17m | Avg: 19m 39s | Max: 27m 26s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 05m | Avg: 16m 23s | Max: 20m 44s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 48m | Avg: 13m 26s | Max: 26m 39s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 27m | Avg: 29m 14s | Max: 33m 45s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 28m 01s | Avg: 28m 01s | Max: 28m 01s | Hits:  68%/2698  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 15m | Avg: 37m 32s | Max: 38m 03s | Hits:  44%/5758  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 43m 16s | Avg: 43m 16s | Max: 43m 16s | Hits:  41%/3127  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total: 10h 00m | Avg: 13m 21s | Max: 26m 40s
      🟩 GCC                Pass: 100%/52  | Total: 11h 34m | Avg: 13m 21s | Max: 27m 26s
      🟩 Intel              Pass: 100%/3   | Total:  1h 27m | Avg: 29m 14s | Max: 33m 45s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 26m | Avg: 36m 35s | Max: 43m 16s | Hits:  49%/11583 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  1d 01h | Avg: 14m 42s | Max: 43m 16s | Hits:  49%/11583 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 22h 50m | Avg: 14m 16s | Max: 43m 16s | Hits:  49%/11583 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 35m | Avg: 23m 45s | Max: 26m 39s
      🟩 Test               Pass: 100%/3   | Total:  1h 01m | Avg: 20m 37s | Max: 26m 40s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 53m 27s | Avg: 17m 49s | Max: 27m 26s
      🟩 90a                Pass: 100%/4   | Total: 26m 09s | Avg:  6m 32s | Max:  6m 52s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  5h 51m | Avg: 12m 33s | Max: 25m 48s
      🟩 14                 Pass: 100%/28  | Total:  6h 21m | Avg: 13m 38s | Max: 37m 02s | Hits:  56%/5497  
      🟩 17                 Pass: 100%/27  | Total:  7h 41m | Avg: 17m 04s | Max: 38m 03s | Hits:  42%/2959  
      🟩 20                 Pass: 100%/20  | Total:  5h 32m | Avg: 16m 36s | Max: 43m 16s | Hits:  41%/3127  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 10h 11m | Avg: 5m 56s | Max: 23m 25s | Hits: 96%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  9h 36m | Avg:  6m 04s | Max: 23m 25s | Hits:  96%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 35m 21s | Avg:  4m 25s | Max:  4m 55s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 16m | Avg:  5m 06s | Max: 19m 13s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 18s
      🟩 12.6               Pass: 100%/85  | Total:  8h 39m | Avg:  6m 06s | Max: 23m 25s | Hits:  96%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 27s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 16m | Avg:  5m 06s | Max: 19m 13s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 18s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  8h 29m | Avg:  6m 08s | Max: 23m 25s | Hits:  96%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 27s
      🟩 nvcc               Pass: 100%/101 | Total: 10h 01m | Avg:  5m 57s | Max: 23m 25s | Hits:  96%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 32s | Avg:  4m 55s | Max:  6m 07s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 25s | Avg:  5m 48s | Max:  6m 10s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 40s | Avg:  4m 40s | Max:  4m 49s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 46s | Avg:  4m 41s | Max:  5m 05s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  4m 50s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 41s | Avg:  4m 40s | Max:  4m 58s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 06s | Avg:  5m 01s | Max:  5m 24s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 41s | Avg:  4m 55s | Max:  5m 05s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 54s | Avg:  4m 58s | Max:  5m 20s
      🟩 Clang18            Pass: 100%/9   | Total: 54m 36s | Avg:  6m 04s | Max: 15m 33s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 38s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 20s | Avg:  4m 23s | Max:  5m 13s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 48s | Avg:  4m 18s | Max:  5m 01s
      🟩 GCC9               Pass: 100%/6   | Total: 27m 56s | Avg:  4m 39s | Max:  5m 48s
      🟩 GCC10              Pass: 100%/4   | Total: 20m 09s | Avg:  5m 02s | Max:  5m 16s
      🟩 GCC11              Pass: 100%/7   | Total: 35m 26s | Avg:  5m 03s | Max:  5m 27s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 23s | Avg:  5m 05s | Max:  5m 19s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 28m | Avg:  6m 17s | Max: 15m 31s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 01s | Avg:  6m 20s | Max:  7m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 13s | Avg: 19m 13s | Max: 19m 13s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 38m 11s | Avg: 19m 05s | Max: 21m 11s | Hits:  96%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 46m 46s | Avg: 23m 23s | Max: 23m 25s | Hits:  95%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  3h 55m | Avg:  5m 07s | Max: 15m 33s
      🟩 GCC                Pass: 100%/49  | Total:  4h 12m | Avg:  5m 09s | Max: 15m 31s
      🟩 Intel              Pass: 100%/3   | Total: 19m 01s | Avg:  6m 20s | Max:  7m 03s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 44m | Avg: 20m 50s | Max: 23m 25s | Hits:  96%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 10h 11m | Avg:  5m 56s | Max: 23m 25s | Hits:  96%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 43m | Avg:  5m 27s | Max: 23m 25s | Hits:  96%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 52s | Avg: 11m 13s | Max: 23m 21s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 43m 18s | Avg: 14m 26s | Max: 15m 33s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 18s
      🟩 90a                Pass: 100%/4   | Total: 16m 30s | Avg:  4m 07s | Max:  4m 29s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 15m | Avg:  4m 51s | Max: 12m 14s
      🟩 14                 Pass: 100%/27  | Total:  2h 35m | Avg:  5m 46s | Max: 19m 13s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  2h 28m | Avg:  5m 41s | Max: 21m 11s | Hits:  94%/2230  
      🟩 20                 Pass: 100%/22  | Total:  2h 51m | Avg:  7m 49s | Max: 23m 25s | Hits:  95%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 22m | Avg: 2m 44s | Max: 10m 04s | Hits: 90%/214

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 13m | Avg:  2m 46s | Max: 10m 04s | Hits:  90%/214   
      🟩 arm64              Pass: 100%/4   | Total:  9m 14s | Avg:  2m 18s | Max:  2m 27s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 49m 56s | Avg:  2m 37s | Max: 10m 02s | Hits:  90%/107   
      🟩 12.6               Pass: 100%/33  | Total:  1h 32m | Avg:  2m 48s | Max: 10m 04s | Hits:  90%/107   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 49m 56s | Avg:  2m 37s | Max: 10m 02s | Hits:  90%/107   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 32m | Avg:  2m 48s | Max: 10m 04s | Hits:  90%/107   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 10m 04s | Hits:  90%/214   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 23s | Avg:  2m 41s | Max:  3m 06s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 29s | Avg:  2m 14s | Max:  2m 39s
      🟩 Clang11            Pass: 100%/4   | Total:  8m 46s | Avg:  2m 11s | Max:  2m 35s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 58s | Avg:  2m 29s | Max:  2m 53s
      🟩 Clang13            Pass: 100%/4   | Total:  8m 41s | Avg:  2m 10s | Max:  2m 40s
      🟩 Clang14            Pass: 100%/4   | Total: 11m 04s | Avg:  2m 46s | Max:  3m 36s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 10s | Avg:  2m 35s | Max:  2m 42s
      🟩 Clang16            Pass: 100%/4   | Total:  9m 58s | Avg:  2m 29s | Max:  2m 49s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 32s | Avg:  2m 46s | Max:  2m 46s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 28s | Avg:  3m 14s | Max:  4m 21s
      🟩 GCC9               Pass: 100%/2   | Total:  3m 52s | Avg:  1m 56s | Max:  2m 04s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 05s | Avg:  2m 01s | Max:  2m 11s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 32s | Avg:  2m 08s | Max:  2m 26s
      🟩 GCC12              Pass: 100%/7   | Total: 19m 44s | Avg:  2m 49s | Max:  3m 55s
      🟩 GCC13              Pass: 100%/3   | Total:  6m 53s | Avg:  2m 17s | Max:  2m 27s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 02s | Avg: 10m 02s | Max: 10m 02s | Hits:  90%/107   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s | Hits:  90%/107   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 15m | Avg:  2m 30s | Max:  4m 21s
      🟩 GCC                Pass: 100%/20  | Total: 47m 06s | Avg:  2m 21s | Max:  3m 55s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 06s | Avg: 10m 03s | Max: 10m 04s | Hits:  90%/214   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 10m 04s | Hits:  90%/214   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 03m | Avg:  2m 37s | Max: 10m 04s | Hits:  90%/214   
      🟩 Test               Pass: 100%/5   | Total: 19m 07s | Avg:  3m 49s | Max:  4m 21s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  1m 52s | Avg:  1m 52s | Max:  1m 52s
      🟩 90a                Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 06m | Avg:  2m 23s | Max:  3m 53s
      🟩 20                 Pass: 100%/24  | Total:  1h 15m | Avg:  3m 09s | Max: 10m 04s | Hits:  90%/214   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Contributor

github-actions bot commented Oct 4, 2024

🟩 CI finished in 1h 51m: Pass: 100%/364 | Total: 2d 05h | Avg: 8m 48s | Max: 1h 22m | Hits: 10%/25912
  • 🟩 cub: Pass: 100%/104 | Total: 17h 28m | Avg: 10m 05s | Max: 1h 14m | Hits: 2%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 16h 53m | Avg: 10m 33s | Max:  1h 14m | Hits:   2%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 35m 28s | Avg:  4m 26s | Max:  4m 59s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 57m | Avg:  7m 49s | Max:  1h 04m | Hits:   2%/727   
      🟩 11.8               Pass: 100%/3   | Total: 13m 58s | Avg:  4m 39s | Max:  4m 51s
      🟩 12.6               Pass: 100%/86  | Total: 15h 17m | Avg: 10m 39s | Max:  1h 14m | Hits:   2%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 42s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 57m | Avg:  7m 49s | Max:  1h 04m | Hits:   2%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 13m 58s | Avg:  4m 39s | Max:  4m 51s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 15h 09m | Avg: 10m 49s | Max:  1h 14m | Hits:   2%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 42s
      🟩 nvcc               Pass: 100%/102 | Total: 17h 21m | Avg: 10m 12s | Max:  1h 14m | Hits:   2%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 09s | Avg:  4m 41s | Max:  5m 52s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 34s | Avg:  5m 31s | Max:  5m 52s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 07s | Avg:  4m 31s | Max:  4m 39s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 25s | Avg:  4m 36s | Max:  4m 52s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 04s | Avg:  4m 31s | Max:  4m 42s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 28s | Avg:  4m 37s | Max:  4m 58s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 55s | Avg:  4m 43s | Max:  5m 09s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 30s | Avg:  4m 52s | Max:  5m 03s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 46s | Avg:  4m 41s | Max:  5m 01s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 34m | Avg: 10m 29s | Max: 38m 22s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 48s | Avg:  3m 54s | Max:  4m 18s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 01s | Avg:  4m 00s | Max:  4m 55s
      🟩 GCC8               Pass: 100%/6   | Total: 24m 37s | Avg:  4m 06s | Max:  4m 21s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 07s | Avg:  4m 01s | Max:  4m 27s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 53s | Avg:  4m 28s | Max:  4m 34s
      🟩 GCC11              Pass: 100%/7   | Total: 32m 30s | Avg:  4m 38s | Max:  4m 58s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 55s | Avg:  4m 43s | Max:  5m 03s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 21m | Avg: 12m 34s | Max: 37m 27s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 42m | Avg: 54m 19s | Max: 56m 18s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:   2%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:   2%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 14m | Avg:  1h 14m | Max:  1h 14m | Hits:   2%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 29m | Avg:  5m 51s | Max: 38m 22s
      🟩 GCC                Pass: 100%/51  | Total:  5h 51m | Avg:  6m 53s | Max: 37m 27s
      🟩 Intel              Pass: 100%/3   | Total:  2h 42m | Avg: 54m 19s | Max: 56m 18s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 25m | Avg:  1h 06m | Max:  1h 14m | Hits:   2%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 17h 28m | Avg: 10m 05s | Max:  1h 14m | Hits:   2%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 13h 43m | Avg:  8m 34s | Max:  1h 14m | Hits:   2%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 16s | Avg: 24m 16s | Max: 24m 16s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 35s | Avg: 19m 35s | Max: 19m 35s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 53s | Max: 25m 45s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 52m | Avg: 37m 25s | Max: 38m 22s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 13m 58s | Avg:  4m 39s | Max:  4m 51s
      🟩 90a                Pass: 100%/4   | Total: 14m 18s | Avg:  3m 34s | Max:  3m 40s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  3h 36m | Avg:  7m 43s | Max: 52m 46s
      🟩 14                 Pass: 100%/27  | Total:  4h 49m | Avg: 10m 43s | Max:  1h 04m | Hits:   2%/1454  
      🟩 17                 Pass: 100%/26  | Total:  3h 46m | Avg:  8m 42s | Max:  1h 03m | Hits:   2%/727   
      🟩 20                 Pass: 100%/23  | Total:  5h 16m | Avg: 13m 45s | Max:  1h 14m | Hits:   2%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 17h 21m | Avg: 10m 00s | Max: 49m 40s | Hits: 2%/11638

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 16h 51m | Avg: 10m 32s | Max: 49m 40s | Hits:   2%/11638 
      🟩 arm64              Pass: 100%/8   | Total: 30m 00s | Avg:  3m 45s | Max:  4m 12s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 52m | Avg:  7m 29s | Max: 43m 59s | Hits:   3%/2711  
      🟩 11.8               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 27m 24s
      🟩 12.6               Pass: 100%/86  | Total: 14h 17m | Avg:  9m 58s | Max: 49m 40s | Hits:   2%/8927  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 36m 17s | Avg: 18m 08s | Max: 19m 11s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 52m | Avg:  7m 29s | Max: 43m 59s | Hits:   3%/2711  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 27m 24s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 13h 41m | Avg:  9m 46s | Max: 49m 40s | Hits:   2%/8927  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 36m 17s | Avg: 18m 08s | Max: 19m 11s
      🟩 nvcc               Pass: 100%/102 | Total: 16h 44m | Avg:  9m 51s | Max: 49m 40s | Hits:   2%/11638 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 51m 58s | Avg:  8m 39s | Max: 22m 28s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 56s | Avg:  5m 18s | Max:  5m 48s
      🟩 Clang11            Pass: 100%/4   | Total: 16m 53s | Avg:  4m 13s | Max:  4m 24s
      🟩 Clang12            Pass: 100%/4   | Total: 53m 37s | Avg: 13m 24s | Max: 25m 07s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 53s | Avg:  4m 28s | Max:  4m 43s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 32s | Avg:  4m 23s | Max:  4m 50s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 39s
      🟩 Clang17            Pass: 100%/4   | Total: 41m 38s | Avg: 10m 24s | Max: 28m 37s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 14m | Avg:  9m 17s | Max: 19m 11s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 29s | Avg:  2m 44s | Max:  2m 54s
      🟩 GCC7               Pass: 100%/6   | Total: 19m 53s | Avg:  3m 18s | Max:  4m 11s
      🟩 GCC8               Pass: 100%/6   | Total: 37m 16s | Avg:  6m 12s | Max: 19m 10s
      🟩 GCC9               Pass: 100%/6   | Total: 32m 20s | Avg:  5m 23s | Max: 16m 15s
      🟩 GCC10              Pass: 100%/4   | Total: 16m 08s | Avg:  4m 02s | Max:  4m 36s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 27m | Avg: 12m 27s | Max: 27m 24s
      🟩 GCC12              Pass: 100%/4   | Total: 30m 11s | Avg:  7m 32s | Max: 17m 28s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 42m | Avg: 13m 06s | Max: 36m 25s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 31m | Avg: 30m 23s | Max: 38m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 43m 59s | Avg: 43m 59s | Max: 43m 59s | Hits:   3%/2711  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 12s | Max: 42m 27s | Hits:   2%/5786  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 49m 40s | Avg: 49m 40s | Max: 49m 40s | Hits:   2%/3141  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  5h 24m | Avg:  7m 12s | Max: 28m 37s
      🟩 GCC                Pass: 100%/52  | Total:  7h 31m | Avg:  8m 40s | Max: 36m 25s
      🟩 Intel              Pass: 100%/3   | Total:  1h 31m | Avg: 30m 23s | Max: 38m 03s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 54m | Avg: 43m 31s | Max: 49m 40s | Hits:   2%/11638 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 17h 21m | Avg: 10m 00s | Max: 49m 40s | Hits:   2%/11638 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 14h 08m | Avg:  8m 50s | Max: 49m 40s | Hits:   2%/11638 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 53m | Avg: 28m 21s | Max: 32m 43s
      🟩 Test               Pass: 100%/3   | Total:  1h 17m | Avg: 25m 44s | Max: 36m 25s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 27m 24s
      🟩 90a                Pass: 100%/4   | Total: 25m 23s | Avg:  6m 20s | Max:  6m 41s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  3h 49m | Avg:  8m 12s | Max: 36m 25s
      🟩 14                 Pass: 100%/28  | Total:  4h 55m | Avg: 10m 32s | Max: 43m 59s | Hits:   2%/5524  
      🟩 17                 Pass: 100%/27  | Total:  4h 58m | Avg: 11m 02s | Max: 42m 27s | Hits:   2%/2973  
      🟩 20                 Pass: 100%/20  | Total:  3h 36m | Avg: 10m 48s | Max: 49m 40s | Hits:   2%/3141  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 15h 58m | Avg: 9m 18s | Max: 1h 22m | Hits: 19%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total: 15h 23m | Avg:  9m 43s | Max:  1h 22m | Hits:  19%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 35m 40s | Avg:  4m 27s | Max:  5m 25s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 07m | Avg:  8m 31s | Max:  1h 14m | Hits:   0%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 54s
      🟩 12.6               Pass: 100%/85  | Total: 13h 37m | Avg:  9m 37s | Max:  1h 22m | Hits:  24%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 09s | Avg:  4m 34s | Max:  4m 36s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 07m | Avg:  8m 31s | Max:  1h 14m | Hits:   0%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 54s
      🟩 nvcc12.6           Pass: 100%/83  | Total: 13h 28m | Avg:  9m 44s | Max:  1h 22m | Hits:  24%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 09s | Avg:  4m 34s | Max:  4m 36s
      🟩 nvcc               Pass: 100%/101 | Total: 15h 49m | Avg:  9m 24s | Max:  1h 22m | Hits:  19%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 26s | Avg:  4m 44s | Max:  5m 44s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 14s | Avg:  5m 44s | Max:  6m 07s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 30s | Avg:  4m 37s | Max:  4m 48s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  5m 30s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 17s | Avg:  5m 04s | Max:  5m 29s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 19s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 26s | Avg:  5m 06s | Max:  5m 33s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 35s | Avg:  4m 53s | Max:  5m 34s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  4m 59s
      🟩 Clang18            Pass: 100%/9   | Total: 58m 42s | Avg:  6m 31s | Max: 18m 16s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 13s | Avg:  3m 36s | Max:  3m 44s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 58s | Avg:  3m 59s | Max:  4m 39s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 35s | Avg:  4m 15s | Max:  4m 50s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 43s | Avg:  4m 17s | Max:  4m 54s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 39s | Avg:  4m 39s | Max:  4m 55s
      🟩 GCC11              Pass: 100%/7   | Total: 33m 18s | Avg:  4m 45s | Max:  5m 30s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 38s | Avg:  4m 54s | Max:  5m 08s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 35m | Avg:  6m 49s | Max: 18m 27s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 24m | Avg: 48m 19s | Max: 51m 57s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 14m | Avg:  1h 14m | Max:  1h 14m | Hits:   0%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 16m | Hits:   0%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 46s | Max:  1h 22m | Hits:  49%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 01m | Avg:  5m 14s | Max: 18m 16s
      🟩 GCC                Pass: 100%/49  | Total:  4h 09m | Avg:  5m 05s | Max: 18m 27s
      🟩 Intel              Pass: 100%/3   | Total:  2h 24m | Avg: 48m 19s | Max: 51m 57s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 23m | Avg:  1h 04m | Max:  1h 22m | Hits:  19%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 15h 58m | Avg:  9m 18s | Max:  1h 22m | Hits:  19%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 14h 16m | Avg:  8m 55s | Max:  1h 22m | Hits:   0%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 48m 33s | Avg: 12m 08s | Max: 25m 11s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 53m 49s | Avg: 17m 56s | Max: 18m 27s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 54s
      🟩 90a                Pass: 100%/4   | Total: 16m 30s | Avg:  4m 07s | Max:  4m 32s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 53m | Avg:  6m 12s | Max: 41m 33s
      🟩 14                 Pass: 100%/27  | Total:  4h 59m | Avg: 11m 04s | Max:  1h 14m | Hits:   0%/4460  
      🟩 17                 Pass: 100%/26  | Total:  4h 05m | Avg:  9m 26s | Max:  1h 16m | Hits:   0%/2230  
      🟩 20                 Pass: 100%/22  | Total:  4h 00m | Avg: 10m 56s | Max:  1h 22m | Hits:  49%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 23m | Avg: 2m 45s | Max: 11m 43s | Hits: 2%/216

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 16m | Avg:  2m 50s | Max: 11m 43s | Hits:   2%/216   
      🟩 arm64              Pass: 100%/4   | Total:  7m 06s | Avg:  1m 46s | Max:  1m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 55m 34s | Avg:  2m 55s | Max: 11m 37s | Hits:   2%/108   
      🟩 12.6               Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 11m 43s | Hits:   2%/108   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 55m 34s | Avg:  2m 55s | Max: 11m 37s | Hits:   2%/108   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 11m 43s | Hits:   2%/108   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 23m | Avg:  2m 45s | Max: 11m 43s | Hits:   2%/216   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 35s | Avg:  2m 17s | Max:  2m 23s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 26s | Avg:  2m 13s | Max:  2m 17s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 00s | Avg:  2m 15s | Max:  2m 20s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 50s | Avg:  2m 27s | Max:  2m 46s
      🟩 Clang13            Pass: 100%/4   | Total:  8m 54s | Avg:  2m 13s | Max:  2m 22s
      🟩 Clang14            Pass: 100%/4   | Total: 11m 06s | Avg:  2m 46s | Max:  4m 03s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 47s | Avg:  2m 23s | Max:  2m 30s
      🟩 Clang16            Pass: 100%/4   | Total:  8m 37s | Avg:  2m 09s | Max:  2m 27s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 00s | Avg:  2m 30s | Max:  2m 41s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 25s | Avg:  3m 12s | Max:  3m 59s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 04s | Avg:  2m 02s | Max:  2m 03s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 26s | Avg:  2m 06s | Max:  2m 10s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 31s | Avg:  2m 07s | Max:  2m 15s
      🟩 GCC12              Pass: 100%/7   | Total: 21m 26s | Avg:  3m 03s | Max:  4m 49s
      🟩 GCC13              Pass: 100%/3   | Total:  5m 24s | Avg:  1m 48s | Max:  2m 07s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:   2%/108   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 43s | Avg: 11m 43s | Max: 11m 43s | Hits:   2%/108   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 12m | Avg:  2m 25s | Max:  4m 03s
      🟩 GCC                Pass: 100%/20  | Total: 47m 51s | Avg:  2m 23s | Max:  4m 49s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 20s | Avg: 11m 40s | Max: 11m 43s | Hits:   2%/216   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 23m | Avg:  2m 45s | Max: 11m 43s | Hits:   2%/216   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 02m | Avg:  2m 36s | Max: 11m 43s | Hits:   2%/216   
      🟩 Test               Pass: 100%/5   | Total: 21m 04s | Avg:  4m 12s | Max:  4m 49s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 90a                Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 07m | Avg:  2m 23s | Max:  4m 49s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 11s | Max: 11m 43s | Hits:   2%/216   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 23s | Avg: 15m 23s | Max: 15m 23s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@miscco miscco merged commit c86caca into NVIDIA:main Oct 5, 2024
378 checks passed
@miscco miscco deleted the modularize_barrier branch October 5, 2024 10:37
Labels
feature request New feature or request. libcu++ For all items related to libcu++
Projects
Status: Done