Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rework builtin handling #2461

Merged
merged 3 commits into from
Oct 2, 2024
Merged

Rework builtin handling #2461

merged 3 commits into from
Oct 2, 2024

Conversation

miscco
Copy link
Collaborator

@miscco miscco commented Sep 25, 2024

We are relying a lot on compiler builtins for libcu++, but there are a lot more and also it is all over the place.

This moves almost all of the builtin detection and fixes to a central CCCL header that provides it everywhere and also replaces most open coded uses with that.

Copy link
Contributor

🟨 CI finished in 2h 31m: Pass: 93%/364 | Total: 6d 13h | Avg: 26m 00s | Max: 1h 27m | Hits: 13%/17361
  • 🟨 libcudacxx: Pass: 84%/104 | Total: 1d 09h | Avg: 19m 32s | Max: 1h 27m | Hits: 2%/3081

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  83%/96  | Total:  1d 06h | Avg: 19m 13s | Max:  1h 27m | Hits:   2%/3081  
      🟩 arm64              Pass: 100%/8   | Total:  3h 06m | Avg: 23m 19s | Max: 30m 29s
    🟨 ctk
      🟨 11.1               Pass:  73%/15  | Total:  1h 46m | Avg:  7m 05s | Max: 38m 29s
      🟩 11.8               Pass: 100%/3   | Total: 24m 32s | Avg:  8m 10s | Max: 10m 32s
      🟨 12.6               Pass:  86%/86  | Total:  1d 07h | Avg: 22m 06s | Max:  1h 27m | Hits:   2%/3081  
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total: 38m 25s | Avg: 19m 12s | Max: 20m 18s
      🟨 nvcc11.1           Pass:  73%/15  | Total:  1h 46m | Avg:  7m 05s | Max: 38m 29s
      🟩 nvcc11.8           Pass: 100%/3   | Total: 24m 32s | Avg:  8m 10s | Max: 10m 32s
      🟨 nvcc12.6           Pass:  88%/84  | Total:  1d 07h | Avg: 22m 10s | Max:  1h 27m | Hits:   2%/3081  
    🟨 cxx
      🟥 Clang9             Pass:   0%/6   | Total:  1h 24m | Avg: 14m 04s | Max: 31m 18s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 04s | Max: 30m 13s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 39m | Avg: 24m 48s | Max: 27m 17s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 43m | Avg: 25m 46s | Max: 27m 38s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 40m | Avg: 25m 01s | Max: 27m 41s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 39m | Avg: 24m 56s | Max: 27m 57s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 39m | Avg: 24m 56s | Max: 29m 03s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 40m | Avg: 25m 14s | Max: 29m 08s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 40m | Avg: 25m 06s | Max: 29m 01s
      🟨 Clang18            Pass:  75%/8   | Total:  2h 58m | Avg: 22m 16s | Max: 29m 07s
      🟩 GCC6               Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 21s
      🟩 GCC7               Pass: 100%/6   | Total: 36m 22s | Avg:  6m 03s | Max:  7m 07s
      🟩 GCC8               Pass: 100%/6   | Total: 36m 47s | Avg:  6m 07s | Max:  7m 14s
      🟩 GCC9               Pass: 100%/6   | Total: 37m 29s | Avg:  6m 14s | Max:  7m 34s
      🟩 GCC10              Pass: 100%/4   | Total: 30m 57s | Avg:  7m 44s | Max:  8m 44s
      🟩 GCC11              Pass: 100%/7   | Total: 55m 48s | Avg:  7m 58s | Max: 10m 32s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 26m | Avg: 21m 39s | Max: 29m 36s
      🟨 GCC13              Pass:  70%/17  | Total:  7h 21m | Avg: 25m 59s | Max:  1h 27m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 26m | Avg: 28m 56s | Max: 33m 17s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 38m 29s | Avg: 38m 29s | Max: 38m 29s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  1h 18m | Avg: 39m 15s | Max: 42m 01s
      🟩 MSVC14.39          Pass: 100%/1   | Total: 47m 50s | Avg: 47m 50s | Max: 47m 50s | Hits:   2%/3081  
    🟨 cxx_family
      🟨 Clang              Pass:  82%/45  | Total: 17h 24m | Avg: 23m 12s | Max: 31m 18s
      🟨 GCC                Pass:  90%/52  | Total: 12h 16m | Avg: 14m 09s | Max:  1h 27m
      🟩 Intel              Pass: 100%/3   | Total:  1h 26m | Avg: 28m 56s | Max: 33m 17s
      🟨 MSVC               Pass:  25%/4   | Total:  2h 44m | Avg: 41m 12s | Max: 47m 50s | Hits:   2%/3081  
    🟨 jobs
      🟨 Build              Pass:  88%/96  | Total:  1d 05h | Avg: 18m 38s | Max: 47m 50s | Hits:   2%/3081  
      🟥 NVRTC              Pass:   0%/4   | Total:  1h 26m | Avg: 21m 43s | Max: 21m 53s
      🟨 Test               Pass:  66%/3   | Total:  2h 33m | Avg: 51m 13s | Max:  1h 27m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
    🟨 gpu
      🟨 v100               Pass:  84%/104 | Total:  1d 09h | Avg: 19m 32s | Max:  1h 27m | Hits:   2%/3081  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 38m 25s | Avg: 19m 12s | Max: 20m 18s
      🟨 nvcc               Pass:  86%/102 | Total:  1d 09h | Avg: 19m 32s | Max:  1h 27m | Hits:   2%/3081  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 24m 32s | Avg:  8m 10s | Max: 10m 32s
      🟩 90a                Pass: 100%/4   | Total:  1h 07m | Avg: 16m 54s | Max: 20m 30s
    🟨 std
      🟨 11                 Pass:  89%/28  | Total:  7h 12m | Avg: 15m 27s | Max: 47m 39s
      🟨 14                 Pass:  82%/28  | Total:  8h 01m | Avg: 17m 11s | Max: 38m 29s
      🟨 17                 Pass:  81%/27  | Total:  8h 55m | Avg: 19m 49s | Max: 42m 01s
      🟨 20                 Pass:  85%/20  | Total:  9h 41m | Avg: 29m 03s | Max:  1h 27m | Hits:   2%/3081  
    
  • 🟨 cub: Pass: 97%/104 | Total: 2d 21h | Avg: 39m 59s | Max: 1h 13m | Hits: 0%/2908

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/96  | Total:  2d 14h | Avg: 38m 48s | Max:  1h 13m | Hits:   0%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  7h 13m | Avg: 54m 12s | Max: 56m 50s
    🔍 ctk: 11.1 🔍
      🔍 11.1               Pass:  80%/15  | Total:  1h 43m | Avg:  6m 53s | Max: 57m 47s | Hits:   0%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 17m | Avg:  1h 05m | Max:  1h 06m
      🟩 12.6               Pass: 100%/86  | Total:  2d 16h | Avg: 44m 52s | Max:  1h 13m | Hits:   0%/2181  
    🔍 cudacxx: nvcc11.1 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 47m | Avg: 53m 43s | Max: 53m 47s
      🔍 nvcc11.1           Pass:  80%/15  | Total:  1h 43m | Avg:  6m 53s | Max: 57m 47s | Hits:   0%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 17m | Avg:  1h 05m | Max:  1h 06m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 14h | Avg: 44m 39s | Max:  1h 13m | Hits:   0%/2181  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 43s | Max: 53m 47s
      🔍 nvcc               Pass:  97%/102 | Total:  2d 19h | Avg: 39m 43s | Max:  1h 13m | Hits:   0%/2908  
    🔍 cxx: Clang9 🔍
      🔍 Clang9             Pass:  50%/6   | Total:  2h 41m | Avg: 26m 56s | Max: 53m 40s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 37m | Avg: 52m 21s | Max: 53m 09s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 27m | Avg: 51m 49s | Max: 53m 29s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 36m | Avg: 54m 06s | Max: 57m 05s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 50s | Max: 51m 16s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 49s | Max: 51m 06s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 55s | Max: 51m 44s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 33m | Avg: 53m 17s | Max: 58m 46s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 53s | Max: 51m 27s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 09m | Avg: 47m 40s | Max: 54m 35s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 51s | Avg:  3m 25s | Max:  3m 30s
      🟩 GCC7               Pass: 100%/6   | Total: 37m 28s | Avg:  6m 14s | Max:  9m 32s
      🟩 GCC8               Pass: 100%/6   | Total: 37m 32s | Avg:  6m 15s | Max:  9m 07s
      🟩 GCC9               Pass: 100%/6   | Total: 39m 50s | Avg:  6m 38s | Max: 10m 17s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 30m | Avg: 52m 39s | Max: 58m 06s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 39m | Avg: 57m 00s | Max:  1h 06m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 31m | Avg: 52m 50s | Max: 55m 25s
      🟩 GCC13              Pass: 100%/16  | Total:  9h 42m | Avg: 36m 24s | Max: 56m 52s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 50m | Avg: 56m 55s | Max: 59m 01s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 57m 47s | Avg: 57m 47s | Max: 57m 47s | Hits:   0%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 13m | Hits:   0%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:   0%/727   
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  93%/46  | Total:  1d 12h | Avg: 47m 47s | Max: 58m 46s
      🟩 GCC                Pass: 100%/51  | Total:  1d 01h | Avg: 29m 54s | Max:  1h 06m
      🟩 Intel              Pass: 100%/3   | Total:  2h 50m | Avg: 56m 55s | Max: 59m 01s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 24m | Avg:  1h 06m | Max:  1h 13m | Hits:   0%/2908  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  96%/96  | Total:  2d 17h | Avg: 41m 04s | Max:  1h 13m | Hits:   0%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 27m 04s | Avg: 27m 04s | Max: 27m 04s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 32s | Avg: 17m 32s | Max: 17m 32s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 03s | Max: 27m 25s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 46s | Max: 35m 33s
    🟨 std
      🟨 11                 Pass:  96%/28  | Total: 17h 04m | Avg: 36m 35s | Max:  1h 05m
      🟨 14                 Pass:  96%/27  | Total: 17h 33m | Avg: 39m 01s | Max:  1h 06m | Hits:   0%/1454  
      🟨 17                 Pass:  96%/26  | Total: 17h 31m | Avg: 40m 25s | Max:  1h 13m | Hits:   0%/727   
      🟩 20                 Pass: 100%/23  | Total: 17h 09m | Avg: 44m 45s | Max:  1h 08m | Hits:   0%/727   
    🟨 gpu
      🟨 v100               Pass:  97%/104 | Total:  2d 21h | Avg: 39m 59s | Max:  1h 13m | Hits:   0%/2908  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 17m | Avg:  1h 05m | Max:  1h 06m
      🟩 90a                Pass: 100%/4   | Total:  1h 34m | Avg: 23m 32s | Max: 24m 38s
    
  • 🟨 thrust: Pass: 97%/103 | Total: 2d 03h | Avg: 29m 50s | Max: 1h 23m | Hits: 19%/11150

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/95  | Total:  1d 22h | Avg: 29m 17s | Max:  1h 23m | Hits:  19%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 51m | Avg: 36m 27s | Max: 42m 26s
    🔍 ctk: 11.1 🔍
      🔍 11.1               Pass:  80%/15  | Total:  2h 34m | Avg: 10m 16s | Max:  1h 17m | Hits:   0%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 01m | Avg: 40m 20s | Max: 44m 17s
      🟩 12.6               Pass: 100%/85  | Total:  1d 22h | Avg: 32m 56s | Max:  1h 23m | Hits:  24%/8920  
    🔍 cudacxx: nvcc11.1 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 00m | Avg: 30m 27s | Max: 31m 15s
      🔍 nvcc11.1           Pass:  80%/15  | Total:  2h 34m | Avg: 10m 16s | Max:  1h 17m | Hits:   0%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 01m | Avg: 40m 20s | Max: 44m 17s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 21h | Avg: 32m 59s | Max:  1h 23m | Hits:  24%/8920  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 27s | Max: 31m 15s
      🔍 nvcc               Pass:  97%/101 | Total:  2d 02h | Avg: 29m 50s | Max:  1h 23m | Hits:  19%/11150 
    🔍 cxx: Clang9 🔍
      🔍 Clang9             Pass:  50%/6   | Total:  1h 57m | Avg: 19m 30s | Max: 40m 55s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 53m | Avg: 37m 55s | Max: 40m 32s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 25m | Avg: 36m 19s | Max: 38m 37s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 31m | Avg: 37m 57s | Max: 44m 26s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 31m | Avg: 37m 51s | Max: 44m 10s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 29m | Avg: 37m 15s | Max: 40m 20s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 27m | Avg: 36m 57s | Max: 40m 34s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 28m | Avg: 37m 07s | Max: 40m 46s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 36m | Avg: 39m 04s | Max: 46m 41s
      🟩 Clang18            Pass: 100%/9   | Total:  4h 21m | Avg: 29m 03s | Max: 39m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 15s | Avg:  3m 37s | Max:  3m 46s
      🟩 GCC7               Pass: 100%/6   | Total: 25m 11s | Avg:  4m 11s | Max:  4m 59s
      🟩 GCC8               Pass: 100%/6   | Total: 54m 21s | Avg:  9m 03s | Max: 32m 49s
      🟩 GCC9               Pass: 100%/6   | Total: 26m 40s | Avg:  4m 26s | Max:  5m 16s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 17m | Avg: 34m 27s | Max: 36m 45s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 14m | Avg: 36m 24s | Max: 44m 17s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 41m | Avg: 40m 16s | Max: 46m 41s
      🟩 GCC13              Pass: 100%/14  | Total:  6h 23m | Avg: 27m 24s | Max: 42m 42s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 35m | Avg: 51m 56s | Max: 54m 13s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m | Hits:   0%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 12m | Hits:   0%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 29s | Max:  1h 23m | Hits:  49%/4460  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  93%/46  | Total:  1d 01h | Avg: 33m 31s | Max: 46m 41s
      🟩 GCC                Pass: 100%/49  | Total: 17h 31m | Avg: 21m 26s | Max: 46m 41s
      🟩 Intel              Pass: 100%/3   | Total:  2h 35m | Avg: 51m 56s | Max: 54m 13s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 25m | Avg:  1h 05m | Max:  1h 23m | Hits:  19%/11150 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  96%/96  | Total:  2d 01h | Avg: 31m 00s | Max:  1h 23m | Hits:   0%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 10s | Avg: 11m 02s | Max: 23m 04s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 53m 34s | Avg: 17m 51s | Max: 21m 28s
    🟨 std
      🟨 11                 Pass:  96%/28  | Total: 10h 30m | Avg: 22m 30s | Max: 49m 39s
      🟨 14                 Pass:  96%/27  | Total: 14h 11m | Avg: 31m 33s | Max:  1h 17m | Hits:   0%/4460  
      🟨 17                 Pass:  96%/26  | Total: 13h 21m | Avg: 30m 50s | Max:  1h 12m | Hits:   0%/2230  
      🟩 20                 Pass: 100%/22  | Total: 13h 10m | Avg: 35m 56s | Max:  1h 23m | Hits:  49%/4460  
    🟨 gpu
      🟨 v100               Pass:  97%/103 | Total:  2d 03h | Avg: 29m 50s | Max:  1h 23m | Hits:  19%/11150 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 01m | Avg: 40m 20s | Max: 44m 17s
      🟩 90a                Pass: 100%/4   | Total:  1h 43m | Avg: 25m 53s | Max: 28m 56s
    
  • 🟩 cudax: Pass: 100%/52 | Total: 3h 06m | Avg: 3m 34s | Max: 11m 58s | Hits: 3%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 54m | Avg:  3m 37s | Max: 11m 58s | Hits:   3%/222   
      🟩 arm64              Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 13s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 06m | Avg:  3m 29s | Max: 11m 58s | Hits:   3%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 59m | Avg:  3m 37s | Max: 11m 50s | Hits:   3%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 06m | Avg:  3m 29s | Max: 11m 58s | Hits:   3%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 59m | Avg:  3m 37s | Max: 11m 50s | Hits:   3%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  3h 06m | Avg:  3m 34s | Max: 11m 58s | Hits:   3%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 43s | Avg:  3m 21s | Max:  3m 30s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 38s | Avg:  3m 19s | Max:  3m 32s
      🟩 Clang11            Pass: 100%/4   | Total: 14m 56s | Avg:  3m 44s | Max:  5m 26s
      🟩 Clang12            Pass: 100%/4   | Total: 14m 29s | Avg:  3m 37s | Max:  5m 14s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 37s | Avg:  3m 09s | Max:  3m 15s
      🟩 Clang14            Pass: 100%/4   | Total: 14m 09s | Avg:  3m 32s | Max:  4m 05s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 01s | Avg:  3m 30s | Max:  3m 33s
      🟩 Clang16            Pass: 100%/4   | Total: 12m 37s | Avg:  3m 09s | Max:  3m 23s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 47s | Avg:  3m 23s | Max:  3m 34s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 45s | Avg:  3m 52s | Max:  4m 24s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 38s | Avg:  2m 19s | Max:  2m 20s
      🟩 GCC10              Pass: 100%/4   | Total: 10m 56s | Avg:  2m 44s | Max:  2m 58s
      🟩 GCC11              Pass: 100%/4   | Total: 10m 32s | Avg:  2m 38s | Max:  2m 46s
      🟩 GCC12              Pass: 100%/7   | Total: 24m 02s | Avg:  3m 26s | Max:  4m 03s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 32s | Avg:  2m 50s | Max:  3m 13s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 58s | Avg: 11m 58s | Max: 11m 58s | Hits:   3%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s | Hits:   3%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 43m | Avg:  3m 27s | Max:  5m 26s
      🟩 GCC                Pass: 100%/20  | Total: 58m 40s | Avg:  2m 56s | Max:  4m 03s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 48s | Avg: 11m 54s | Max: 11m 58s | Hits:   3%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  3h 06m | Avg:  3m 34s | Max: 11m 58s | Hits:   3%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 46m | Avg:  3m 31s | Max: 11m 58s | Hits:   3%/222   
      🟩 Test               Pass: 100%/5   | Total: 20m 07s | Avg:  4m 01s | Max:  4m 24s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 49s | Avg:  2m 49s | Max:  2m 49s
      🟩 90a                Pass: 100%/1   | Total:  2m 36s | Avg:  2m 36s | Max:  2m 36s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 28m | Avg:  3m 08s | Max:  5m 26s
      🟩 20                 Pass: 100%/24  | Total:  1h 38m | Avg:  4m 05s | Max: 11m 58s | Hits:   3%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 26s | Avg: 16m 26s | Max: 16m 26s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@miscco miscco force-pushed the refactor_builtins branch 6 times, most recently from d3f48b7 to de72a0f Compare September 27, 2024 12:00
Copy link
Contributor

🟩 CI finished in 2h 18m: Pass: 100%/364 | Total: 1d 23h | Avg: 7m 53s | Max: 1h 47m | Hits: 79%/25671
  • 🟩 cub: Pass: 100%/104 | Total: 12h 20m | Avg: 7m 07s | Max: 1h 47m | Hits: 99%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 47m | Avg:  7m 21s | Max:  1h 47m | Hits:  99%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 33m 49s | Avg:  4m 13s | Max:  4m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 08m | Avg:  4m 33s | Max: 15m 31s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 14m 27s | Avg:  4m 49s | Max:  5m 18s
      🟩 12.6               Pass: 100%/86  | Total: 10h 58m | Avg:  7m 39s | Max:  1h 47m | Hits:  99%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 08s | Avg:  4m 04s | Max:  4m 10s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 08m | Avg:  4m 33s | Max: 15m 31s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 27s | Avg:  4m 49s | Max:  5m 18s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 50m | Avg:  7m 44s | Max:  1h 47m | Hits:  99%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 08s | Avg:  4m 04s | Max:  4m 10s
      🟩 nvcc               Pass: 100%/102 | Total: 12h 12m | Avg:  7m 11s | Max:  1h 47m | Hits:  99%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 06s | Avg:  4m 41s | Max:  5m 39s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 05s | Avg:  5m 21s | Max:  5m 45s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 32s | Avg:  4m 38s | Max:  4m 47s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  4m 33s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 41s | Avg:  4m 40s | Max:  5m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 55s | Avg:  4m 43s | Max:  5m 00s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 06s | Avg:  4m 46s | Max:  5m 09s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  4m 42s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 32s | Avg:  4m 53s | Max:  5m 14s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 30m | Avg: 10m 00s | Max: 38m 00s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 25s | Avg:  3m 42s | Max:  3m 56s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 54s | Avg:  4m 09s | Max:  5m 03s
      🟩 GCC8               Pass: 100%/6   | Total: 24m 42s | Avg:  4m 07s | Max:  4m 34s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 43s | Avg:  4m 07s | Max:  4m 48s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 45s | Avg:  4m 41s | Max:  4m 59s
      🟩 GCC11              Pass: 100%/7   | Total: 32m 33s | Avg:  4m 39s | Max:  5m 18s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 27s | Avg:  4m 51s | Max:  5m 14s
      🟩 GCC13              Pass: 100%/16  | Total:  4h 12m | Avg: 15m 47s | Max:  1h 47m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 53s | Avg:  5m 17s | Max:  5m 26s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 31s | Avg: 15m 31s | Max: 15m 31s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 20s | Avg: 12m 40s | Max: 12m 50s | Hits:  99%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 45s | Avg: 13m 45s | Max: 13m 45s | Hits:  99%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 25m | Avg:  5m 46s | Max: 38m 00s
      🟩 GCC                Pass: 100%/51  | Total:  6h 45m | Avg:  7m 56s | Max:  1h 47m
      🟩 Intel              Pass: 100%/3   | Total: 15m 53s | Avg:  5m 17s | Max:  5m 26s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 36s | Avg: 13m 39s | Max: 15m 31s | Hits:  99%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 20m | Avg:  7m 07s | Max:  1h 47m | Hits:  99%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 50m | Avg:  4m 53s | Max: 15m 31s | Hits:  99%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 53s | Avg: 17m 53s | Max: 17m 53s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 25m | Avg: 28m 34s | Max: 38m 00s
      🟩 TestGPU            Pass: 100%/3   | Total:  2h 32m | Avg: 50m 53s | Max:  1h 47m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 27s | Avg:  4m 49s | Max:  5m 18s
      🟩 90a                Pass: 100%/4   | Total: 14m 48s | Avg:  3m 42s | Max:  3m 52s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 48m | Avg:  6m 00s | Max: 30m 29s
      🟩 14                 Pass: 100%/27  | Total:  2h 19m | Avg:  5m 09s | Max: 15m 31s | Hits:  99%/1454  
      🟩 17                 Pass: 100%/26  | Total:  2h 08m | Avg:  4m 56s | Max: 12m 30s | Hits:  99%/727   
      🟩 20                 Pass: 100%/23  | Total:  5h 05m | Avg: 13m 16s | Max:  1h 47m | Hits:  99%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 22h 40m | Avg: 13m 04s | Max: 1h 07m | Hits: 56%/11391

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 21h 15m | Avg: 13m 17s | Max:  1h 07m | Hits:  56%/11391 
      🟩 arm64              Pass: 100%/8   | Total:  1h 25m | Avg: 10m 38s | Max: 24m 54s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 26m | Avg:  5m 47s | Max: 23m 16s | Hits:  99%/2650  
      🟩 11.8               Pass: 100%/3   | Total:  1h 14m | Avg: 24m 43s | Max: 27m 42s
      🟩 12.6               Pass: 100%/86  | Total: 19h 59m | Avg: 13m 56s | Max:  1h 07m | Hits:  43%/8741  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 38m 47s | Avg: 19m 23s | Max: 21m 10s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 26m | Avg:  5m 47s | Max: 23m 16s | Hits:  99%/2650  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 14m | Avg: 24m 43s | Max: 27m 42s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 19h 20m | Avg: 13m 48s | Max:  1h 07m | Hits:  43%/8741  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 38m 47s | Avg: 19m 23s | Max: 21m 10s
      🟩 nvcc               Pass: 100%/102 | Total: 22h 01m | Avg: 12m 57s | Max:  1h 07m | Hits:  56%/11391 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 30m | Avg: 15m 07s | Max: 27m 36s
      🟩 Clang10            Pass: 100%/3   | Total: 37m 35s | Avg: 12m 31s | Max: 28m 10s
      🟩 Clang11            Pass: 100%/4   | Total: 39m 38s | Avg:  9m 54s | Max: 26m 03s
      🟩 Clang12            Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 26s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 19m | Avg: 19m 52s | Max: 27m 58s
      🟩 Clang14            Pass: 100%/4   | Total: 37m 41s | Avg:  9m 25s | Max: 25m 54s
      🟩 Clang15            Pass: 100%/4   | Total: 35m 20s | Avg:  8m 50s | Max: 22m 04s
      🟩 Clang16            Pass: 100%/4   | Total: 38m 28s | Avg:  9m 37s | Max: 25m 19s
      🟩 Clang17            Pass: 100%/4   | Total: 53m 48s | Avg: 13m 27s | Max: 25m 09s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 46m | Avg: 20m 47s | Max:  1h 07m
      🟩 GCC6               Pass: 100%/2   | Total:  5m 44s | Avg:  2m 52s | Max:  3m 03s
      🟩 GCC7               Pass: 100%/6   | Total: 19m 16s | Avg:  3m 12s | Max:  3m 38s
      🟩 GCC8               Pass: 100%/6   | Total: 19m 56s | Avg:  3m 19s | Max:  3m 49s
      🟩 GCC9               Pass: 100%/6   | Total: 19m 51s | Avg:  3m 18s | Max:  4m 10s
      🟩 GCC10              Pass: 100%/4   | Total: 51m 48s | Avg: 12m 57s | Max: 24m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 28m | Avg: 21m 10s | Max: 30m 08s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 10m | Avg: 17m 36s | Max: 25m 24s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 15m | Avg: 15m 00s | Max: 36m 10s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 35m 44s | Avg: 11m 54s | Max: 24m 19s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s | Hits:  99%/2650  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 10m | Avg: 35m 18s | Max: 37m 30s | Hits:  43%/5662  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 48m 39s | Avg: 48m 39s | Max: 48m 39s | Hits:  41%/3079  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  9h 55m | Avg: 13m 14s | Max:  1h 07m
      🟩 GCC                Pass: 100%/52  | Total:  9h 50m | Avg: 11m 21s | Max: 36m 10s
      🟩 Intel              Pass: 100%/3   | Total: 35m 44s | Avg: 11m 54s | Max: 24m 19s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 18m | Avg: 34m 37s | Max: 48m 39s | Hits:  56%/11391 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 22h 40m | Avg: 13m 04s | Max:  1h 07m | Hits:  56%/11391 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 18h 46m | Avg: 11m 43s | Max: 48m 39s | Hits:  56%/11391 
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 08m | Avg: 32m 09s | Max: 36m 10s
      🟩 Test               Pass: 100%/3   | Total:  1h 43m | Avg: 34m 33s | Max:  1h 07m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 14m | Avg: 24m 43s | Max: 27m 42s
      🟩 90a                Pass: 100%/4   | Total: 14m 34s | Avg:  3m 38s | Max:  4m 05s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  4h 59m | Avg: 10m 41s | Max: 27m 37s
      🟩 14                 Pass: 100%/28  | Total:  4h 46m | Avg: 10m 13s | Max: 33m 07s | Hits:  71%/5401  
      🟩 17                 Pass: 100%/27  | Total:  6h 26m | Avg: 14m 19s | Max: 37m 30s | Hits:  42%/2911  
      🟩 20                 Pass: 100%/20  | Total:  6h 26m | Avg: 19m 19s | Max:  1h 07m | Hits:  41%/3079  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 10h 13m | Avg: 5m 57s | Max: 22m 53s | Hits: 98%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  9h 37m | Avg:  6m 05s | Max: 22m 53s | Hits:  98%/11150 
      🟩 arm64              Pass: 100%/8   | Total: 35m 10s | Avg:  4m 23s | Max:  5m 22s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 16m | Avg:  5m 07s | Max: 20m 33s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 54s | Avg:  5m 18s | Max:  5m 41s
      🟩 12.6               Pass: 100%/85  | Total:  8h 40m | Avg:  6m 07s | Max: 22m 53s | Hits:  98%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 14s | Avg:  4m 37s | Max:  4m 44s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 16m | Avg:  5m 07s | Max: 20m 33s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 54s | Avg:  5m 18s | Max:  5m 41s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  8h 31m | Avg:  6m 09s | Max: 22m 53s | Hits:  98%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 14s | Avg:  4m 37s | Max:  4m 44s
      🟩 nvcc               Pass: 100%/101 | Total: 10h 03m | Avg:  5m 58s | Max: 22m 53s | Hits:  98%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 14s | Avg:  4m 52s | Max:  6m 01s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 05s | Avg:  5m 41s | Max:  6m 00s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 25s | Avg:  4m 51s | Max:  5m 24s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  5m 18s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 04s | Avg:  4m 46s | Max:  5m 08s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 21s | Avg:  4m 50s | Max:  4m 56s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 48s | Avg:  4m 57s | Max:  5m 33s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 14s | Avg:  5m 03s | Max:  5m 27s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 34s | Avg:  5m 08s | Max:  5m 29s
      🟩 Clang18            Pass: 100%/9   | Total: 56m 40s | Avg:  6m 17s | Max: 18m 43s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 08s | Avg:  4m 04s | Max:  4m 14s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 31s | Avg:  4m 05s | Max:  4m 48s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 04s | Avg:  4m 30s | Max:  4m 58s
      🟩 GCC9               Pass: 100%/6   | Total: 26m 14s | Avg:  4m 22s | Max:  4m 59s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 05s | Avg:  4m 46s | Max:  4m 58s
      🟩 GCC11              Pass: 100%/7   | Total: 34m 45s | Avg:  4m 57s | Max:  5m 41s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 40s | Avg:  5m 10s | Max:  5m 54s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 34m | Avg:  6m 44s | Max: 19m 17s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 46s | Avg:  5m 55s | Max:  6m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 33s | Avg: 20m 33s | Max: 20m 33s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 35m 05s | Avg: 17m 32s | Max: 17m 54s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 44m 30s | Avg: 22m 15s | Max: 22m 53s | Hits:  97%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 00m | Avg:  5m 13s | Max: 18m 43s
      🟩 GCC                Pass: 100%/49  | Total:  4h 14m | Avg:  5m 12s | Max: 19m 17s
      🟩 Intel              Pass: 100%/3   | Total: 17m 46s | Avg:  5m 55s | Max:  6m 12s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 40m | Avg: 20m 01s | Max: 22m 53s | Hits:  98%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 10h 13m | Avg:  5m 57s | Max: 22m 53s | Hits:  98%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 34m | Avg:  5m 21s | Max: 21m 37s | Hits:  98%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 45m 40s | Avg: 11m 25s | Max: 22m 53s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 52m 40s | Avg: 17m 33s | Max: 19m 17s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 54s | Avg:  5m 18s | Max:  5m 41s
      🟩 90a                Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 17s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 16m | Avg:  4m 53s | Max: 14m 40s
      🟩 14                 Pass: 100%/27  | Total:  2h 36m | Avg:  5m 46s | Max: 20m 33s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  2h 23m | Avg:  5m 31s | Max: 17m 11s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  2h 56m | Avg:  8m 01s | Max: 22m 53s | Hits:  97%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 22m | Avg: 2m 44s | Max: 10m 24s | Hits: 90%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 14m | Avg:  2m 47s | Max: 10m 24s | Hits:  90%/222   
      🟩 arm64              Pass: 100%/4   | Total:  8m 38s | Avg:  2m 09s | Max:  2m 23s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 53m 06s | Avg:  2m 47s | Max: 10m 08s | Hits:  90%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 29m | Avg:  2m 42s | Max: 10m 24s | Hits:  90%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 53m 06s | Avg:  2m 47s | Max: 10m 08s | Hits:  90%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 29m | Avg:  2m 42s | Max: 10m 24s | Hits:  90%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 10m 24s | Hits:  90%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 38s | Avg:  2m 19s | Max:  2m 27s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 39s | Avg:  2m 19s | Max:  2m 31s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 31s | Avg:  2m 22s | Max:  2m 37s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 10s | Avg:  2m 17s | Max:  2m 32s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 22s | Avg:  2m 20s | Max:  2m 52s
      🟩 Clang14            Pass: 100%/4   | Total: 10m 59s | Avg:  2m 44s | Max:  4m 01s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 46s | Avg:  2m 23s | Max:  2m 30s
      🟩 Clang16            Pass: 100%/4   | Total:  9m 06s | Avg:  2m 16s | Max:  2m 40s
      🟩 Clang17            Pass: 100%/2   | Total:  4m 55s | Avg:  2m 27s | Max:  2m 30s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 39s | Avg:  3m 19s | Max:  4m 15s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 23s | Avg:  2m 11s | Max:  2m 17s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 33s | Avg:  2m 08s | Max:  2m 12s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 23s | Avg:  2m 05s | Max:  2m 10s
      🟩 GCC12              Pass: 100%/7   | Total: 20m 43s | Avg:  2m 57s | Max:  4m 06s
      🟩 GCC13              Pass: 100%/3   | Total:  6m 22s | Avg:  2m 07s | Max:  2m 20s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 08s | Avg: 10m 08s | Max: 10m 08s | Hits:  90%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 24s | Avg: 10m 24s | Max: 10m 24s | Hits:  90%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 13m | Avg:  2m 27s | Max:  4m 15s
      🟩 GCC                Pass: 100%/20  | Total: 48m 24s | Avg:  2m 25s | Max:  4m 06s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 32s | Avg: 10m 16s | Max: 10m 24s | Hits:  90%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 10m 24s | Hits:  90%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 02m | Avg:  2m 36s | Max: 10m 24s | Hits:  90%/222   
      🟩 Test               Pass: 100%/5   | Total: 20m 06s | Avg:  4m 01s | Max:  4m 15s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 90a                Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 06m | Avg:  2m 22s | Max:  4m 06s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 10s | Max: 10m 24s | Hits:  90%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 18m 14s | Avg: 18m 14s | Max: 18m 14s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Collaborator

@ericniebler ericniebler left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

vast improvement, thanks! just a few questions.

Comment on lines -678 to -689
#if defined(_LIBCUDACXX_IS_BASE_OF) && !defined(_LIBCUDACXX_USE_IS_BASE_OF_FALLBACK)

template <class _Set, class... _Ty>
_CCCL_INLINE_VAR constexpr bool __mset_contains = (_LIBCUDACXX_IS_BASE_OF(__mtype<_Ty>, _Set) && ...);

#else

template <class _Set, class... _Ty>
_CCCL_INLINE_VAR constexpr bool __mset_contains = (_CUDA_VSTD::is_base_of_v<__mtype<_Ty>, _Set> && ...);

#endif

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

as the comment above says, this is a particularly hot piece of meta-programming, so the direct use of the intrinsic here is intentional.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is no difference between what was there and what is now.

is_base_of_v is directly given by the compiler builtin without going through is_base::value, so both conditionals are exactly the same

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

instantiating a variable template is not free. this instantiates N extra variable templates than using the intrinsic directly.

libcudacxx/include/cuda/std/__cccl/builtin.h Outdated Show resolved Hide resolved
# define _CCCL_BUILTIN_OPERATOR_NEW(...) __builtin_operator_new(__VA_ARGS__)
#endif // __check_builtin(__builtin_operator_new) && __check_builtin(__builtin_operator_delete)

#if __has_builtin(__decay) && defined(_CCCL_CUDA_COMPILER_CLANG)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why __has_builtin here but __check_builtin elsewhere?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wrote that at the top.

The __check_builtin facility is a clutch to work with older GCC / clang that do not support __has_builtin

However, that has some drawbacks, because the __has_keyword or __has_feature checks can lead to false positives.

So for all newer builtins that are only supported by new compilers anyhow, I am moving towards just using __has_builtin

libcudacxx/include/cuda/std/__cccl/builtin.h Outdated Show resolved Hide resolved
libcudacxx/include/cuda/std/__cccl/builtin.h Outdated Show resolved Hide resolved
@@ -62,6 +48,20 @@ struct __libcpp_is_member_pointer<_Tp _Up::*>
};
};

#if defined(_CCCL_BUILTIN_IS_MEMBER_FUNCTION_POINTER) && !defined(_LIBCUDACXX_USE_IS_MEMBER_FUNCTION_POINTER_FALLBACK)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if we should change the _LIBCUDACXX_USE_[...]_FALLBACK macros to _CCCL_USE_[...]_FALLBACK.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was contemplating that as well, however this is explicitly for a type trait in libcu++ and not something that is generically available everywhere, so I decided to keep it as is.

Not a hill to die on though

# endif

#else

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why is __libcpp_is_member_pointer defined unconditionally now?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That is just github formatting being bad.

We need the __libcpp_is_member_pointer in other places, so I moved that to the top

Comment on lines -267 to +270
# ifdef _LIBCUDACXX_IS_CONSTANT_EVALUATED // is_constant_evaluated only exists since GCC 9
# ifdef _CCCL_BUILTIN_IS_CONSTANT_EVALUATED // is_constant_evaluated only exists since GCC 9
if (__libcpp_is_constant_evaluated())
# endif // defined(_LIBCUDACXX_IS_CONSTANT_EVALUATED)
# endif // defined(_CCCL_BUILTIN_IS_CONSTANT_EVALUATED)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why are we using __libcpp_is_constant_evaluated() instead of using _CCCL_BUILTIN_IS_CONSTANT_EVALUATED()?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually in that case we could use it directly, most often __libcpp_is_constant_evaluated is used without conditionals

@@ -40,7 +40,7 @@ _CCCL_NORETURN _LIBCUDACXX_ATTRIBUTE_FORMAT(__printf__, 1, 2)
_LIBCUDACXX_HIDE_FROM_ABI void __libcpp_verbose_abort(const char*, ...)
{
::abort();
__builtin_unreachable(); // never reached, but needed to tell the compiler that the function never returns
_CCCL_UNREACHABLE(); // never reached, but needed to tell the compiler that the function never returns
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this macro be called _CCCL_BUILTIN_UNREACHABLE()?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is unfortunately not a builtin but a pile of hacks that sometimes is a builtin

@miscco miscco force-pushed the refactor_builtins branch 4 times, most recently from 58caca7 to c85690a Compare October 1, 2024 08:14
Copy link
Contributor

github-actions bot commented Oct 1, 2024

🟨 CI finished in 2h 30m: Pass: 97%/364 | Total: 6d 16h | Avg: 26m 22s | Max: 1h 22m | Hits: 10%/25671
  • 🟨 thrust: Pass: 93%/103 | Total: 1d 23h | Avg: 27m 30s | Max: 1h 22m | Hits: 19%/11150

    🔍 cpu: arm64 🔍
      🟩 amd64              Pass: 100%/95  | Total:  1d 22h | Avg: 29m 27s | Max:  1h 22m | Hits:  19%/11150 
      🔍 arm64              Pass:  12%/8   | Total: 34m 51s | Avg:  4m 21s | Max: 24m 17s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  7h 30m | Avg: 30m 03s | Max:  1h 08m | Hits:   0%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  1h 58m | Avg: 39m 21s | Max: 44m 05s
      🔍 12.6               Pass:  91%/85  | Total:  1d 13h | Avg: 26m 38s | Max:  1h 22m | Hits:  24%/8920  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 29s | Avg: 25m 14s | Max: 26m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 30m | Avg: 30m 03s | Max:  1h 08m | Hits:   0%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 58m | Avg: 39m 21s | Max: 44m 05s
      🔍 nvcc12.6           Pass:  91%/83  | Total:  1d 12h | Avg: 26m 40s | Max:  1h 22m | Hits:  24%/8920  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 29s | Avg: 25m 14s | Max: 26m 13s
      🔍 nvcc               Pass:  93%/101 | Total:  1d 22h | Avg: 27m 33s | Max:  1h 22m | Hits:  19%/11150 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  92%/96  | Total:  1d 21h | Avg: 28m 38s | Max:  1h 22m | Hits:   0%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 19s | Avg: 11m 04s | Max: 22m 14s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 39m 11s | Avg: 13m 03s | Max: 14m 38s
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 51m | Avg: 28m 30s | Max: 33m 34s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 33m | Avg: 31m 01s | Max: 33m 56s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 09s | Max: 32m 28s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 00s | Max: 33m 52s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 14s | Max: 31m 41s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 46s | Max: 32m 58s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 19s | Max: 33m 26s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 54s | Max: 34m 13s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 02m | Avg: 30m 37s | Max: 34m 08s
      🟨 Clang18            Pass:  55%/9   | Total:  1h 51m | Avg: 12m 22s | Max: 33m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 45s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 00m | Avg: 20m 02s | Max: 34m 35s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 01m | Avg: 20m 17s | Max: 35m 44s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 31m | Avg: 35m 14s | Max: 39m 25s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 11m | Avg: 32m 48s | Max: 40m 35s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 52m | Avg: 33m 14s | Max: 44m 05s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 13m | Avg: 33m 25s | Max: 36m 49s
      🟨 GCC13              Pass:  78%/14  | Total:  3h 19m | Avg: 14m 15s | Max: 34m 32s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 17m | Avg: 45m 40s | Max: 52m 11s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:   0%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 08m | Hits:   0%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 21s | Max:  1h 22m | Hits:  49%/4460  
    🟨 cxx_family
      🟨 Clang              Pass:  91%/46  | Total: 20h 31m | Avg: 26m 46s | Max: 34m 13s
      🟨 GCC                Pass:  93%/49  | Total: 19h 17m | Avg: 23m 37s | Max: 44m 05s
      🟩 Intel              Pass: 100%/3   | Total:  2h 17m | Avg: 45m 40s | Max: 52m 11s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 06m | Avg:  1h 01m | Max:  1h 22m | Hits:  19%/11150 
    🟨 gpu
      🟨 v100               Pass:  93%/103 | Total:  1d 23h | Avg: 27m 30s | Max:  1h 22m | Hits:  19%/11150 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 58m | Avg: 39m 21s | Max: 44m 05s
      🟩 90a                Pass: 100%/4   | Total:  1h 12m | Avg: 18m 07s | Max: 19m 31s
    🟨 std
      🟨 11                 Pass:  96%/28  | Total: 10h 30m | Avg: 22m 31s | Max: 37m 19s
      🟨 14                 Pass:  92%/27  | Total: 13h 21m | Avg: 29m 40s | Max:  1h 08m | Hits:   0%/4460  
      🟨 17                 Pass:  92%/26  | Total: 13h 12m | Avg: 30m 29s | Max:  1h 08m | Hits:   0%/2230  
      🟨 20                 Pass:  90%/22  | Total: 10h 08m | Avg: 27m 39s | Max:  1h 22m | Hits:  49%/4460  
    
  • 🟨 libcudacxx: Pass: 99%/104 | Total: 1d 16h | Avg: 23m 36s | Max: 1h 19m | Hits: 2%/11391

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  98%/96  | Total:  1d 14h | Avg: 23m 58s | Max:  1h 19m | Hits:   2%/11391 
      🟩 arm64              Pass: 100%/8   | Total:  2h 33m | Avg: 19m 07s | Max: 25m 48s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  5h 34m | Avg: 22m 17s | Max: 38m 57s | Hits:   3%/2650  
      🟩 11.8               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 27s | Max: 27m 53s
      🔍 12.6               Pass:  98%/86  | Total:  1d 10h | Avg: 23m 48s | Max:  1h 19m | Hits:   2%/8741  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 37m 28s | Avg: 18m 44s | Max: 19m 04s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  5h 34m | Avg: 22m 17s | Max: 38m 57s | Hits:   3%/2650  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 13m | Avg: 24m 27s | Max: 27m 53s
      🔍 nvcc12.6           Pass:  98%/84  | Total:  1d 09h | Avg: 23m 55s | Max:  1h 19m | Hits:   2%/8741  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 37m 28s | Avg: 18m 44s | Max: 19m 04s
      🔍 nvcc               Pass:  99%/102 | Total:  1d 16h | Avg: 23m 41s | Max:  1h 19m | Hits:   2%/11391 
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  2h 20m | Avg: 23m 20s | Max: 27m 39s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 16m | Avg: 25m 22s | Max: 29m 59s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 32m | Avg: 23m 01s | Max: 29m 42s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 27m | Avg: 21m 58s | Max: 25m 48s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 29m | Avg: 22m 24s | Max: 25m 50s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 30m | Avg: 22m 42s | Max: 29m 56s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 36m | Avg: 24m 03s | Max: 27m 08s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 21m | Avg: 20m 18s | Max: 27m 17s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 33m | Avg: 23m 27s | Max: 27m 05s
      🟩 Clang18            Pass: 100%/8   | Total:  3h 43m | Avg: 27m 56s | Max:  1h 15m
      🟩 GCC6               Pass: 100%/2   | Total: 42m 21s | Avg: 21m 10s | Max: 23m 20s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 58m | Avg: 19m 43s | Max: 26m 44s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 48m | Avg: 18m 08s | Max: 26m 16s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 18m | Avg: 23m 08s | Max: 27m 24s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 29m | Avg: 22m 17s | Max: 26m 46s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 44m | Avg: 23m 31s | Max: 27m 53s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 33m | Avg: 23m 25s | Max: 27m 32s
      🔍 GCC13              Pass:  94%/17  | Total:  6h 20m | Avg: 22m 23s | Max:  1h 19m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 17m | Avg: 25m 47s | Max: 29m 55s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 38m 57s | Avg: 38m 57s | Max: 38m 57s | Hits:   3%/2650  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 21m | Avg: 40m 58s | Max: 42m 54s | Hits:   2%/5662  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 48m 29s | Avg: 48m 29s | Max: 48m 29s | Hits:   2%/3079  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/45  | Total: 17h 51m | Avg: 23m 48s | Max:  1h 15m
      🔍 GCC                Pass:  98%/52  | Total: 18h 56m | Avg: 21m 51s | Max:  1h 19m
      🟩 Intel              Pass: 100%/3   | Total:  1h 17m | Avg: 25m 47s | Max: 29m 55s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 49m | Avg: 42m 20s | Max: 48m 29s | Hits:   2%/11391 
    🔍 jobs: NVRTC 🔍
      🟩 Build              Pass: 100%/96  | Total:  1d 12h | Avg: 22m 31s | Max: 48m 29s | Hits:   2%/11391 
      🔍 NVRTC              Pass:  75%/4   | Total:  2h 03m | Avg: 30m 49s | Max: 37m 40s
      🟩 Test               Pass: 100%/3   | Total:  2h 46m | Avg: 55m 20s | Max:  1h 19m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 27s | Avg:  2m 27s | Max:  2m 27s
    🔍 std: 11 🔍
      🔍 11                 Pass:  96%/28  | Total:  9h 07m | Avg: 19m 33s | Max: 24m 24s
      🟩 14                 Pass: 100%/28  | Total:  9h 34m | Avg: 20m 30s | Max: 39m 03s | Hits:   3%/5401  
      🟩 17                 Pass: 100%/27  | Total: 11h 17m | Avg: 25m 05s | Max: 42m 54s | Hits:   2%/2911  
      🟩 20                 Pass: 100%/20  | Total: 10h 53m | Avg: 32m 39s | Max:  1h 19m | Hits:   2%/3079  
    🟨 gpu
      🟨 v100               Pass:  99%/104 | Total:  1d 16h | Avg: 23m 36s | Max:  1h 19m | Hits:   2%/11391 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 13m | Avg: 24m 27s | Max: 27m 53s
      🟩 90a                Pass: 100%/4   | Total: 42m 49s | Avg: 10m 42s | Max: 13m 41s
    
  • 🟩 cub: Pass: 100%/104 | Total: 2d 21h | Avg: 39m 57s | Max: 1h 05m | Hits: 0%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  2d 15h | Avg: 39m 26s | Max:  1h 05m | Hits:   0%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  6h 09m | Avg: 46m 09s | Max: 48m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 31m | Avg: 38m 04s | Max: 58m 18s | Hits:   0%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 57m | Avg: 59m 18s | Max: 59m 51s
      🟩 12.6               Pass: 100%/86  | Total:  2d 08h | Avg: 39m 36s | Max:  1h 05m | Hits:   0%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 39m | Avg: 49m 47s | Max: 49m 53s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 31m | Avg: 38m 04s | Max: 58m 18s | Hits:   0%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 57m | Avg: 59m 18s | Max: 59m 51s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 07h | Avg: 39m 21s | Max:  1h 05m | Hits:   0%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 39m | Avg: 49m 47s | Max: 49m 53s
      🟩 nvcc               Pass: 100%/102 | Total:  2d 19h | Avg: 39m 45s | Max:  1h 05m | Hits:   0%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 02m | Avg: 40m 23s | Max: 48m 33s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 12m | Avg: 44m 10s | Max: 44m 17s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 01m | Avg: 45m 15s | Max: 47m 55s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 52m | Avg: 43m 04s | Max: 43m 36s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 50m | Avg: 42m 42s | Max: 43m 11s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 58m | Avg: 44m 42s | Max: 48m 02s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 55m | Avg: 43m 46s | Max: 45m 06s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 51m | Avg: 42m 48s | Max: 43m 33s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 03m | Avg: 45m 55s | Max: 50m 08s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 07m | Avg: 40m 48s | Max: 49m 53s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 45s | Avg:  3m 52s | Max:  3m 58s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 33m | Avg: 25m 35s | Max: 44m 20s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 33m | Avg: 25m 39s | Max: 44m 18s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 53m | Avg: 48m 54s | Max: 57m 28s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 01m | Avg: 45m 24s | Max: 50m 39s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 57m | Avg: 51m 08s | Max: 59m 51s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 56m | Avg: 44m 06s | Max: 44m 58s
      🟩 GCC13              Pass: 100%/16  | Total:  7h 40m | Avg: 28m 47s | Max: 48m 01s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 23m | Avg: 47m 48s | Max: 48m 47s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 58m 18s | Avg: 58m 18s | Max: 58m 18s | Hits:   0%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:   0%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:   0%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 08h | Avg: 42m 56s | Max: 50m 08s
      🟩 GCC                Pass: 100%/51  | Total:  1d 05h | Avg: 35m 00s | Max: 59m 51s
      🟩 Intel              Pass: 100%/3   | Total:  2h 23m | Avg: 47m 48s | Max: 48m 47s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 11m | Avg:  1h 02m | Max:  1h 05m | Hits:   0%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  2d 21h | Avg: 39m 57s | Max:  1h 05m | Hits:   0%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 18h | Avg: 41m 37s | Max:  1h 05m | Hits:   0%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 42s | Avg: 18m 42s | Max: 18m 42s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 05s | Avg: 16m 05s | Max: 16m 05s
      🟩 HostLaunch         Pass: 100%/3   | Total: 55m 35s | Avg: 18m 31s | Max: 19m 48s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 48s | Max: 23m 15s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 57m | Avg: 59m 18s | Max: 59m 51s
      🟩 90a                Pass: 100%/4   | Total:  1h 02m | Avg: 15m 31s | Max: 16m 37s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 17h 31m | Avg: 37m 34s | Max: 59m 01s
      🟩 14                 Pass: 100%/27  | Total: 18h 43m | Avg: 41m 36s | Max:  1h 02m | Hits:   0%/1454  
      🟩 17                 Pass: 100%/26  | Total: 18h 08m | Avg: 41m 52s | Max:  1h 04m | Hits:   0%/727   
      🟩 20                 Pass: 100%/23  | Total: 14h 50m | Avg: 38m 44s | Max:  1h 05m | Hits:   0%/727   
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 22m | Avg: 2m 44s | Max: 11m 44s | Hits: 3%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 15m | Avg:  2m 49s | Max: 11m 44s | Hits:   3%/222   
      🟩 arm64              Pass: 100%/4   | Total:  7m 14s | Avg:  1m 48s | Max:  1m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 54m 51s | Avg:  2m 53s | Max: 11m 36s | Hits:   3%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 11m 44s | Hits:   3%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 54m 51s | Avg:  2m 53s | Max: 11m 36s | Hits:   3%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 11m 44s | Hits:   3%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 11m 44s | Hits:   3%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 35s | Avg:  2m 17s | Max:  2m 23s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 45s | Avg:  2m 22s | Max:  2m 34s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 07s | Avg:  2m 16s | Max:  2m 22s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 03s | Avg:  2m 15s | Max:  2m 25s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 08s | Avg:  2m 17s | Max:  2m 28s
      🟩 Clang14            Pass: 100%/4   | Total: 10m 32s | Avg:  2m 38s | Max:  3m 52s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 45s | Avg:  2m 22s | Max:  2m 25s
      🟩 Clang16            Pass: 100%/4   | Total:  8m 56s | Avg:  2m 14s | Max:  2m 48s
      🟩 Clang17            Pass: 100%/2   | Total:  4m 31s | Avg:  2m 15s | Max:  2m 16s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  3m 51s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 05s | Avg:  3m 02s | Max:  3m 03s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 09s | Avg:  2m 02s | Max:  2m 05s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 40s | Avg:  2m 10s | Max:  2m 19s
      🟩 GCC12              Pass: 100%/7   | Total: 19m 37s | Avg:  2m 48s | Max:  3m 42s
      🟩 GCC13              Pass: 100%/3   | Total:  5m 22s | Avg:  1m 47s | Max:  1m 56s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 36s | Avg: 11m 36s | Max: 11m 36s | Hits:   3%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 44s | Avg: 11m 44s | Max: 11m 44s | Hits:   3%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 11m | Avg:  2m 23s | Max:  3m 52s
      🟩 GCC                Pass: 100%/20  | Total: 47m 53s | Avg:  2m 23s | Max:  3m 42s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 20s | Avg: 11m 40s | Max: 11m 44s | Hits:   3%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 11m 44s | Hits:   3%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 04m | Avg:  2m 38s | Max: 11m 44s | Hits:   3%/222   
      🟩 Test               Pass: 100%/5   | Total: 18m 32s | Avg:  3m 42s | Max:  3m 52s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 90a                Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 06m | Avg:  2m 22s | Max:  3m 42s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 11s | Max: 11m 44s | Hits:   3%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 45s | Avg: 14m 45s | Max: 14m 45s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

github-actions bot commented Oct 1, 2024

🟩 CI finished in 3h 07m: Pass: 100%/364 | Total: 7d 18h | Avg: 30m 46s | Max: 1h 21m | Hits: 10%/25691
  • 🟩 cub: Pass: 100%/104 | Total: 3d 08h | Avg: 46m 37s | Max: 1h 19m | Hits: 2%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 01h | Avg: 45m 53s | Max:  1h 19m | Hits:   2%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  7h 23m | Avg: 55m 28s | Max:  1h 02m
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 10h 13m | Avg: 40m 53s | Max:  1h 01m | Hits:   2%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 18m | Avg:  1h 06m | Max:  1h 07m
      🟩 12.6               Pass: 100%/86  | Total:  2d 19h | Avg: 46m 56s | Max:  1h 19m | Hits:   2%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 54m | Avg: 57m 26s | Max:  1h 00m
      🟩 nvcc11.1           Pass: 100%/15  | Total: 10h 13m | Avg: 40m 53s | Max:  1h 01m | Hits:   2%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 18m | Avg:  1h 06m | Max:  1h 07m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 17h | Avg: 46m 41s | Max:  1h 19m | Hits:   2%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 26s | Max:  1h 00m
      🟩 nvcc               Pass: 100%/102 | Total:  3d 06h | Avg: 46m 24s | Max:  1h 19m | Hits:   2%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 55m | Avg: 49m 13s | Max: 57m 34s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 38m | Avg: 52m 57s | Max: 56m 37s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 33m | Avg: 53m 18s | Max: 56m 29s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 39s | Max: 54m 55s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 44s | Max: 52m 28s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 56s | Max: 51m 58s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 10s | Max: 54m 48s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 25m | Avg: 51m 17s | Max: 56m 20s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 54s | Max: 53m 40s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 05m | Avg: 47m 19s | Max:  1h 00m
      🟩 GCC6               Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 52s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 32m | Avg: 25m 24s | Max: 43m 32s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 12m | Avg: 32m 04s | Max: 49m 34s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 56m | Avg: 49m 21s | Max: 55m 36s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 41m | Avg: 55m 16s | Max: 58m 26s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 42m | Avg: 57m 32s | Max:  1h 07m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 33m | Avg: 53m 23s | Max: 54m 52s
      🟩 GCC13              Pass: 100%/16  | Total: 10h 05m | Avg: 37m 51s | Max:  1h 19m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 53m | Avg: 57m 53s | Max:  1h 00m
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits:   2%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 07m | Hits:   2%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m | Hits:   2%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 14h | Avg: 50m 31s | Max:  1h 00m
      🟩 GCC                Pass: 100%/51  | Total:  1d 10h | Avg: 41m 00s | Max:  1h 19m
      🟩 Intel              Pass: 100%/3   | Total:  2h 53m | Avg: 57m 53s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 19m | Avg:  1h 04m | Max:  1h 07m | Hits:   2%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 08h | Avg: 46m 37s | Max:  1h 19m | Hits:   2%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 05h | Avg: 48m 11s | Max:  1h 07m | Hits:   2%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 19m | Avg:  1h 19m | Max:  1h 19m
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 05s | Avg: 16m 05s | Max: 16m 05s
      🟩 HostLaunch         Pass: 100%/3   | Total: 51m 38s | Avg: 17m 12s | Max: 19m 38s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 15m | Avg: 25m 15s | Max: 31m 54s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 18m | Avg:  1h 06m | Max:  1h 07m
      🟩 90a                Pass: 100%/4   | Total:  1h 30m | Avg: 22m 37s | Max: 23m 40s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 20h 34m | Avg: 44m 04s | Max:  1h 05m
      🟩 14                 Pass: 100%/27  | Total: 21h 34m | Avg: 47m 56s | Max:  1h 06m | Hits:   2%/1454  
      🟩 17                 Pass: 100%/26  | Total: 21h 02m | Avg: 48m 32s | Max:  1h 07m | Hits:   2%/727   
      🟩 20                 Pass: 100%/23  | Total: 17h 37m | Avg: 45m 59s | Max:  1h 19m | Hits:   2%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 1d 18h | Avg: 24m 19s | Max: 1h 16m | Hits: 2%/11411

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  1d 15h | Avg: 24m 30s | Max:  1h 16m | Hits:   2%/11411 
      🟩 arm64              Pass: 100%/8   | Total:  2h 57m | Avg: 22m 12s | Max: 28m 02s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  5h 22m | Avg: 21m 29s | Max: 42m 30s | Hits:   3%/2655  
      🟩 11.8               Pass: 100%/3   | Total:  1h 15m | Avg: 25m 13s | Max: 28m 48s
      🟩 12.6               Pass: 100%/86  | Total:  1d 11h | Avg: 24m 47s | Max:  1h 16m | Hits:   2%/8756  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 39m 36s | Avg: 19m 48s | Max: 20m 20s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  5h 22m | Avg: 21m 29s | Max: 42m 30s | Hits:   3%/2655  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 15m | Avg: 25m 13s | Max: 28m 48s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  1d 10h | Avg: 24m 54s | Max:  1h 16m | Hits:   2%/8756  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 39m 36s | Avg: 19m 48s | Max: 20m 20s
      🟩 nvcc               Pass: 100%/102 | Total:  1d 17h | Avg: 24m 25s | Max:  1h 16m | Hits:   2%/11411 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 32m | Avg: 25m 23s | Max: 33m 43s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 16m | Avg: 25m 22s | Max: 30m 21s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 42m | Avg: 25m 43s | Max: 31m 01s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 38m | Avg: 24m 44s | Max: 28m 11s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 37m | Avg: 24m 15s | Max: 28m 18s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 41m | Avg: 25m 29s | Max: 30m 29s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 38m | Avg: 24m 33s | Max: 29m 05s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 37m | Avg: 24m 22s | Max: 28m 18s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 33m | Avg: 23m 20s | Max: 28m 01s
      🟩 Clang18            Pass: 100%/8   | Total:  3h 55m | Avg: 29m 23s | Max:  1h 16m
      🟩 GCC6               Pass: 100%/2   | Total: 18m 23s | Avg:  9m 11s | Max: 13m 18s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 12m | Avg: 22m 04s | Max: 26m 10s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 52m | Avg: 18m 45s | Max: 27m 06s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 16m | Avg: 22m 47s | Max: 27m 00s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 40m | Avg: 25m 06s | Max: 28m 56s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 50m | Avg: 24m 21s | Max: 28m 48s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 42m | Avg: 25m 30s | Max: 29m 16s
      🟩 GCC13              Pass: 100%/17  | Total:  5h 34m | Avg: 19m 41s | Max: 30m 40s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 29m | Avg: 29m 51s | Max: 36m 48s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 42m 30s | Avg: 42m 30s | Max: 42m 30s | Hits:   3%/2655  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 21m | Avg: 40m 49s | Max: 41m 50s | Hits:   2%/5672  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 55m 28s | Avg: 55m 28s | Max: 55m 28s | Hits:   2%/3084  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total: 19h 13m | Avg: 25m 38s | Max:  1h 16m
      🟩 GCC                Pass: 100%/52  | Total: 18h 27m | Avg: 21m 18s | Max: 30m 40s
      🟩 Intel              Pass: 100%/3   | Total:  1h 29m | Avg: 29m 51s | Max: 36m 48s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 59m | Avg: 44m 54s | Max: 55m 28s | Hits:   2%/11411 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  1d 18h | Avg: 24m 19s | Max:  1h 16m | Hits:   2%/11411 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  1d 14h | Avg: 24m 11s | Max: 55m 28s | Hits:   2%/11411 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 42m | Avg: 25m 32s | Max: 28m 09s
      🟩 Test               Pass: 100%/3   | Total:  1h 43m | Avg: 34m 37s | Max:  1h 16m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 15m | Avg: 25m 13s | Max: 28m 48s
      🟩 90a                Pass: 100%/4   | Total:  1h 05m | Avg: 16m 16s | Max: 20m 10s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  8h 41m | Avg: 18m 37s | Max: 26m 06s
      🟩 14                 Pass: 100%/28  | Total: 10h 57m | Avg: 23m 28s | Max: 42m 30s | Hits:   3%/5411  
      🟩 17                 Pass: 100%/27  | Total: 12h 10m | Avg: 27m 03s | Max: 41m 50s | Hits:   2%/2916  
      🟩 20                 Pass: 100%/20  | Total: 10h 19m | Avg: 30m 58s | Max:  1h 16m | Hits:   2%/3084  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 12h | Avg: 35m 05s | Max: 1h 21m | Hits: 19%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 07h | Avg: 34m 58s | Max:  1h 21m | Hits:  19%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 51m | Avg: 36m 28s | Max: 42m 05s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 15m | Avg: 33m 03s | Max:  1h 08m | Hits:   0%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 18m | Avg: 46m 10s | Max: 50m 19s
      🟩 12.6               Pass: 100%/85  | Total:  2d 01h | Avg: 35m 03s | Max:  1h 21m | Hits:  24%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 05m | Avg: 32m 47s | Max: 33m 28s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 15m | Avg: 33m 03s | Max:  1h 08m | Hits:   0%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 18m | Avg: 46m 10s | Max: 50m 19s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  2d 00h | Avg: 35m 06s | Max:  1h 21m | Hits:  24%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 47s | Max: 33m 28s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 11h | Avg: 35m 08s | Max:  1h 21m | Hits:  19%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 36m | Avg: 36m 02s | Max: 42m 46s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 05m | Avg: 41m 41s | Max: 45m 34s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 29m | Avg: 37m 24s | Max: 44m 45s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 35m | Avg: 38m 46s | Max: 43m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 30m | Avg: 37m 42s | Max: 40m 21s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 31m | Avg: 37m 50s | Max: 43m 52s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 27m | Avg: 36m 55s | Max: 39m 44s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 31m | Avg: 37m 53s | Max: 42m 37s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 40m | Avg: 40m 06s | Max: 44m 43s
      🟩 Clang18            Pass: 100%/9   | Total:  4h 31m | Avg: 30m 08s | Max: 43m 35s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 29s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 07m | Avg: 21m 18s | Max: 38m 31s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 11m | Avg: 21m 50s | Max: 39m 21s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 39m | Avg: 36m 35s | Max: 42m 53s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 38m | Avg: 39m 30s | Max: 46m 40s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 48m | Avg: 41m 10s | Max: 50m 19s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 44m | Avg: 41m 05s | Max: 47m 39s
      🟩 GCC13              Pass: 100%/14  | Total:  6h 02m | Avg: 25m 55s | Max: 42m 05s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 28m | Avg: 49m 33s | Max: 53m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:   0%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m | Hits:   0%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 15s | Max:  1h 21m | Hits:  49%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 03h | Avg: 36m 30s | Max: 45m 34s
      🟩 GCC                Pass: 100%/49  | Total:  1d 00h | Avg: 29m 48s | Max: 50m 19s
      🟩 Intel              Pass: 100%/3   | Total:  2h 28m | Avg: 49m 33s | Max: 53m 33s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 21m | Hits:  19%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 12h | Avg: 35m 05s | Max:  1h 21m | Hits:  19%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 10h | Avg: 36m 40s | Max:  1h 21m | Hits:   0%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 01s | Avg: 11m 30s | Max: 23m 19s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 47m 08s | Avg: 15m 42s | Max: 19m 26s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 18m | Avg: 46m 10s | Max: 50m 19s
      🟩 90a                Pass: 100%/4   | Total:  1h 36m | Avg: 24m 00s | Max: 27m 06s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 13h 13m | Avg: 28m 21s | Max: 43m 06s
      🟩 14                 Pass: 100%/27  | Total: 16h 47m | Avg: 37m 19s | Max:  1h 16m | Hits:   0%/4460  
      🟩 17                 Pass: 100%/26  | Total: 17h 11m | Avg: 39m 40s | Max:  1h 16m | Hits:   0%/2230  
      🟩 20                 Pass: 100%/22  | Total: 13h 01m | Avg: 35m 30s | Max:  1h 21m | Hits:  49%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 3h 11m | Avg: 3m 41s | Max: 14m 29s | Hits: 4%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 59m | Avg:  3m 44s | Max: 14m 29s | Hits:   4%/222   
      🟩 arm64              Pass: 100%/4   | Total: 11m 54s | Avg:  2m 58s | Max:  3m 30s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 12m | Avg:  3m 49s | Max: 14m 29s | Hits:   4%/111   
      🟩 12.6               Pass: 100%/33  | Total:  1h 59m | Avg:  3m 36s | Max: 12m 03s | Hits:   4%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 12m | Avg:  3m 49s | Max: 14m 29s | Hits:   4%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 59m | Avg:  3m 36s | Max: 12m 03s | Hits:   4%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  3h 11m | Avg:  3m 41s | Max: 14m 29s | Hits:   4%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 40s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 41s | Avg:  3m 20s | Max:  3m 25s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 39s | Avg:  3m 09s | Max:  3m 28s
      🟩 Clang12            Pass: 100%/4   | Total: 13m 08s | Avg:  3m 17s | Max:  3m 24s
      🟩 Clang13            Pass: 100%/4   | Total: 13m 26s | Avg:  3m 21s | Max:  3m 49s
      🟩 Clang14            Pass: 100%/4   | Total: 13m 13s | Avg:  3m 18s | Max:  3m 43s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 46s
      🟩 Clang16            Pass: 100%/4   | Total: 13m 18s | Avg:  3m 19s | Max:  3m 36s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 41s | Avg:  3m 20s | Max:  3m 23s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 13s | Avg:  3m 36s | Max:  3m 51s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 08s | Avg:  3m 04s | Max:  3m 06s
      🟩 GCC10              Pass: 100%/4   | Total: 12m 50s | Avg:  3m 12s | Max:  3m 20s
      🟩 GCC11              Pass: 100%/4   | Total: 13m 08s | Avg:  3m 17s | Max:  3m 26s
      🟩 GCC12              Pass: 100%/7   | Total: 24m 41s | Avg:  3m 31s | Max:  3m 54s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 13s | Avg:  2m 44s | Max:  2m 53s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 14m 29s | Avg: 14m 29s | Max: 14m 29s | Hits:   4%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 03s | Avg: 12m 03s | Max: 12m 03s | Hits:   4%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 40m | Avg:  3m 20s | Max:  3m 51s
      🟩 GCC                Pass: 100%/20  | Total:  1h 05m | Avg:  3m 15s | Max:  3m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 26m 32s | Avg: 13m 16s | Max: 14m 29s | Hits:   4%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  3h 11m | Avg:  3m 41s | Max: 14m 29s | Hits:   4%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 52m | Avg:  3m 40s | Max: 14m 29s | Hits:   4%/222   
      🟩 Test               Pass: 100%/5   | Total: 19m 10s | Avg:  3m 50s | Max:  3m 54s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s
      🟩 90a                Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 31m | Avg:  3m 15s | Max:  3m 54s
      🟩 20                 Pass: 100%/24  | Total:  1h 40m | Avg:  4m 11s | Max: 14m 29s | Hits:   4%/222   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 364)

# Runner
297 linux-amd64-cpu16
28 linux-arm64-cpu16
24 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@miscco miscco merged commit e4f48cf into NVIDIA:main Oct 2, 2024
382 checks passed
@miscco miscco deleted the refactor_builtins branch October 2, 2024 07:06
fbusato pushed a commit to fbusato/cccl that referenced this pull request Oct 2, 2024
* Move builtin detection to its own file

* Try to reenable more builtins

* Address review comments
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

4 participants