Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

replace _CCCL_ALWAYS_INLINE with _CCCL_FORCEINLINE #2439

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

ericniebler
Copy link
Collaborator

Description

cccl has _CCCL_FORCEINLINE and _CCCL_ALWAYS_INLINE. there should be only one. also, _CCCL_FORCEINLINE currently expands to inline when not using a CUDA compiler. that is unexpected. it should expand to either __attribute__((always_inline)) or __forceinline depending on which is supported by the host compiler.

closes #2438

This PR moves the definition of _CCCL_FORCEINLINE from execution_space.h to visibility.h. it also changes the definition to expand directly to either __inline__ __attribute__((always_inline)) or __forceinline rather then indirectly through the __forceinline__ macro defined in host_defines.h.

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@miscco miscco requested a review from a team as a code owner September 20, 2024 06:47
Copy link
Contributor

🟨 CI finished in 1h 19m: Pass: 99%/368 | Total: 2d 03h | Avg: 8m 22s | Max: 49m 36s | Hits: 74%/25647
  • 🟨 libcudacxx: Pass: 99%/104 | Total: 1d 01h | Avg: 14m 52s | Max: 49m 36s | Hits: 43%/11375

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  98%/96  | Total: 23h 57m | Avg: 14m 58s | Max: 49m 36s | Hits:  43%/11375 
      🟩 arm64              Pass: 100%/8   | Total:  1h 48m | Avg: 13m 34s | Max: 23m 30s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  3h 11m | Avg: 12m 44s | Max: 38m 05s | Hits:  46%/2646  
      🟩 11.8               Pass: 100%/3   | Total:  1h 10m | Avg: 23m 30s | Max: 26m 57s
      🔍 12.6               Pass:  98%/86  | Total: 21h 25m | Avg: 14m 56s | Max: 49m 36s | Hits:  43%/8729  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 36m 57s | Avg: 18m 28s | Max: 20m 06s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 11m | Avg: 12m 44s | Max: 38m 05s | Hits:  46%/2646  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 10m | Avg: 23m 30s | Max: 26m 57s
      🔍 nvcc12.6           Pass:  98%/84  | Total: 20h 48m | Avg: 14m 51s | Max: 49m 36s | Hits:  43%/8729  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 36m 57s | Avg: 18m 28s | Max: 20m 06s
      🔍 nvcc               Pass:  99%/102 | Total:  1d 01h | Avg: 14m 48s | Max: 49m 36s | Hits:  43%/11375 
    🔍 cxx: Clang18 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  1h 03m | Avg: 10m 32s | Max: 24m 24s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 32s | Avg:  5m 10s | Max:  5m 20s
      🟩 Clang11            Pass: 100%/4   | Total: 36m 05s | Avg:  9m 01s | Max: 23m 27s
      🟩 Clang12            Pass: 100%/4   | Total: 15m 45s | Avg:  3m 56s | Max:  4m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 17m | Avg: 19m 17s | Max: 25m 54s
      🟩 Clang14            Pass: 100%/4   | Total: 59m 25s | Avg: 14m 51s | Max: 25m 37s
      🟩 Clang15            Pass: 100%/4   | Total: 51m 24s | Avg: 12m 51s | Max: 22m 52s
      🟩 Clang16            Pass: 100%/4   | Total: 52m 51s | Avg: 13m 12s | Max: 24m 16s
      🟩 Clang17            Pass: 100%/4   | Total: 54m 47s | Avg: 13m 41s | Max: 24m 59s
      🔍 Clang18            Pass:  87%/8   | Total:  1h 54m | Avg: 14m 17s | Max: 26m 27s
      🟩 GCC6               Pass: 100%/2   | Total: 27m 10s | Avg: 13m 35s | Max: 24m 24s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 29m | Avg: 14m 52s | Max: 23m 57s
      🟩 GCC8               Pass: 100%/6   | Total: 18m 44s | Avg:  3m 07s | Max:  4m 01s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 17m | Avg: 12m 52s | Max: 24m 04s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 18m | Avg: 19m 42s | Max: 28m 45s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 24m | Avg: 20m 34s | Max: 28m 32s
      🟩 GCC12              Pass: 100%/4   | Total: 54m 52s | Avg: 13m 43s | Max: 27m 30s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 54m | Avg: 17m 19s | Max: 49m 36s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 03m | Avg: 21m 12s | Max: 33m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 38m 05s | Avg: 38m 05s | Max: 38m 05s | Hits:  46%/2646  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 10m | Avg: 35m 25s | Max: 37m 33s | Hits:  43%/5654  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s | Hits:  41%/3075  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  97%/45  | Total:  9h 00m | Avg: 12m 00s | Max: 26m 27s
      🟩 GCC                Pass: 100%/52  | Total: 13h 04m | Avg: 15m 05s | Max: 49m 36s
      🟩 Intel              Pass: 100%/3   | Total:  1h 03m | Avg: 21m 12s | Max: 33m 03s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 37m | Avg: 39m 27s | Max: 48m 55s | Hits:  43%/11375 
    🔍 jobs: Test 🔍
      🟩 Build              Pass: 100%/96  | Total: 22h 33m | Avg: 14m 06s | Max: 48m 55s | Hits:  43%/11375 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 36m | Avg: 24m 08s | Max: 30m 05s
      🔍 Test               Pass:  66%/3   | Total:  1h 34m | Avg: 31m 22s | Max: 49m 36s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/28  | Total:  5h 41m | Avg: 12m 10s | Max: 49m 36s
      🟩 14                 Pass: 100%/28  | Total:  7h 08m | Avg: 15m 18s | Max: 38m 05s | Hits:  45%/5393  
      🟩 17                 Pass: 100%/27  | Total:  7h 39m | Avg: 17m 00s | Max: 37m 33s | Hits:  42%/2907  
      🔍 20                 Pass:  95%/20  | Total:  5h 15m | Avg: 15m 47s | Max: 48m 55s | Hits:  41%/3075  
    🟨 gpu
      🟨 v100               Pass:  99%/104 | Total:  1d 01h | Avg: 14m 52s | Max: 49m 36s | Hits:  43%/11375 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 10m | Avg: 23m 30s | Max: 26m 57s
      🟩 90a                Pass: 100%/4   | Total: 14m 19s | Avg:  3m 34s | Max:  4m 19s
    
  • 🟩 cub: Pass: 100%/104 | Total: 10h 41m | Avg: 6m 09s | Max: 30m 15s | Hits: 99%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 07m | Avg:  6m 19s | Max: 30m 15s | Hits:  99%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 33m 56s | Avg:  4m 14s | Max:  5m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 06m | Avg:  4m 27s | Max: 15m 50s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  4m 58s
      🟩 12.6               Pass: 100%/86  | Total:  9h 19m | Avg:  6m 30s | Max: 30m 15s | Hits:  99%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 41s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 06m | Avg:  4m 27s | Max: 15m 50s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  4m 58s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 12m | Avg:  6m 34s | Max: 30m 15s | Hits:  99%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 41s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 33m | Avg:  6m 12s | Max: 30m 15s | Hits:  99%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 32s | Avg:  4m 35s | Max:  5m 27s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 10s | Avg:  5m 23s | Max:  5m 33s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 29s | Avg:  4m 37s | Max:  4m 48s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 23s | Avg:  4m 35s | Max:  4m 50s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 00s | Avg:  4m 30s | Max:  5m 04s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 19s | Avg:  4m 34s | Max:  4m 42s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 30s | Avg:  4m 37s | Max:  4m 53s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 00s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  4m 57s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 13m | Avg:  8m 07s | Max: 26m 44s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 26s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 49s | Avg:  3m 58s | Max:  4m 48s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 49s | Avg:  3m 58s | Max:  4m 39s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 52s | Avg:  4m 08s | Max:  4m 46s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 12s | Avg:  4m 33s | Max:  4m 57s
      🟩 GCC11              Pass: 100%/7   | Total: 33m 46s | Avg:  4m 49s | Max:  4m 58s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 04s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 51m | Avg: 10m 43s | Max: 30m 15s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 56s | Avg:  5m 18s | Max:  5m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 50s | Avg: 15m 50s | Max: 15m 50s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 49s | Avg: 13m 24s | Max: 14m 13s | Hits:  99%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 51s | Avg: 13m 51s | Max: 13m 51s | Hits:  99%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 06m | Avg:  5m 21s | Max: 26m 44s
      🟩 GCC                Pass: 100%/51  | Total:  5h 21m | Avg:  6m 18s | Max: 30m 15s
      🟩 Intel              Pass: 100%/3   | Total: 15m 56s | Avg:  5m 18s | Max:  5m 40s
      🟩 MSVC               Pass: 100%/4   | Total: 56m 30s | Avg: 14m 07s | Max: 15m 50s | Hits:  99%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 10h 41m | Avg:  6m 09s | Max: 30m 15s | Hits:  99%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 46m | Avg:  4m 51s | Max: 15m 50s | Hits:  99%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 44s | Avg: 22m 44s | Max: 22m 44s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 48s | Avg: 18m 48s | Max: 18m 48s
      🟩 HostLaunch         Pass: 100%/3   | Total: 53m 52s | Avg: 17m 57s | Max: 18m 29s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 19m | Avg: 26m 29s | Max: 30m 15s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  4m 58s
      🟩 90a                Pass: 100%/4   | Total: 14m 14s | Avg:  3m 33s | Max:  3m 53s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 31m | Avg:  5m 23s | Max: 22m 30s
      🟩 14                 Pass: 100%/27  | Total:  2h 21m | Avg:  5m 14s | Max: 15m 50s | Hits:  99%/1454  
      🟩 17                 Pass: 100%/26  | Total:  2h 06m | Avg:  4m 51s | Max: 12m 36s | Hits:  99%/727   
      🟩 20                 Pass: 100%/23  | Total:  3h 42m | Avg:  9m 40s | Max: 30m 15s | Hits:  99%/727   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 12h 03m | Avg: 7m 01s | Max: 25m 34s | Hits: 99%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total: 10h 57m | Avg:  6m 55s | Max: 25m 34s | Hits:  99%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  1h 06m | Avg:  8m 17s | Max: 20m 59s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 20m | Avg:  5m 21s | Max: 20m 13s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 30s
      🟩 12.6               Pass: 100%/85  | Total: 10h 28m | Avg:  7m 23s | Max: 25m 34s | Hits:  99%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 20s | Avg:  4m 40s | Max:  4m 50s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 20m | Avg:  5m 21s | Max: 20m 13s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 30s
      🟩 nvcc12.6           Pass: 100%/83  | Total: 10h 19m | Avg:  7m 27s | Max: 25m 34s | Hits:  99%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 20s | Avg:  4m 40s | Max:  4m 50s
      🟩 nvcc               Pass: 100%/101 | Total: 11h 54m | Avg:  7m 04s | Max: 25m 34s | Hits:  99%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 34s | Avg:  5m 05s | Max:  6m 24s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 44s | Avg:  5m 34s | Max:  5m 53s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  5m 16s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 14s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 20s | Avg:  4m 50s | Max:  5m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 47s | Avg:  4m 56s | Max:  5m 17s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 33s | Avg:  4m 53s | Max:  5m 15s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 32s | Avg:  4m 53s | Max:  5m 07s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 38s | Avg:  4m 54s | Max:  5m 28s
      🟩 Clang18            Pass: 100%/9   | Total: 51m 40s | Avg:  5m 44s | Max: 13m 02s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 48s
      🟩 GCC7               Pass: 100%/6   | Total: 25m 13s | Avg:  4m 12s | Max:  4m 51s
      🟩 GCC8               Pass: 100%/6   | Total: 32m 29s | Avg:  5m 24s | Max:  8m 25s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 45s | Avg:  4m 17s | Max:  4m 55s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 44s | Avg:  4m 41s | Max:  5m 22s
      🟩 GCC11              Pass: 100%/7   | Total: 34m 48s | Avg:  4m 58s | Max:  5m 30s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 10m | Avg: 17m 32s | Max: 24m 05s
      🟩 GCC13              Pass: 100%/14  | Total:  2h 37m | Avg: 11m 14s | Max: 20m 59s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 20s | Avg:  6m 06s | Max:  6m 45s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 13s | Avg: 20m 13s | Max: 20m 13s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 34m 42s | Avg: 17m 21s | Max: 17m 21s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 43m 55s | Avg: 21m 57s | Max: 25m 34s | Hits:  99%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  3h 54m | Avg:  5m 06s | Max: 13m 02s
      🟩 GCC                Pass: 100%/49  | Total:  6h 11m | Avg:  7m 35s | Max: 24m 05s
      🟩 Intel              Pass: 100%/3   | Total: 18m 20s | Avg:  6m 06s | Max:  6m 45s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 38m | Avg: 19m 46s | Max: 25m 34s | Hits:  99%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 12h 03m | Avg:  7m 01s | Max: 25m 34s | Hits:  99%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 10h 32m | Avg:  6m 35s | Max: 24m 05s | Hits:  99%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 47m 33s | Avg: 11m 53s | Max: 25m 34s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 43m 43s | Avg: 14m 34s | Max: 16m 24s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 30s
      🟩 90a                Pass: 100%/4   | Total: 44m 34s | Avg: 11m 08s | Max: 14m 46s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 36m | Avg:  5m 34s | Max: 14m 17s
      🟩 14                 Pass: 100%/27  | Total:  3h 15m | Avg:  7m 14s | Max: 20m 13s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  3h 07m | Avg:  7m 13s | Max: 24m 05s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  3h 04m | Avg:  8m 21s | Max: 25m 34s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 22m | Avg: 2m 44s | Max: 11m 39s | Hits: 89%/214

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 14m | Avg:  2m 48s | Max: 11m 39s | Hits:  89%/214   
      🟩 arm64              Pass: 100%/4   | Total:  7m 57s | Avg:  1m 59s | Max:  2m 27s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 54m 20s | Avg:  2m 51s | Max: 11m 39s | Hits:  89%/107   
      🟩 12.6               Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 10m 54s | Hits:  89%/107   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 54m 20s | Avg:  2m 51s | Max: 11m 39s | Hits:  89%/107   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 10m 54s | Hits:  89%/107   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 11m 39s | Hits:  89%/214   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 36s | Avg:  2m 18s | Max:  2m 20s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 47s | Avg:  2m 23s | Max:  2m 36s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 33s | Avg:  2m 23s | Max:  2m 30s
      🟩 Clang12            Pass: 100%/4   | Total:  8m 36s | Avg:  2m 09s | Max:  2m 14s
      🟩 Clang13            Pass: 100%/4   | Total:  8m 39s | Avg:  2m 09s | Max:  2m 16s
      🟩 Clang14            Pass: 100%/4   | Total: 10m 40s | Avg:  2m 40s | Max:  4m 07s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 26s | Avg:  2m 13s | Max:  2m 13s
      🟩 Clang16            Pass: 100%/4   | Total:  8m 46s | Avg:  2m 11s | Max:  2m 27s
      🟩 Clang17            Pass: 100%/2   | Total:  4m 46s | Avg:  2m 23s | Max:  2m 23s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  4m 52s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 21s | Avg:  2m 10s | Max:  2m 18s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 47s | Avg:  2m 11s | Max:  2m 18s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 51s | Avg:  2m 12s | Max:  2m 35s
      🟩 GCC12              Pass: 100%/7   | Total: 20m 20s | Avg:  2m 54s | Max:  4m 20s
      🟩 GCC13              Pass: 100%/3   | Total:  5m 42s | Avg:  1m 54s | Max:  2m 10s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 39s | Avg: 11m 39s | Max: 11m 39s | Hits:  89%/107   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 54s | Avg: 10m 54s | Max: 10m 54s | Hits:  89%/107   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 11m | Avg:  2m 23s | Max:  4m 52s
      🟩 GCC                Pass: 100%/20  | Total: 48m 01s | Avg:  2m 24s | Max:  4m 20s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 33s | Avg: 11m 16s | Max: 11m 39s | Hits:  89%/214   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 11m 39s | Hits:  89%/214   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 00m | Avg:  2m 34s | Max: 11m 39s | Hits:  89%/214   
      🟩 Test               Pass: 100%/5   | Total: 21m 28s | Avg:  4m 17s | Max:  4m 52s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
      🟩 90a                Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 05m | Avg:  2m 21s | Max:  4m 20s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 11s | Max: 11m 39s | Hits:  89%/214   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 16m 11s | Avg: 4m 02s | Max: 4m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  4m 03s
      🟩 12.6               Pass: 100%/2   | Total:  8m 36s | Avg:  4m 18s | Max:  4m 29s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  4m 03s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 36s | Avg:  4m 18s | Max:  4m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 29s
      🟩 GCC                Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  4m 07s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
+/- Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 368)

# Runner
297 linux-amd64-cpu16
28 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 2h 32m: Pass: 100%/368 | Total: 2d 03h | Avg: 8m 24s | Max: 49m 36s | Hits: 74%/25647
  • 🟩 cub: Pass: 100%/104 | Total: 10h 41m | Avg: 6m 09s | Max: 30m 15s | Hits: 99%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 10h 07m | Avg:  6m 19s | Max: 30m 15s | Hits:  99%/2908  
      🟩 arm64              Pass: 100%/8   | Total: 33m 56s | Avg:  4m 14s | Max:  5m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 06m | Avg:  4m 27s | Max: 15m 50s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  4m 58s
      🟩 12.6               Pass: 100%/86  | Total:  9h 19m | Avg:  6m 30s | Max: 30m 15s | Hits:  99%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 41s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 06m | Avg:  4m 27s | Max: 15m 50s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  4m 58s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 12m | Avg:  6m 34s | Max: 30m 15s | Hits:  99%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 41s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 33m | Avg:  6m 12s | Max: 30m 15s | Hits:  99%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 32s | Avg:  4m 35s | Max:  5m 27s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 10s | Avg:  5m 23s | Max:  5m 33s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 29s | Avg:  4m 37s | Max:  4m 48s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 23s | Avg:  4m 35s | Max:  4m 50s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 00s | Avg:  4m 30s | Max:  5m 04s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 19s | Avg:  4m 34s | Max:  4m 42s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 30s | Avg:  4m 37s | Max:  4m 53s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 00s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  4m 57s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 13m | Avg:  8m 07s | Max: 26m 44s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 26s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 49s | Avg:  3m 58s | Max:  4m 48s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 49s | Avg:  3m 58s | Max:  4m 39s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 52s | Avg:  4m 08s | Max:  4m 46s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 12s | Avg:  4m 33s | Max:  4m 57s
      🟩 GCC11              Pass: 100%/7   | Total: 33m 46s | Avg:  4m 49s | Max:  4m 58s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 04s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 51m | Avg: 10m 43s | Max: 30m 15s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 56s | Avg:  5m 18s | Max:  5m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 50s | Avg: 15m 50s | Max: 15m 50s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 49s | Avg: 13m 24s | Max: 14m 13s | Hits:  99%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 51s | Avg: 13m 51s | Max: 13m 51s | Hits:  99%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 06m | Avg:  5m 21s | Max: 26m 44s
      🟩 GCC                Pass: 100%/51  | Total:  5h 21m | Avg:  6m 18s | Max: 30m 15s
      🟩 Intel              Pass: 100%/3   | Total: 15m 56s | Avg:  5m 18s | Max:  5m 40s
      🟩 MSVC               Pass: 100%/4   | Total: 56m 30s | Avg: 14m 07s | Max: 15m 50s | Hits:  99%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 10h 41m | Avg:  6m 09s | Max: 30m 15s | Hits:  99%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  7h 46m | Avg:  4m 51s | Max: 15m 50s | Hits:  99%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 44s | Avg: 22m 44s | Max: 22m 44s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 48s | Avg: 18m 48s | Max: 18m 48s
      🟩 HostLaunch         Pass: 100%/3   | Total: 53m 52s | Avg: 17m 57s | Max: 18m 29s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 19m | Avg: 26m 29s | Max: 30m 15s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  4m 58s
      🟩 90a                Pass: 100%/4   | Total: 14m 14s | Avg:  3m 33s | Max:  3m 53s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 31m | Avg:  5m 23s | Max: 22m 30s
      🟩 14                 Pass: 100%/27  | Total:  2h 21m | Avg:  5m 14s | Max: 15m 50s | Hits:  99%/1454  
      🟩 17                 Pass: 100%/26  | Total:  2h 06m | Avg:  4m 51s | Max: 12m 36s | Hits:  99%/727   
      🟩 20                 Pass: 100%/23  | Total:  3h 42m | Avg:  9m 40s | Max: 30m 15s | Hits:  99%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 1d 01h | Avg: 14m 59s | Max: 49m 36s | Hits: 43%/11375

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  1d 00h | Avg: 15m 06s | Max: 49m 36s | Hits:  43%/11375 
      🟩 arm64              Pass: 100%/8   | Total:  1h 48m | Avg: 13m 34s | Max: 23m 30s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 11m | Avg: 12m 44s | Max: 38m 05s | Hits:  46%/2646  
      🟩 11.8               Pass: 100%/3   | Total:  1h 10m | Avg: 23m 30s | Max: 26m 57s
      🟩 12.6               Pass: 100%/86  | Total: 21h 37m | Avg: 15m 05s | Max: 49m 36s | Hits:  43%/8729  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 36m 57s | Avg: 18m 28s | Max: 20m 06s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 11m | Avg: 12m 44s | Max: 38m 05s | Hits:  46%/2646  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 10m | Avg: 23m 30s | Max: 26m 57s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 21h 00m | Avg: 15m 00s | Max: 49m 36s | Hits:  43%/8729  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 36m 57s | Avg: 18m 28s | Max: 20m 06s
      🟩 nvcc               Pass: 100%/102 | Total:  1d 01h | Avg: 14m 55s | Max: 49m 36s | Hits:  43%/11375 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 03m | Avg: 10m 32s | Max: 24m 24s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 32s | Avg:  5m 10s | Max:  5m 20s
      🟩 Clang11            Pass: 100%/4   | Total: 36m 05s | Avg:  9m 01s | Max: 23m 27s
      🟩 Clang12            Pass: 100%/4   | Total: 15m 45s | Avg:  3m 56s | Max:  4m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 17m | Avg: 19m 17s | Max: 25m 54s
      🟩 Clang14            Pass: 100%/4   | Total: 59m 25s | Avg: 14m 51s | Max: 25m 37s
      🟩 Clang15            Pass: 100%/4   | Total: 51m 24s | Avg: 12m 51s | Max: 22m 52s
      🟩 Clang16            Pass: 100%/4   | Total: 52m 51s | Avg: 13m 12s | Max: 24m 16s
      🟩 Clang17            Pass: 100%/4   | Total: 54m 47s | Avg: 13m 41s | Max: 24m 59s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 06m | Avg: 15m 52s | Max: 39m 04s
      🟩 GCC6               Pass: 100%/2   | Total: 27m 10s | Avg: 13m 35s | Max: 24m 24s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 29m | Avg: 14m 52s | Max: 23m 57s
      🟩 GCC8               Pass: 100%/6   | Total: 18m 44s | Avg:  3m 07s | Max:  4m 01s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 17m | Avg: 12m 52s | Max: 24m 04s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 18m | Avg: 19m 42s | Max: 28m 45s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 24m | Avg: 20m 34s | Max: 28m 32s
      🟩 GCC12              Pass: 100%/4   | Total: 54m 52s | Avg: 13m 43s | Max: 27m 30s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 54m | Avg: 17m 19s | Max: 49m 36s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 03m | Avg: 21m 12s | Max: 33m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 38m 05s | Avg: 38m 05s | Max: 38m 05s | Hits:  46%/2646  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 10m | Avg: 35m 25s | Max: 37m 33s | Hits:  43%/5654  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 48m 55s | Avg: 48m 55s | Max: 48m 55s | Hits:  41%/3075  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  9h 13m | Avg: 12m 17s | Max: 39m 04s
      🟩 GCC                Pass: 100%/52  | Total: 13h 04m | Avg: 15m 05s | Max: 49m 36s
      🟩 Intel              Pass: 100%/3   | Total:  1h 03m | Avg: 21m 12s | Max: 33m 03s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 37m | Avg: 39m 27s | Max: 48m 55s | Hits:  43%/11375 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  1d 01h | Avg: 14m 59s | Max: 49m 36s | Hits:  43%/11375 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 22h 33m | Avg: 14m 06s | Max: 48m 55s | Hits:  43%/11375 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 36m | Avg: 24m 08s | Max: 30m 05s
      🟩 Test               Pass: 100%/3   | Total:  1h 46m | Avg: 35m 35s | Max: 49m 36s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 10m | Avg: 23m 30s | Max: 26m 57s
      🟩 90a                Pass: 100%/4   | Total: 14m 19s | Avg:  3m 34s | Max:  4m 19s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  5h 41m | Avg: 12m 10s | Max: 49m 36s
      🟩 14                 Pass: 100%/28  | Total:  7h 08m | Avg: 15m 18s | Max: 38m 05s | Hits:  45%/5393  
      🟩 17                 Pass: 100%/27  | Total:  7h 39m | Avg: 17m 00s | Max: 37m 33s | Hits:  42%/2907  
      🟩 20                 Pass: 100%/20  | Total:  5h 28m | Avg: 16m 24s | Max: 48m 55s | Hits:  41%/3075  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 12h 03m | Avg: 7m 01s | Max: 25m 34s | Hits: 99%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total: 10h 57m | Avg:  6m 55s | Max: 25m 34s | Hits:  99%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  1h 06m | Avg:  8m 17s | Max: 20m 59s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 20m | Avg:  5m 21s | Max: 20m 13s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 30s
      🟩 12.6               Pass: 100%/85  | Total: 10h 28m | Avg:  7m 23s | Max: 25m 34s | Hits:  99%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 20s | Avg:  4m 40s | Max:  4m 50s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 20m | Avg:  5m 21s | Max: 20m 13s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 30s
      🟩 nvcc12.6           Pass: 100%/83  | Total: 10h 19m | Avg:  7m 27s | Max: 25m 34s | Hits:  99%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 20s | Avg:  4m 40s | Max:  4m 50s
      🟩 nvcc               Pass: 100%/101 | Total: 11h 54m | Avg:  7m 04s | Max: 25m 34s | Hits:  99%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 34s | Avg:  5m 05s | Max:  6m 24s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 44s | Avg:  5m 34s | Max:  5m 53s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  5m 16s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 14s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 20s | Avg:  4m 50s | Max:  5m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 47s | Avg:  4m 56s | Max:  5m 17s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 33s | Avg:  4m 53s | Max:  5m 15s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 32s | Avg:  4m 53s | Max:  5m 07s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 38s | Avg:  4m 54s | Max:  5m 28s
      🟩 Clang18            Pass: 100%/9   | Total: 51m 40s | Avg:  5m 44s | Max: 13m 02s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 48s
      🟩 GCC7               Pass: 100%/6   | Total: 25m 13s | Avg:  4m 12s | Max:  4m 51s
      🟩 GCC8               Pass: 100%/6   | Total: 32m 29s | Avg:  5m 24s | Max:  8m 25s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 45s | Avg:  4m 17s | Max:  4m 55s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 44s | Avg:  4m 41s | Max:  5m 22s
      🟩 GCC11              Pass: 100%/7   | Total: 34m 48s | Avg:  4m 58s | Max:  5m 30s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 10m | Avg: 17m 32s | Max: 24m 05s
      🟩 GCC13              Pass: 100%/14  | Total:  2h 37m | Avg: 11m 14s | Max: 20m 59s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 20s | Avg:  6m 06s | Max:  6m 45s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 13s | Avg: 20m 13s | Max: 20m 13s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 34m 42s | Avg: 17m 21s | Max: 17m 21s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 43m 55s | Avg: 21m 57s | Max: 25m 34s | Hits:  99%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  3h 54m | Avg:  5m 06s | Max: 13m 02s
      🟩 GCC                Pass: 100%/49  | Total:  6h 11m | Avg:  7m 35s | Max: 24m 05s
      🟩 Intel              Pass: 100%/3   | Total: 18m 20s | Avg:  6m 06s | Max:  6m 45s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 38m | Avg: 19m 46s | Max: 25m 34s | Hits:  99%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 12h 03m | Avg:  7m 01s | Max: 25m 34s | Hits:  99%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 10h 32m | Avg:  6m 35s | Max: 24m 05s | Hits:  99%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 47m 33s | Avg: 11m 53s | Max: 25m 34s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 43m 43s | Avg: 14m 34s | Max: 16m 24s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 45s | Avg:  4m 55s | Max:  5m 30s
      🟩 90a                Pass: 100%/4   | Total: 44m 34s | Avg: 11m 08s | Max: 14m 46s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 36m | Avg:  5m 34s | Max: 14m 17s
      🟩 14                 Pass: 100%/27  | Total:  3h 15m | Avg:  7m 14s | Max: 20m 13s | Hits:  99%/4460  
      🟩 17                 Pass: 100%/26  | Total:  3h 07m | Avg:  7m 13s | Max: 24m 05s | Hits:  99%/2230  
      🟩 20                 Pass: 100%/22  | Total:  3h 04m | Avg:  8m 21s | Max: 25m 34s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 22m | Avg: 2m 44s | Max: 11m 39s | Hits: 89%/214

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 14m | Avg:  2m 48s | Max: 11m 39s | Hits:  89%/214   
      🟩 arm64              Pass: 100%/4   | Total:  7m 57s | Avg:  1m 59s | Max:  2m 27s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 54m 20s | Avg:  2m 51s | Max: 11m 39s | Hits:  89%/107   
      🟩 12.6               Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 10m 54s | Hits:  89%/107   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 54m 20s | Avg:  2m 51s | Max: 11m 39s | Hits:  89%/107   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 28m | Avg:  2m 40s | Max: 10m 54s | Hits:  89%/107   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 11m 39s | Hits:  89%/214   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 36s | Avg:  2m 18s | Max:  2m 20s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 47s | Avg:  2m 23s | Max:  2m 36s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 33s | Avg:  2m 23s | Max:  2m 30s
      🟩 Clang12            Pass: 100%/4   | Total:  8m 36s | Avg:  2m 09s | Max:  2m 14s
      🟩 Clang13            Pass: 100%/4   | Total:  8m 39s | Avg:  2m 09s | Max:  2m 16s
      🟩 Clang14            Pass: 100%/4   | Total: 10m 40s | Avg:  2m 40s | Max:  4m 07s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 26s | Avg:  2m 13s | Max:  2m 13s
      🟩 Clang16            Pass: 100%/4   | Total:  8m 46s | Avg:  2m 11s | Max:  2m 27s
      🟩 Clang17            Pass: 100%/2   | Total:  4m 46s | Avg:  2m 23s | Max:  2m 23s
      🟩 Clang18            Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  4m 52s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 21s | Avg:  2m 10s | Max:  2m 18s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 47s | Avg:  2m 11s | Max:  2m 18s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 51s | Avg:  2m 12s | Max:  2m 35s
      🟩 GCC12              Pass: 100%/7   | Total: 20m 20s | Avg:  2m 54s | Max:  4m 20s
      🟩 GCC13              Pass: 100%/3   | Total:  5m 42s | Avg:  1m 54s | Max:  2m 10s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 39s | Avg: 11m 39s | Max: 11m 39s | Hits:  89%/107   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 54s | Avg: 10m 54s | Max: 10m 54s | Hits:  89%/107   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 11m | Avg:  2m 23s | Max:  4m 52s
      🟩 GCC                Pass: 100%/20  | Total: 48m 01s | Avg:  2m 24s | Max:  4m 20s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 33s | Avg: 11m 16s | Max: 11m 39s | Hits:  89%/214   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 22m | Avg:  2m 44s | Max: 11m 39s | Hits:  89%/214   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 00m | Avg:  2m 34s | Max: 11m 39s | Hits:  89%/214   
      🟩 Test               Pass: 100%/5   | Total: 21m 28s | Avg:  4m 17s | Max:  4m 52s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
      🟩 90a                Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 05m | Avg:  2m 21s | Max:  4m 20s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 11s | Max: 11m 39s | Hits:  89%/214   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 16m 11s | Avg: 4m 02s | Max: 4m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  4m 03s
      🟩 12.6               Pass: 100%/2   | Total:  8m 36s | Avg:  4m 18s | Max:  4m 29s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  4m 03s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 36s | Avg:  4m 18s | Max:  4m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 29s
      🟩 GCC                Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  4m 07s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 29s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 34s | Avg: 14m 34s | Max: 14m 34s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
+/- Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 368)

# Runner
297 linux-amd64-cpu16
28 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

template <class Vector>
void TestTransformInputOutputIterator()
THRUST_DISABLE_BROKEN_GCC_VECTORIZER void TestTransformInputOutputIterator()
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This fixes our tests, but won't gcc still be miscompiling Thrust for users?

Copy link
Collaborator

@miscco miscco Sep 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That is nothing we can change. I want to note that this is exceptionally frickle and dependent on exact sizes and optimization settings, so I dont see anything we can do there

Copy link

copy-pr-bot bot commented Sep 20, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@@ -39,7 +39,7 @@ struct wrapped_function

_CCCL_EXEC_CHECK_DISABLE
template <typename... Ts>
_CCCL_FORCEINLINE _CCCL_HOST_DEVICE Result operator()(Ts&&... args) const
inline _CCCL_HOST_DEVICE Result operator()(Ts&&... args) const
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@miscco at least locally, this change avoids the gcc optimizer issue.

@ericniebler
Copy link
Collaborator Author

/ok to test

Copy link
Contributor

🟨 CI finished in 2h 00m: Pass: 99%/368 | Total: 7d 00h | Avg: 27m 29s | Max: 1h 25m | Hits: 54%/25647
  • 🟨 thrust: Pass: 98%/103 | Total: 2d 12h | Avg: 35m 01s | Max: 1h 14m | Hits: 46%/11150

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/95  | Total:  2d 07h | Avg: 35m 07s | Max:  1h 14m | Hits:  46%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  4h 29m | Avg: 33m 43s | Max: 37m 47s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  8h 45m | Avg: 35m 01s | Max:  1h 14m | Hits:  32%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 05m | Avg: 41m 48s | Max: 45m 31s
      🔍 12.6               Pass:  97%/85  | Total:  2d 01h | Avg: 34m 46s | Max:  1h 13m | Hits:  49%/8920  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 00m | Avg: 30m 27s | Max: 31m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 45m | Avg: 35m 01s | Max:  1h 14m | Hits:  32%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 05m | Avg: 41m 48s | Max: 45m 31s
      🔍 nvcc12.6           Pass:  97%/83  | Total:  2d 00h | Avg: 34m 53s | Max:  1h 13m | Hits:  49%/8920  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 27s | Max: 31m 23s
      🔍 nvcc               Pass:  98%/101 | Total:  2d 11h | Avg: 35m 06s | Max:  1h 14m | Hits:  46%/11150 
    🔍 cxx: GCC12 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  3h 32m | Avg: 35m 27s | Max: 42m 06s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 54m | Avg: 38m 01s | Max: 42m 19s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 10s | Max: 40m 53s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 01s | Max: 35m 47s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 01s | Max: 37m 54s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 23m | Avg: 35m 51s | Max: 41m 26s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 35m | Avg: 38m 53s | Max: 42m 06s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 27m | Avg: 36m 50s | Max: 41m 26s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 27m | Avg: 36m 53s | Max: 41m 47s
      🟩 Clang18            Pass: 100%/9   | Total:  4h 08m | Avg: 27m 33s | Max: 38m 25s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 25s | Max: 32m 18s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 19m | Avg: 33m 17s | Max: 37m 51s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 24m | Avg: 34m 02s | Max: 42m 20s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 35m | Avg: 35m 54s | Max: 40m 16s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 34m | Avg: 38m 41s | Max: 42m 20s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 34m | Avg: 39m 13s | Max: 45m 31s
      🔍 GCC12              Pass:  50%/4   | Total:  1h 48m | Avg: 27m 08s | Max: 40m 53s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 47m | Avg: 24m 49s | Max: 37m 47s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 27m | Avg: 49m 04s | Max: 56m 18s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 14m | Avg:  1h 14m | Max:  1h 14m | Hits:  32%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 11m | Hits:  32%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 36m | Avg: 48m 09s | Max:  1h 13m | Hits:  66%/4460  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/46  | Total:  1d 02h | Avg: 34m 28s | Max: 42m 19s
      🔍 GCC                Pass:  95%/49  | Total:  1d 02h | Avg: 31m 57s | Max: 45m 31s
      🟩 Intel              Pass: 100%/3   | Total:  2h 27m | Avg: 49m 04s | Max: 56m 18s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 08m | Avg:  1h 01m | Max:  1h 14m | Hits:  46%/11150 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  97%/96  | Total:  2d 10h | Avg: 36m 40s | Max:  1h 14m | Hits:  32%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 50m 09s | Avg: 12m 32s | Max: 23m 07s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 36m 34s | Avg: 12m 11s | Max: 12m 49s
    🟨 std
      🟩 11                 Pass: 100%/28  | Total: 13h 47m | Avg: 29m 32s | Max: 39m 41s
      🟨 14                 Pass:  96%/27  | Total: 17h 39m | Avg: 39m 14s | Max:  1h 14m | Hits:  32%/4460  
      🟨 17                 Pass:  96%/26  | Total: 16h 32m | Avg: 38m 11s | Max:  1h 11m | Hits:  32%/2230  
      🟩 20                 Pass: 100%/22  | Total: 12h 07m | Avg: 33m 04s | Max:  1h 13m | Hits:  66%/4460  
    🟨 gpu
      🟨 v100               Pass:  98%/103 | Total:  2d 12h | Avg: 35m 01s | Max:  1h 14m | Hits:  46%/11150 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 05m | Avg: 41m 48s | Max: 45m 31s
      🟩 90a                Pass: 100%/4   | Total:  1h 35m | Avg: 23m 58s | Max: 26m 25s
    
  • 🟩 cub: Pass: 100%/104 | Total: 3d 10h | Avg: 47m 31s | Max: 1h 13m | Hits: 65%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 03h | Avg: 47m 02s | Max:  1h 13m | Hits:  65%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  7h 07m | Avg: 53m 24s | Max: 55m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 11h 14m | Avg: 44m 56s | Max: 53m 44s | Hits:  65%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 28m | Avg:  1h 09m | Max:  1h 13m
      🟩 12.6               Pass: 100%/86  | Total:  2d 19h | Avg: 47m 12s | Max:  1h 02m | Hits:  65%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 47m | Avg: 53m 31s | Max: 54m 14s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 14m | Avg: 44m 56s | Max: 53m 44s | Hits:  65%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 28m | Avg:  1h 09m | Max:  1h 13m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 17h | Avg: 47m 03s | Max:  1h 02m | Hits:  65%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 31s | Max: 54m 14s
      🟩 nvcc               Pass: 100%/102 | Total:  3d 08h | Avg: 47m 24s | Max:  1h 13m | Hits:  65%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 39m | Avg: 46m 35s | Max: 49m 37s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 28m | Avg: 49m 36s | Max: 50m 01s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 20m | Avg: 50m 08s | Max: 51m 35s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 13s | Max: 57m 06s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 17m | Avg: 49m 20s | Max: 49m 42s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 04s | Max: 56m 21s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 18m | Avg: 49m 42s | Max: 50m 48s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 37s | Max: 55m 25s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 18m | Avg: 49m 36s | Max: 50m 38s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 55m | Avg: 46m 12s | Max: 56m 53s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 34m | Avg: 47m 27s | Max: 47m 35s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 44m | Avg: 47m 24s | Max: 55m 30s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 53m | Avg: 48m 57s | Max: 56m 27s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 41m | Avg: 46m 50s | Max: 50m 03s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 23m | Avg: 50m 46s | Max: 53m 03s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 45m | Avg: 57m 59s | Max:  1h 13m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 24m | Avg: 51m 14s | Max: 54m 15s
      🟩 GCC13              Pass: 100%/16  | Total:  8h 41m | Avg: 32m 34s | Max: 55m 39s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 43m | Avg: 54m 38s | Max: 56m 45s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 53m 44s | Avg: 53m 44s | Max: 53m 44s | Hits:  65%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  65%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits:  65%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 13h | Avg: 48m 55s | Max: 57m 06s
      🟩 GCC                Pass: 100%/51  | Total:  1d 14h | Avg: 44m 53s | Max:  1h 13m
      🟩 Intel              Pass: 100%/3   | Total:  2h 43m | Avg: 54m 38s | Max: 56m 45s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 58m | Avg: 59m 37s | Max:  1h 02m | Hits:  65%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 10h | Avg: 47m 31s | Max:  1h 13m | Hits:  65%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 07h | Avg: 49m 53s | Max:  1h 13m | Hits:  65%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 01s | Avg: 17m 01s | Max: 17m 01s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 11s | Avg: 16m 11s | Max: 16m 11s
      🟩 HostLaunch         Pass: 100%/3   | Total: 51m 44s | Avg: 17m 14s | Max: 17m 24s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 08m | Avg: 22m 57s | Max: 23m 13s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 28m | Avg:  1h 09m | Max:  1h 13m
      🟩 90a                Pass: 100%/4   | Total:  1h 30m | Avg: 22m 34s | Max: 24m 11s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 22h 03m | Avg: 47m 16s | Max:  1h 13m
      🟩 14                 Pass: 100%/27  | Total: 22h 19m | Avg: 49m 37s | Max:  1h 09m | Hits:  65%/1454  
      🟩 17                 Pass: 100%/26  | Total: 21h 35m | Avg: 49m 49s | Max:  1h 05m | Hits:  65%/727   
      🟩 20                 Pass: 100%/23  | Total: 16h 23m | Avg: 42m 46s | Max:  1h 01m | Hits:  65%/727   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 22h 52m | Avg: 13m 11s | Max: 1h 25m | Hits: 59%/11375

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 22h 03m | Avg: 13m 47s | Max:  1h 25m | Hits:  59%/11375 
      🟩 arm64              Pass: 100%/8   | Total: 48m 47s | Avg:  6m 05s | Max: 22m 17s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 26m | Avg: 13m 44s | Max: 37m 59s | Hits:  46%/2646  
      🟩 11.8               Pass: 100%/3   | Total: 54m 58s | Avg: 18m 19s | Max: 27m 18s
      🟩 12.6               Pass: 100%/86  | Total: 18h 31m | Avg: 12m 55s | Max:  1h 25m | Hits:  63%/8729  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 39m 16s | Avg: 19m 38s | Max: 21m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 26m | Avg: 13m 44s | Max: 37m 59s | Hits:  46%/2646  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 54m 58s | Avg: 18m 19s | Max: 27m 18s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 17h 51m | Avg: 12m 45s | Max:  1h 25m | Hits:  63%/8729  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 39m 16s | Avg: 19m 38s | Max: 21m 23s
      🟩 nvcc               Pass: 100%/102 | Total: 22h 12m | Avg: 13m 04s | Max:  1h 25m | Hits:  59%/11375 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 21m | Avg: 13m 31s | Max: 27m 20s
      🟩 Clang10            Pass: 100%/3   | Total: 58m 34s | Avg: 19m 31s | Max: 30m 53s
      🟩 Clang11            Pass: 100%/4   | Total: 36m 18s | Avg:  9m 04s | Max: 24m 22s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 53s
      🟩 Clang13            Pass: 100%/4   | Total: 16m 19s | Avg:  4m 04s | Max:  4m 21s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 10m | Avg: 17m 31s | Max: 24m 53s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 00m | Avg: 15m 00s | Max: 28m 08s
      🟩 Clang16            Pass: 100%/4   | Total: 41m 42s | Avg: 10m 25s | Max: 28m 02s
      🟩 Clang17            Pass: 100%/4   | Total: 48m 37s | Avg: 12m 09s | Max: 20m 12s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 34m | Avg: 11m 48s | Max: 22m 17s
      🟩 GCC6               Pass: 100%/2   | Total: 25m 26s | Avg: 12m 43s | Max: 22m 31s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 29m | Avg: 14m 50s | Max: 21m 13s
      🟩 GCC8               Pass: 100%/6   | Total: 59m 53s | Avg:  9m 58s | Max: 23m 56s
      🟩 GCC9               Pass: 100%/6   | Total: 38m 26s | Avg:  6m 24s | Max: 20m 38s
      🟩 GCC10              Pass: 100%/4   | Total: 36m 50s | Avg:  9m 12s | Max: 24m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 26m | Avg: 12m 21s | Max: 27m 18s
      🟩 GCC12              Pass: 100%/4   | Total: 32m 34s | Avg:  8m 08s | Max: 20m 34s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 29m | Avg: 15m 51s | Max:  1h 25m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 17m | Avg: 25m 55s | Max: 28m 52s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 37m 59s | Avg: 37m 59s | Max: 37m 59s | Hits:  46%/2646  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 18m | Avg: 39m 02s | Max: 39m 27s | Hits:  43%/5654  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 28s | Avg: 15m 28s | Max: 15m 28s | Hits:  99%/3075  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  8h 44m | Avg: 11m 39s | Max: 30m 53s
      🟩 GCC                Pass: 100%/52  | Total: 10h 38m | Avg: 12m 16s | Max:  1h 25m
      🟩 Intel              Pass: 100%/3   | Total:  1h 17m | Avg: 25m 55s | Max: 28m 52s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 11m | Avg: 32m 53s | Max: 39m 27s | Hits:  59%/11375 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 22h 52m | Avg: 13m 11s | Max:  1h 25m | Hits:  59%/11375 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 19h 24m | Avg: 12m 07s | Max: 39m 27s | Hits:  59%/11375 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 31m | Avg: 22m 46s | Max: 27m 30s
      🟩 Test               Pass: 100%/3   | Total:  1h 53m | Avg: 37m 59s | Max:  1h 25m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 54m 58s | Avg: 18m 19s | Max: 27m 18s
      🟩 90a                Pass: 100%/4   | Total: 15m 48s | Avg:  3m 57s | Max:  4m 21s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  5h 00m | Avg: 10m 44s | Max: 22m 55s
      🟩 14                 Pass: 100%/28  | Total:  6h 15m | Avg: 13m 25s | Max: 38m 38s | Hits:  46%/5393  
      🟩 17                 Pass: 100%/27  | Total:  6h 41m | Avg: 14m 51s | Max: 39m 27s | Hits:  42%/2907  
      🟩 20                 Pass: 100%/20  | Total:  4h 51m | Avg: 14m 35s | Max:  1h 25m | Hits:  99%/3075  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 2h 40m | Avg: 3m 05s | Max: 11m 40s | Hits: 88%/214

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2h 30m | Avg:  3m 07s | Max: 11m 40s | Hits:  88%/214   
      🟩 arm64              Pass: 100%/4   | Total: 10m 32s | Avg:  2m 38s | Max:  2m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total: 57m 58s | Avg:  3m 03s | Max: 10m 11s | Hits:  88%/107   
      🟩 12.6               Pass: 100%/33  | Total:  1h 42m | Avg:  3m 06s | Max: 11m 40s | Hits:  88%/107   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total: 57m 58s | Avg:  3m 03s | Max: 10m 11s | Hits:  88%/107   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  1h 42m | Avg:  3m 06s | Max: 11m 40s | Hits:  88%/107   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  2h 40m | Avg:  3m 05s | Max: 11m 40s | Hits:  88%/214   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 13s | Avg:  2m 36s | Max:  2m 37s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 11s | Avg:  2m 35s | Max:  2m 49s
      🟩 Clang11            Pass: 100%/4   | Total: 10m 17s | Avg:  2m 34s | Max:  3m 06s
      🟩 Clang12            Pass: 100%/4   | Total: 10m 39s | Avg:  2m 39s | Max:  3m 00s
      🟩 Clang13            Pass: 100%/4   | Total: 10m 00s | Avg:  2m 30s | Max:  2m 35s
      🟩 Clang14            Pass: 100%/4   | Total: 13m 11s | Avg:  3m 17s | Max:  4m 47s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 35s | Avg:  2m 47s | Max:  3m 06s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 01s | Avg:  2m 45s | Max:  2m 56s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 50s | Avg:  2m 55s | Max:  2m 55s
      🟩 Clang18            Pass: 100%/2   | Total:  6m 52s | Avg:  3m 26s | Max:  4m 15s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 48s | Avg:  2m 24s | Max:  2m 26s
      🟩 GCC10              Pass: 100%/4   | Total:  9m 42s | Avg:  2m 25s | Max:  3m 02s
      🟩 GCC11              Pass: 100%/4   | Total: 10m 14s | Avg:  2m 33s | Max:  3m 08s
      🟩 GCC12              Pass: 100%/7   | Total: 22m 17s | Avg:  3m 11s | Max:  4m 33s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 03s | Avg:  2m 41s | Max:  2m 43s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 11s | Avg: 10m 11s | Max: 10m 11s | Hits:  88%/107   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 40s | Avg: 11m 40s | Max: 11m 40s | Hits:  88%/107   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 23m | Avg:  2m 47s | Max:  4m 47s
      🟩 GCC                Pass: 100%/20  | Total: 55m 04s | Avg:  2m 45s | Max:  4m 33s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 51s | Avg: 10m 55s | Max: 11m 40s | Hits:  88%/214   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  2h 40m | Avg:  3m 05s | Max: 11m 40s | Hits:  88%/214   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 19m | Avg:  2m 57s | Max: 11m 40s | Hits:  88%/214   
      🟩 Test               Pass: 100%/5   | Total: 21m 35s | Avg:  4m 19s | Max:  4m 47s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s
      🟩 90a                Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 15m | Avg:  2m 41s | Max:  4m 33s
      🟩 20                 Pass: 100%/24  | Total:  1h 25m | Avg:  3m 33s | Max: 11m 40s | Hits:  88%/214   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 17m 22s | Avg: 4m 20s | Max: 4m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 17m 22s | Avg:  4m 20s | Max:  4m 55s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 08s
      🟩 12.6               Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  4m 55s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 08s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  4m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 22s | Avg:  4m 20s | Max:  4m 55s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 55s | Avg:  4m 55s | Max:  4m 55s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 40s | Avg:  4m 40s | Max:  4m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 03s | Avg:  4m 31s | Max:  4m 55s
      🟩 GCC                Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 17m 22s | Avg:  4m 20s | Max:  4m 55s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 17m 22s | Avg:  4m 20s | Max:  4m 55s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
+/- Thrust
+/- CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 368)

# Runner
297 linux-amd64-cpu16
28 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

[BUG]: unify _CCCL_FORCEINLINE and _CCCL_ALWAYS_INLINE
2 participants