Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor cuda::ceil_div to take two different types #2376

Merged
merged 12 commits into from
Sep 19, 2024

Conversation

miscco
Copy link
Collaborator

@miscco miscco commented Sep 5, 2024

We already use a similar function in cub.

Deprecate that and replace it with cuda::ceil_div

Copy link
Contributor

@bernhardmgruber bernhardmgruber left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work, thx a lot!

cub/cub/util_math.cuh Show resolved Hide resolved
libcudacxx/include/cuda/__cmath/ceil_div.h Outdated Show resolved Hide resolved
@fbusato
Copy link
Contributor

fbusato commented Sep 6, 2024

the code is pretty inefficient on GPU. Should we create another PR for that?

@bernhardmgruber
Copy link
Contributor

the code is pretty inefficient on GPU. Should we create another PR for that?

@miscco and @gonzalobg spent some time inspecting SASS while coming up with the current version. I am curious about your suggestion to improve it :) Be careful not overflow, so the classic (a + b - 1) / b does not work.

@fbusato
Copy link
Contributor

fbusato commented Sep 6, 2024

this was the solution I proposed a while ago

template<typename T>
//HOST_DEVICE_NODISCARD
constexpr T ceil_div(T value, T div) {
    //ASSERT_OR_ASSUME(is_zero_or_positive(value))
   //ASSERT_OR_ASSUME(div   > 0)
    using U     = ::cuda::std::__make_unsigned_t<T>;
    auto value1 = static_cast<U>(value);
    auto div1   = static_cast<U>(div);
    auto ret1   = ::cuda::std::is_unsigned<T>::value ? (value1 / div1) + (value1 % div1 > 0)
                                                     : (value1 + div1 - 1) / div1; // faster
    auto ret   = static_cast<T>(ret1);
    //ASSERT_OR_ASSUME(ret >= value / div)
    return ret;
}

Performance notes:

  • Signed type version is faster than unsigned type version
  • 64-bit type version is very slow on gpu arch if the divisor is not a compile-time constant
  • Optimized for compile-time divisor

@bernhardmgruber
Copy link
Contributor

I think (value1 / div1) + (value1 % div1 > 0) was rejected and (value1 / div1) + (value1 / div1) * div1 != 0) was deemed better. But I really like the optimization for signed types! Because casting to unsigned gives you enough range to do the fast version (value1 + div1 - 1) / div1. Nice!

Let's get this PR in and then please propose the improved version! Thx!

@miscco
Copy link
Collaborator Author

miscco commented Sep 9, 2024

I believe the optimization from @fbusato points to a flaw in the API, which is what to do with negative values.

Right now, we handle negative integer values by rounding up towards zero, which is actually incorrect.

The question I have is whether we want to restrict the API towards positive values

Copy link
Contributor

@bernhardmgruber bernhardmgruber left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In principle, looks good. I cannot assess though whether the new version for signed integers (a + b - 1) / b is strictly better.

libcudacxx/include/cuda/__cmath/ceil_div.h Show resolved Hide resolved
libcudacxx/include/cuda/__cmath/ceil_div.h Outdated Show resolved Hide resolved
Copy link
Contributor

github-actions bot commented Sep 9, 2024

🟨 CI finished in 4h 34m: Pass: 96%/417 | Total: 8d 10h | Avg: 29m 10s | Max: 1h 17m | Hits: 32%/38811
  • 🟨 cub: Pass: 93%/132 | Total: 3d 23h | Avg: 43m 11s | Max: 1h 11m | Hits: 2%/4296

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/124 | Total:  3d 15h | Avg: 42m 29s | Max:  1h 11m | Hits:   2%/4296  
      🟩 arm64              Pass: 100%/8   | Total:  7h 11m | Avg: 53m 58s | Max: 58m 16s
    🔍 ctk: 12.5 🔍
      🟩 11.1               Pass: 100%/15  | Total: 11h 16m | Avg: 45m 05s | Max: 51m 49s | Hits:   2%/716   
      🟩 11.8               Pass: 100%/3   | Total:  3h 21m | Avg:  1h 07m | Max:  1h 11m
      🔍 12.5               Pass:  92%/114 | Total:  3d 08h | Avg: 42m 18s | Max:  1h 08m | Hits:   2%/3580  
    🔍 cudacxx: nvcc12.5 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 52m 47s | Avg: 26m 23s | Max: 27m 19s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 16m | Avg: 45m 05s | Max: 51m 49s | Hits:   2%/716   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 21m | Avg:  1h 07m | Max:  1h 11m
      🔍 nvcc12.5           Pass:  91%/112 | Total:  3d 07h | Avg: 42m 35s | Max:  1h 08m | Hits:   2%/3580  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 47s | Avg: 26m 23s | Max: 27m 19s
      🔍 nvcc               Pass:  93%/130 | Total:  3d 22h | Avg: 43m 26s | Max:  1h 11m | Hits:   2%/4296  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 46m | Avg: 47m 46s | Max: 52m 35s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 31m | Avg: 50m 24s | Max: 52m 00s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 47s | Max: 56m 37s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 20m | Avg: 50m 01s | Max: 51m 32s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 00s | Max: 55m 19s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 34s | Max: 54m 19s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 33m | Avg: 53m 21s | Max: 56m 18s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 44s | Max: 56m 49s
      🟨 Clang17            Pass:  84%/26  | Total: 13h 41m | Avg: 31m 36s | Max: 58m 16s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 27m | Avg: 43m 32s | Max: 43m 37s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 39m | Avg: 46m 32s | Max: 50m 09s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 47m | Avg: 47m 53s | Max: 51m 50s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 58m | Avg: 49m 45s | Max: 56m 24s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 27m | Avg: 51m 47s | Max: 53m 36s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 43m | Avg: 57m 36s | Max:  1h 11m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 30m | Avg: 52m 42s | Max: 55m 31s
      🟨 GCC13              Pass:  82%/29  | Total: 14h 31m | Avg: 30m 02s | Max: 55m 55s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 42m | Avg: 54m 17s | Max: 54m 37s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 51m 49s | Avg: 51m 49s | Max: 51m 49s | Hits:   2%/716   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 08m | Hits:   2%/1432  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 14m | Avg:  1h 04m | Max:  1h 07m | Hits:   2%/2148  
    🟨 cxx_family
      🟨 Clang              Pass:  93%/59  | Total:  1d 17h | Avg: 42m 36s | Max: 58m 16s
      🟨 GCC                Pass:  92%/64  | Total:  1d 20h | Avg: 41m 19s | Max:  1h 11m
      🟩 Intel              Pass: 100%/3   | Total:  2h 42m | Avg: 54m 17s | Max: 54m 37s
      🟩 MSVC               Pass: 100%/6   | Total:  6h 19m | Avg:  1h 03m | Max:  1h 08m | Hits:   2%/4296  
    🟨 jobs
      🟩 Build              Pass: 100%/99  | Total:  3d 11h | Avg: 50m 35s | Max:  1h 11m | Hits:   2%/4296  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 29m | Avg: 18m 44s | Max: 21m 43s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 14m | Avg: 16m 51s | Max: 19m 06s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 36m | Avg: 19m 34s | Max: 23m 38s
      🟥 SmallGMem          Pass:   0%/1   | Total: 34m 47s | Avg: 34m 47s | Max: 34m 47s
      🟥 TestGPU            Pass:   0%/8   | Total:  3h 36m | Avg: 27m 03s | Max: 33m 06s
    🟨 gpu
      🟨 v100               Pass:  93%/132 | Total:  3d 23h | Avg: 43m 11s | Max:  1h 11m | Hits:   2%/4296  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 21m | Avg:  1h 07m | Max:  1h 11m
      🟩 90a                Pass: 100%/4   | Total:  1h 29m | Avg: 22m 19s | Max: 23m 12s
    🟨 std
      🟨 11                 Pass:  94%/34  | Total:  1d 00h | Avg: 42m 48s | Max:  1h 03m
      🟨 14                 Pass:  94%/37  | Total:  1d 03h | Avg: 44m 32s | Max:  1h 11m | Hits:   2%/2148  
      🟨 17                 Pass:  91%/37  | Total:  1d 02h | Avg: 43m 38s | Max:  1h 06m | Hits:   2%/1432  
      🟨 20                 Pass:  91%/24  | Total: 16h 23m | Avg: 40m 57s | Max:  1h 07m | Hits:   2%/716   
    
  • 🟨 libcudacxx: Pass: 95%/112 | Total: 1d 16h | Avg: 21m 53s | Max: 1h 03m | Hits: 35%/14320

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/104 | Total:  1d 14h | Avg: 22m 15s | Max:  1h 03m | Hits:  35%/14320 
      🟩 arm64              Pass: 100%/8   | Total:  2h 16m | Avg: 17m 06s | Max: 20m 37s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 39m 31s | Avg: 19m 45s | Max: 21m 45s
      🔍 nvcc               Pass:  95%/110 | Total:  1d 16h | Avg: 21m 55s | Max:  1h 03m | Hits:  35%/14320 
    🟨 ctk
      🟨 11.1               Pass:  93%/15  | Total:  4h 32m | Avg: 18m 08s | Max: 29m 34s
      🟩 11.8               Pass: 100%/3   | Total: 59m 58s | Avg: 19m 59s | Max: 22m 30s
      🟨 12.5               Pass:  95%/94  | Total:  1d 11h | Avg: 22m 32s | Max:  1h 03m | Hits:  35%/14320 
    🟨 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 39m 31s | Avg: 19m 45s | Max: 21m 45s
      🟨 nvcc11.1           Pass:  93%/15  | Total:  4h 32m | Avg: 18m 08s | Max: 29m 34s
      🟩 nvcc11.8           Pass: 100%/3   | Total: 59m 58s | Avg: 19m 59s | Max: 22m 30s
      🟨 nvcc12.5           Pass:  95%/92  | Total:  1d 10h | Avg: 22m 36s | Max:  1h 03m | Hits:  35%/14320 
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 58m | Avg: 19m 40s | Max: 22m 45s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 33s | Max: 24m 51s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 22m | Avg: 20m 30s | Max: 21m 32s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 22m | Avg: 20m 39s | Max: 22m 04s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 20m | Avg: 20m 01s | Max: 20m 48s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 20m | Avg: 20m 06s | Max: 22m 04s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 21m | Avg: 20m 25s | Max: 21m 32s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 21m | Avg: 20m 15s | Max: 23m 35s
      🟩 Clang17            Pass: 100%/14  | Total:  6h 05m | Avg: 26m 05s | Max:  1h 03m
      🟩 GCC6               Pass: 100%/2   | Total: 35m 13s | Avg: 17m 36s | Max: 20m 58s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 49m | Avg: 18m 10s | Max: 21m 12s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 47m | Avg: 17m 57s | Max: 20m 52s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 50m | Avg: 18m 27s | Max: 22m 13s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 19m | Avg: 19m 53s | Max: 22m 33s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 21m | Avg: 20m 12s | Max: 22m 30s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 18m | Avg: 19m 34s | Max: 21m 57s
      🟨 GCC13              Pass:  80%/21  | Total:  8h 13m | Avg: 23m 30s | Max:  1h 02m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 03m | Avg: 21m 18s | Max: 22m 37s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 29m 34s | Avg: 29m 34s | Max: 29m 34s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 00s | Max: 36m 24s | Hits:  35%/5628  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 41m | Avg: 33m 46s | Max: 39m 01s | Hits:  35%/8692  
    🟨 cxx_family
      🟩 Clang              Pass: 100%/47  | Total: 17h 18m | Avg: 22m 06s | Max:  1h 03m
      🟨 GCC                Pass:  92%/56  | Total: 19h 15m | Avg: 20m 38s | Max:  1h 02m
      🟩 Intel              Pass: 100%/3   | Total:  1h 03m | Avg: 21m 18s | Max: 22m 37s
      🟨 MSVC               Pass:  83%/6   | Total:  3h 12m | Avg: 32m 09s | Max: 39m 01s | Hits:  35%/14320 
    🟨 jobs
      🟨 Build              Pass:  98%/99  | Total:  1d 09h | Avg: 20m 02s | Max: 39m 01s | Hits:  35%/14320 
      🟥 NVRTC              Pass:   0%/4   | Total:  1h 28m | Avg: 22m 06s | Max: 26m 12s
      🟩 Test               Pass: 100%/8   | Total:  6h 17m | Avg: 47m 11s | Max:  1h 03m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟨 gpu
      🟨 v100               Pass:  95%/112 | Total:  1d 16h | Avg: 21m 53s | Max:  1h 03m | Hits:  35%/14320 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 59m 58s | Avg: 19m 59s | Max: 22m 30s
      🟩 90a                Pass: 100%/4   | Total: 53m 20s | Avg: 13m 20s | Max: 15m 53s
    🟨 std
      🟨 11                 Pass:  96%/29  | Total:  9h 51m | Avg: 20m 23s | Max: 38m 48s
      🟨 14                 Pass:  93%/32  | Total: 10h 42m | Avg: 20m 05s | Max: 46m 04s | Hits:  36%/5468  
      🟨 17                 Pass:  96%/31  | Total: 11h 52m | Avg: 22m 59s | Max: 56m 41s | Hits:  35%/5788  
      🟨 20                 Pass:  94%/19  | Total:  8h 22m | Avg: 26m 25s | Max:  1h 03m | Hits:  33%/3064  
    
  • 🟥 pycuda: Pass: 0%/1 | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 ctk
      🟥 12.5               Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 cudacxx
      🟥 nvcc12.5           Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 gpu
      🟥 v100               Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
    
  • 🟩 thrust: Pass: 100%/118 | Total: 2d 16h | Avg: 32m 33s | Max: 1h 17m | Hits: 36%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  2d 11h | Avg: 32m 31s | Max:  1h 17m | Hits:  36%/20079 
      🟩 arm64              Pass: 100%/8   | Total:  4h 24m | Avg: 33m 02s | Max: 38m 56s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 06m | Avg: 32m 26s | Max:  1h 03m | Hits:   4%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  2h 05m | Avg: 41m 42s | Max: 47m 21s
      🟩 12.5               Pass: 100%/100 | Total:  2d 05h | Avg: 32m 17s | Max:  1h 17m | Hits:  40%/17848 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  1h 08m | Avg: 34m 14s | Max: 34m 34s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 06m | Avg: 32m 26s | Max:  1h 03m | Hits:   4%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 05m | Avg: 41m 42s | Max: 47m 21s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  2d 04h | Avg: 32m 15s | Max:  1h 17m | Hits:  40%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 14s | Max: 34m 34s
      🟩 nvcc               Pass: 100%/116 | Total:  2d 14h | Avg: 32m 31s | Max:  1h 17m | Hits:  36%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 11m | Avg: 31m 50s | Max: 36m 16s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 46m | Avg: 35m 25s | Max: 40m 05s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 02s | Max: 40m 14s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 33s | Max: 36m 39s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 10s | Max: 37m 10s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 25m | Avg: 36m 15s | Max: 39m 30s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 43s | Max: 40m 50s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 37s | Max: 38m 37s
      🟩 Clang17            Pass: 100%/18  | Total:  7h 04m | Avg: 23m 35s | Max: 38m 30s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 10s | Max: 34m 26s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 15m | Avg: 32m 34s | Max: 41m 04s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 12m | Avg: 32m 04s | Max: 35m 23s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 23m | Avg: 33m 50s | Max: 39m 51s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 23m | Avg: 35m 50s | Max: 43m 04s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 17m | Avg: 36m 44s | Max: 47m 21s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 38m | Avg: 39m 39s | Max: 43m 56s
      🟩 GCC13              Pass: 100%/20  | Total:  7h 37m | Avg: 22m 51s | Max: 38m 56s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 14m | Avg: 44m 58s | Max: 47m 44s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:   4%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 11m | Hits:   4%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  4h 43m | Avg: 47m 16s | Max:  1h 17m | Hits:  52%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  1d 01h | Avg: 30m 25s | Max: 40m 50s
      🟩 GCC                Pass: 100%/55  | Total:  1d 03h | Avg: 30m 21s | Max: 47m 21s
      🟩 Intel              Pass: 100%/3   | Total:  2h 14m | Avg: 44m 58s | Max: 47m 44s
      🟩 MSVC               Pass: 100%/9   | Total:  8h 05m | Avg: 53m 56s | Max:  1h 17m | Hits:  36%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  2d 16h | Avg: 32m 33s | Max:  1h 17m | Hits:  36%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 11h | Avg: 36m 17s | Max:  1h 17m | Hits:   4%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 58m | Avg: 10m 46s | Max: 21m 31s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 10m | Avg: 16m 20s | Max: 23m 39s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 05m | Avg: 41m 42s | Max: 47m 21s
      🟩 90a                Pass: 100%/4   | Total:  1h 31m | Avg: 22m 52s | Max: 26m 13s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 13h 21m | Avg: 26m 42s | Max: 41m 05s
      🟩 14                 Pass: 100%/34  | Total: 19h 38m | Avg: 34m 39s | Max:  1h 11m | Hits:  28%/8924  
      🟩 17                 Pass: 100%/33  | Total: 19h 20m | Avg: 35m 10s | Max:  1h 17m | Hits:  36%/6693  
      🟩 20                 Pass: 100%/21  | Total: 11h 40m | Avg: 33m 22s | Max:  1h 15m | Hits:  52%/4462  
    
  • 🟩 cudax: Pass: 100%/54 | Total: 2h 37m | Avg: 2m 55s | Max: 9m 08s | Hits: 20%/116

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  2h 29m | Avg:  2m 59s | Max:  9m 08s | Hits:  20%/116   
      🟩 arm64              Pass: 100%/4   | Total:  8m 32s | Avg:  2m 08s | Max:  2m 19s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 07m | Avg:  2m 56s | Max:  7m 06s | Hits:  20%/58    
      🟩 12.5               Pass: 100%/31  | Total:  1h 30m | Avg:  2m 54s | Max:  9m 08s | Hits:  20%/58    
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 07m | Avg:  2m 56s | Max:  7m 06s | Hits:  20%/58    
      🟩 nvcc12.5           Pass: 100%/31  | Total:  1h 30m | Avg:  2m 54s | Max:  9m 08s | Hits:  20%/58    
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  2h 37m | Avg:  2m 55s | Max:  9m 08s | Hits:  20%/116   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 09s | Avg:  2m 34s | Max:  2m 40s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 54s | Avg:  2m 57s | Max:  3m 17s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 58s | Avg:  2m 29s | Max:  2m 38s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 01s | Avg:  2m 45s | Max:  2m 55s
      🟩 Clang13            Pass: 100%/4   | Total: 10m 23s | Avg:  2m 35s | Max:  2m 41s
      🟩 Clang14            Pass: 100%/6   | Total: 18m 07s | Avg:  3m 01s | Max:  3m 48s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 00s | Avg:  2m 30s | Max:  2m 30s
      🟩 Clang16            Pass: 100%/6   | Total: 17m 46s | Avg:  2m 57s | Max:  4m 01s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 08s | Avg:  2m 34s | Max:  2m 34s
      🟩 GCC10              Pass: 100%/4   | Total: 10m 35s | Avg:  2m 38s | Max:  2m 47s
      🟩 GCC11              Pass: 100%/4   | Total: 10m 04s | Avg:  2m 31s | Max:  2m 48s
      🟩 GCC12              Pass: 100%/12  | Total: 32m 36s | Avg:  2m 43s | Max:  3m 42s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 06s | Avg:  7m 06s | Max:  7m 06s | Hits:  20%/58    
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 08s | Avg:  9m 08s | Max:  9m 08s | Hits:  20%/58    
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 23m | Avg:  2m 46s | Max:  4m 01s
      🟩 GCC                Pass: 100%/22  | Total: 58m 23s | Avg:  2m 39s | Max:  3m 42s
      🟩 MSVC               Pass: 100%/2   | Total: 16m 14s | Avg:  8m 07s | Max:  9m 08s | Hits:  20%/116   
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  2h 37m | Avg:  2m 55s | Max:  9m 08s | Hits:  20%/116   
    🟩 jobs
      🟩 Build              Pass: 100%/46  | Total:  2h 08m | Avg:  2m 47s | Max:  9m 08s | Hits:  20%/116   
      🟩 Test               Pass: 100%/8   | Total: 29m 31s | Avg:  3m 41s | Max:  4m 01s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 90a                Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s
    🟩 std
      🟩 17                 Pass: 100%/30  | Total:  1h 20m | Avg:  2m 41s | Max:  3m 52s
      🟩 20                 Pass: 100%/24  | Total:  1h 16m | Avg:  3m 12s | Max:  9m 08s | Hits:  20%/116   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 417)

# Runner
304 linux-amd64-cpu16
62 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

@fbusato
Copy link
Contributor

fbusato commented Sep 10, 2024

@miscco please keep this function working only with non-negative numbers. Ceiling division with negative numbers is extremely rare and prohibits optimizations (fast path)

@miscco
Copy link
Collaborator Author

miscco commented Sep 10, 2024

@miscco please keep this function working only with non-negative numbers. Ceiling division with negative numbers is extremely rare and prohibits optimizations (fast path)

Yeah I somehow overlooked that we in fact do assert that

Copy link
Contributor

🟨 CI finished in 11h 35m: Pass: 99%/433 | Total: 8d 21h | Avg: 29m 38s | Max: 1h 18m | Hits: 20%/38228
  • 🟨 cub: Pass: 97%/136 | Total: 4d 04h | Avg: 44m 16s | Max: 1h 18m | Hits: 2%/3635

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/128 | Total:  3d 21h | Avg: 43m 39s | Max:  1h 18m | Hits:   2%/3635  
      🟩 arm64              Pass: 100%/8   | Total:  7h 11m | Avg: 53m 59s | Max: 58m 55s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 56s | Max: 56m 25s
      🔍 nvcc               Pass:  97%/134 | Total:  4d 02h | Avg: 44m 06s | Max:  1h 18m | Hits:   2%/3635  
    🟨 ctk
      🟨 11.1               Pass:  93%/15  | Total: 10h 56m | Avg: 43m 45s | Max: 49m 10s
      🟩 11.8               Pass: 100%/3   | Total:  3h 34m | Avg:  1h 11m | Max:  1h 13m
      🟨 12.6               Pass:  98%/118 | Total:  3d 13h | Avg: 43m 38s | Max:  1h 18m | Hits:   2%/3635  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 49m | Avg: 54m 56s | Max: 56m 25s
      🟨 nvcc11.1           Pass:  93%/15  | Total: 10h 56m | Avg: 43m 45s | Max: 49m 10s
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 34m | Avg:  1h 11m | Max:  1h 13m
      🟨 nvcc12.6           Pass:  98%/116 | Total:  3d 12h | Avg: 43m 27s | Max:  1h 18m | Hits:   2%/3635  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 02m | Avg: 50m 29s | Max: 56m 30s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 34m | Avg: 51m 29s | Max: 53m 10s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 21s | Max: 56m 27s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 25m | Avg: 51m 20s | Max: 53m 32s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 09s | Max: 53m 23s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 27m | Avg: 51m 54s | Max: 53m 38s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 32m | Avg: 53m 14s | Max: 55m 12s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 20m | Avg: 50m 08s | Max: 51m 55s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 27m | Avg: 51m 52s | Max: 56m 02s
      🟩 Clang18            Pass: 100%/26  | Total: 14h 32m | Avg: 33m 34s | Max: 58m 55s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 27m | Avg: 43m 52s | Max: 45m 33s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 59m | Avg: 49m 57s | Max: 56m 25s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 49m | Avg: 48m 13s | Max: 53m 25s
      🟩 GCC9               Pass: 100%/6   | Total:  5h 02m | Avg: 50m 20s | Max: 56m 08s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 31m | Avg: 52m 47s | Max: 55m 25s
      🟩 GCC11              Pass: 100%/7   | Total:  7h 12m | Avg:  1h 01m | Max:  1h 13m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 27m | Avg: 51m 47s | Max: 53m 14s
      🟨 GCC13              Pass:  93%/29  | Total: 14h 59m | Avg: 31m 00s | Max:  1h 18m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 54m | Avg: 58m 10s | Max:  1h 01m
      🟥 MSVC14.16          Pass:   0%/1   | Total: 10m 00s | Avg: 10m 00s | Max: 10m 00s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m | Hits:   2%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 21m | Avg:  1h 07m | Max:  1h 09m | Hits:   2%/2181  
    🟨 cxx_family
      🟩 Clang              Pass: 100%/63  | Total:  1d 22h | Avg: 44m 09s | Max: 58m 55s
      🟨 GCC                Pass:  96%/64  | Total:  1d 21h | Avg: 42m 37s | Max:  1h 18m
      🟩 Intel              Pass: 100%/3   | Total:  2h 54m | Avg: 58m 10s | Max:  1h 01m
      🟨 MSVC               Pass:  83%/6   | Total:  5h 35m | Avg: 55m 55s | Max:  1h 09m | Hits:   2%/3635  
    🟨 jobs
      🟨 Build              Pass:  99%/103 | Total:  3d 16h | Avg: 51m 38s | Max:  1h 13m | Hits:   2%/3635  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 29m | Avg: 18m 42s | Max: 25m 31s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 08m | Avg: 16m 01s | Max: 20m 33s
      🟨 HostLaunch         Pass:  87%/8   | Total:  3h 39m | Avg: 27m 22s | Max:  1h 18m
      🟥 SmallGMem          Pass:   0%/1   | Total: 13m 30s | Avg: 13m 30s | Max: 13m 30s
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 11m | Avg: 23m 58s | Max: 28m 06s
    🟨 std
      🟩 11                 Pass: 100%/35  | Total:  1d 02h | Avg: 45m 54s | Max:  1h 18m
      🟨 14                 Pass:  97%/38  | Total:  1d 03h | Avg: 43m 44s | Max:  1h 08m | Hits:   2%/1454  
      🟨 17                 Pass:  94%/38  | Total:  1d 04h | Avg: 44m 50s | Max:  1h 12m | Hits:   2%/1454  
      🟩 20                 Pass: 100%/25  | Total: 17h 27m | Avg: 41m 55s | Max:  1h 09m | Hits:   2%/727   
    🟨 gpu
      🟨 v100               Pass:  97%/136 | Total:  4d 04h | Avg: 44m 16s | Max:  1h 18m | Hits:   2%/3635  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 34m | Avg:  1h 11m | Max:  1h 13m
      🟩 90a                Pass: 100%/4   | Total:  1h 32m | Avg: 23m 12s | Max: 23m 55s
    
  • 🟨 libcudacxx: Pass: 99%/116 | Total: 1d 18h | Avg: 22m 13s | Max: 1h 18m | Hits: 3%/14320

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/108 | Total:  1d 16h | Avg: 22m 34s | Max:  1h 18m | Hits:   3%/14320 
      🟩 arm64              Pass: 100%/8   | Total:  2h 19m | Avg: 17m 24s | Max: 21m 47s
    🔍 ctk: 11.1 🔍
      🔍 11.1               Pass:  93%/15  | Total:  4h 26m | Avg: 17m 45s | Max: 30m 20s
      🟩 11.8               Pass: 100%/3   | Total: 58m 54s | Avg: 19m 38s | Max: 20m 45s
      🟩 12.6               Pass: 100%/98  | Total:  1d 13h | Avg: 22m 58s | Max:  1h 18m | Hits:   3%/14320 
    🔍 cudacxx: nvcc11.1 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 40m 08s | Avg: 20m 04s | Max: 21m 19s
      🔍 nvcc11.1           Pass:  93%/15  | Total:  4h 26m | Avg: 17m 45s | Max: 30m 20s
      🟩 nvcc11.8           Pass: 100%/3   | Total: 58m 54s | Avg: 19m 38s | Max: 20m 45s
      🟩 nvcc12.6           Pass: 100%/96  | Total:  1d 12h | Avg: 23m 02s | Max:  1h 18m | Hits:   3%/14320 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 40m 08s | Avg: 20m 04s | Max: 21m 19s
      🔍 nvcc               Pass:  99%/114 | Total:  1d 18h | Avg: 22m 15s | Max:  1h 18m | Hits:   3%/14320 
    🚨 cxx: MSVC14.16 🚨
      🟩 Clang9             Pass: 100%/6   | Total:  1h 49m | Avg: 18m 18s | Max: 21m 39s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 06m | Avg: 22m 02s | Max: 25m 46s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 19m | Avg: 19m 53s | Max: 21m 58s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 18m | Avg: 19m 31s | Max: 20m 27s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 19m | Avg: 19m 46s | Max: 20m 46s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 16m | Avg: 19m 14s | Max: 19m 54s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 20m | Avg: 20m 08s | Max: 21m 14s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 20m | Avg: 20m 04s | Max: 22m 50s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 18m | Avg: 19m 43s | Max: 21m 39s
      🟩 Clang18            Pass: 100%/14  | Total:  7h 01m | Avg: 30m 05s | Max:  1h 09m
      🟩 GCC6               Pass: 100%/2   | Total: 35m 36s | Avg: 17m 48s | Max: 21m 49s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 49m | Avg: 18m 19s | Max: 21m 20s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 46m | Avg: 17m 45s | Max: 21m 57s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 48m | Avg: 18m 02s | Max: 21m 35s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 18m | Avg: 19m 41s | Max: 22m 47s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 17m | Avg: 19m 37s | Max: 21m 15s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 19m | Avg: 19m 48s | Max: 21m 15s
      🟩 GCC13              Pass: 100%/21  | Total:  8h 25m | Avg: 24m 04s | Max:  1h 18m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 07m | Avg: 22m 30s | Max: 25m 40s
      🔥 MSVC14.16          Pass:   0%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 25s | Max: 34m 27s | Hits:   3%/5628  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 42m | Avg: 34m 14s | Max: 38m 12s | Hits:   3%/8692  
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/51  | Total: 19h 10m | Avg: 22m 33s | Max:  1h 09m
      🟩 GCC                Pass: 100%/56  | Total: 19h 21m | Avg: 20m 44s | Max:  1h 18m
      🟩 Intel              Pass: 100%/3   | Total:  1h 07m | Avg: 22m 30s | Max: 25m 40s
      🔍 MSVC               Pass:  83%/6   | Total:  3h 17m | Avg: 32m 59s | Max: 38m 12s | Hits:   3%/14320 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  99%/103 | Total:  1d 10h | Avg: 19m 51s | Max: 38m 12s | Hits:   3%/14320 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 10m | Avg: 17m 37s | Max: 19m 17s
      🟩 Test               Pass: 100%/8   | Total:  7h 40m | Avg: 57m 35s | Max:  1h 18m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 36s | Avg:  1m 36s | Max:  1m 36s
    🔍 std: 14 🔍
      🟩 11                 Pass: 100%/30  | Total: 10h 30m | Avg: 21m 01s | Max: 44m 01s
      🔍 14                 Pass:  96%/33  | Total: 10h 59m | Avg: 19m 59s | Max: 49m 21s | Hits:   3%/5468  
      🟩 17                 Pass: 100%/32  | Total: 12h 26m | Avg: 23m 19s | Max:  1h 09m | Hits:   3%/5788  
      🟩 20                 Pass: 100%/20  | Total:  8h 59m | Avg: 26m 57s | Max:  1h 18m | Hits:   3%/3064  
    🟨 gpu
      🟨 v100               Pass:  99%/116 | Total:  1d 18h | Avg: 22m 13s | Max:  1h 18m | Hits:   3%/14320 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 58m 54s | Avg: 19m 38s | Max: 20m 45s
      🟩 90a                Pass: 100%/4   | Total: 53m 45s | Avg: 13m 26s | Max: 15m 33s
    
  • 🟩 thrust: Pass: 100%/122 | Total: 2d 19h | Avg: 33m 03s | Max: 1h 14m | Hits: 36%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/114 | Total:  2d 14h | Avg: 33m 00s | Max:  1h 14m | Hits:  36%/20079 
      🟩 arm64              Pass: 100%/8   | Total:  4h 30m | Avg: 33m 49s | Max: 38m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 23m | Avg: 33m 34s | Max:  1h 04m | Hits:   4%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  2h 08m | Avg: 42m 44s | Max: 47m 04s
      🟩 12.6               Pass: 100%/104 | Total:  2d 08h | Avg: 32m 42s | Max:  1h 14m | Hits:  40%/17848 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 02m | Avg: 31m 08s | Max: 31m 59s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 23m | Avg: 33m 34s | Max:  1h 04m | Hits:   4%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 08m | Avg: 42m 44s | Max: 47m 04s
      🟩 nvcc12.6           Pass: 100%/102 | Total:  2d 07h | Avg: 32m 44s | Max:  1h 14m | Hits:  40%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 08s | Max: 31m 59s
      🟩 nvcc               Pass: 100%/120 | Total:  2d 18h | Avg: 33m 05s | Max:  1h 14m | Hits:  36%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 25m | Avg: 34m 13s | Max: 41m 28s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 45m | Avg: 35m 09s | Max: 37m 54s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 28s | Max: 37m 03s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 15m | Avg: 33m 59s | Max: 36m 54s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 23m | Avg: 35m 45s | Max: 41m 14s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 34m | Avg: 38m 30s | Max: 41m 49s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 30s | Max: 38m 37s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 21m | Avg: 35m 25s | Max: 40m 20s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 11s | Max: 37m 36s
      🟩 Clang18            Pass: 100%/18  | Total:  7h 09m | Avg: 23m 50s | Max: 38m 35s
      🟩 GCC6               Pass: 100%/2   | Total: 59m 43s | Avg: 29m 51s | Max: 33m 46s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 08m | Avg: 31m 29s | Max: 36m 58s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 32m | Avg: 35m 25s | Max: 42m 39s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 19m | Avg: 33m 19s | Max: 40m 02s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 23m | Avg: 35m 53s | Max: 42m 00s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 21m | Avg: 37m 21s | Max: 47m 04s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 32m | Avg: 38m 05s | Max: 44m 46s
      🟩 GCC13              Pass: 100%/20  | Total:  7h 50m | Avg: 23m 32s | Max: 38m 39s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 16m | Avg: 45m 39s | Max: 50m 17s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:   4%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 14m | Hits:   4%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  4h 35m | Avg: 45m 52s | Max:  1h 13m | Hits:  52%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  1d 04h | Avg: 31m 24s | Max: 41m 49s
      🟩 GCC                Pass: 100%/55  | Total:  1d 04h | Avg: 30m 42s | Max: 47m 04s
      🟩 Intel              Pass: 100%/3   | Total:  2h 16m | Avg: 45m 39s | Max: 50m 17s
      🟩 MSVC               Pass: 100%/9   | Total:  8h 00m | Avg: 53m 23s | Max:  1h 14m | Hits:  36%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/122 | Total:  2d 19h | Avg: 33m 03s | Max:  1h 14m | Hits:  36%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  2d 15h | Avg: 36m 46s | Max:  1h 14m | Hits:   4%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 57m | Avg: 10m 42s | Max: 22m 22s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 08m | Avg: 16m 07s | Max: 18m 51s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 08m | Avg: 42m 44s | Max: 47m 04s
      🟩 90a                Pass: 100%/4   | Total:  1h 41m | Avg: 25m 22s | Max: 28m 09s
    🟩 std
      🟩 11                 Pass: 100%/31  | Total: 14h 05m | Avg: 27m 15s | Max: 40m 21s
      🟩 14                 Pass: 100%/35  | Total: 20h 22m | Avg: 34m 54s | Max:  1h 13m | Hits:  28%/8924  
      🟩 17                 Pass: 100%/34  | Total: 20h 26m | Avg: 36m 03s | Max:  1h 14m | Hits:  36%/6693  
      🟩 20                 Pass: 100%/22  | Total: 12h 20m | Avg: 33m 40s | Max:  1h 12m | Hits:  52%/4462  
    
  • 🟩 cudax: Pass: 100%/58 | Total: 3h 06m | Avg: 3m 13s | Max: 8m 37s | Hits: 14%/194

    🟩 cpu
      🟩 amd64              Pass: 100%/54  | Total:  2h 54m | Avg:  3m 13s | Max:  8m 37s | Hits:  14%/194   
      🟩 arm64              Pass: 100%/4   | Total: 12m 05s | Avg:  3m 01s | Max:  5m 00s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 10m | Avg:  3m 03s | Max:  7m 55s | Hits:  24%/97    
      🟩 12.6               Pass: 100%/35  | Total:  1h 56m | Avg:  3m 19s | Max:  8m 37s | Hits:   5%/97    
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 10m | Avg:  3m 03s | Max:  7m 55s | Hits:  24%/97    
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1h 56m | Avg:  3m 19s | Max:  8m 37s | Hits:   5%/97    
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/58  | Total:  3h 06m | Avg:  3m 13s | Max:  8m 37s | Hits:  14%/194   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 37s | Avg:  2m 48s | Max:  3m 04s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 42s | Avg:  2m 51s | Max:  3m 06s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 25s | Avg:  2m 51s | Max:  3m 01s
      🟩 Clang12            Pass: 100%/4   | Total: 10m 43s | Avg:  2m 40s | Max:  3m 02s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 00s | Avg:  3m 00s | Max:  3m 37s
      🟩 Clang14            Pass: 100%/6   | Total: 19m 07s | Avg:  3m 11s | Max:  3m 48s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 08s | Avg:  3m 34s | Max:  3m 35s
      🟩 Clang16            Pass: 100%/4   | Total: 14m 35s | Avg:  3m 38s | Max:  5m 00s
      🟩 Clang17            Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 49s
      🟩 Clang18            Pass: 100%/4   | Total: 13m 25s | Avg:  3m 21s | Max:  4m 00s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 56s | Avg:  2m 58s | Max:  3m 20s
      🟩 GCC10              Pass: 100%/4   | Total: 10m 27s | Avg:  2m 36s | Max:  2m 45s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 15s | Avg:  2m 48s | Max:  3m 17s
      🟩 GCC12              Pass: 100%/9   | Total: 27m 43s | Avg:  3m 04s | Max:  3m 38s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 34s | Avg:  2m 31s | Max:  2m 55s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 55s | Avg:  7m 55s | Max:  7m 55s | Hits:  24%/97    
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 37s | Avg:  8m 37s | Max:  8m 37s | Hits:   5%/97    
    🟩 cxx_family
      🟩 Clang              Pass: 100%/34  | Total:  1h 47m | Avg:  3m 09s | Max:  5m 00s
      🟩 GCC                Pass: 100%/22  | Total:  1h 02m | Avg:  2m 51s | Max:  3m 38s
      🟩 MSVC               Pass: 100%/2   | Total: 16m 32s | Avg:  8m 16s | Max:  8m 37s | Hits:  14%/194   
    🟩 gpu
      🟩 v100               Pass: 100%/58  | Total:  3h 06m | Avg:  3m 13s | Max:  8m 37s | Hits:  14%/194   
    🟩 jobs
      🟩 Build              Pass: 100%/50  | Total:  2h 36m | Avg:  3m 08s | Max:  8m 37s | Hits:  14%/194   
      🟩 Test               Pass: 100%/8   | Total: 29m 45s | Avg:  3m 43s | Max:  4m 00s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 23s | Avg:  2m 23s | Max:  2m 23s
      🟩 90a                Pass: 100%/1   | Total:  2m 55s | Avg:  2m 55s | Max:  2m 55s
    🟩 std
      🟩 17                 Pass: 100%/32  | Total:  1h 35m | Avg:  2m 58s | Max:  3m 59s
      🟩 20                 Pass: 100%/26  | Total:  1h 31m | Avg:  3m 31s | Max:  8m 37s | Hits:  14%/194   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 433)

# Runner
320 linux-amd64-cpu16
62 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 7h 30m: Pass: 97%/433 | Total: 12d 19h | Avg: 42m 33s | Max: 1h 52m | Hits: 31%/41653
  • 🟨 cub: Pass: 93%/136 | Total: 7d 03h | Avg: 1h 15m | Max: 1h 52m | Hits: 2%/4362

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/128 | Total:  6d 15h | Avg:  1h 14m | Max:  1h 52m | Hits:   2%/4362  
      🟩 arm64              Pass: 100%/8   | Total: 12h 12m | Avg:  1h 31m | Max:  1h 37m
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total: 17h 48m | Avg:  1h 11m | Max:  1h 18m | Hits:   2%/727   
      🟩 11.8               Pass: 100%/3   | Total:  5h 20m | Avg:  1h 46m | Max:  1h 52m
      🔍 12.6               Pass:  92%/118 | Total:  6d 04h | Avg:  1h 15m | Max:  1h 37m | Hits:   2%/3635  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 37m | Avg:  1h 18m | Max:  1h 19m
      🟩 nvcc11.1           Pass: 100%/15  | Total: 17h 48m | Avg:  1h 11m | Max:  1h 18m | Hits:   2%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  5h 20m | Avg:  1h 46m | Max:  1h 52m
      🔍 nvcc12.6           Pass:  92%/116 | Total:  6d 01h | Avg:  1h 15m | Max:  1h 37m | Hits:   2%/3635  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 37m | Avg:  1h 18m | Max:  1h 19m
      🔍 nvcc               Pass:  93%/134 | Total:  7d 00h | Avg:  1h 15m | Max:  1h 52m | Hits:   2%/4362  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  6h 41m | Avg:  1h 06m | Max:  1h 27m
      🟩 Clang10            Pass: 100%/3   | Total:  2h 46m | Avg: 55m 39s | Max: 58m 07s
      🟩 Clang11            Pass: 100%/4   | Total:  5h 44m | Avg:  1h 26m | Max:  1h 30m
      🟩 Clang12            Pass: 100%/4   | Total:  5h 30m | Avg:  1h 22m | Max:  1h 25m
      🟩 Clang13            Pass: 100%/4   | Total:  5h 37m | Avg:  1h 24m | Max:  1h 26m
      🟩 Clang14            Pass: 100%/4   | Total:  5h 29m | Avg:  1h 22m | Max:  1h 23m
      🟩 Clang15            Pass: 100%/4   | Total:  5h 37m | Avg:  1h 24m | Max:  1h 30m
      🟩 Clang16            Pass: 100%/4   | Total:  5h 56m | Avg:  1h 29m | Max:  1h 35m
      🟩 Clang17            Pass: 100%/4   | Total:  5h 18m | Avg:  1h 19m | Max:  1h 31m
      🟨 Clang18            Pass:  84%/26  | Total:  1d 12h | Avg:  1h 24m | Max:  1h 37m
      🟩 GCC6               Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 16m
      🟩 GCC7               Pass: 100%/6   | Total:  7h 48m | Avg:  1h 18m | Max:  1h 27m
      🟩 GCC8               Pass: 100%/6   | Total:  7h 46m | Avg:  1h 17m | Max:  1h 25m
      🟩 GCC9               Pass: 100%/6   | Total:  7h 04m | Avg:  1h 10m | Max:  1h 27m
      🟩 GCC10              Pass: 100%/4   | Total:  5h 48m | Avg:  1h 27m | Max:  1h 31m
      🟩 GCC11              Pass: 100%/7   | Total:  9h 28m | Avg:  1h 21m | Max:  1h 52m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 44m | Avg: 56m 11s | Max: 58m 17s
      🟨 GCC13              Pass:  82%/29  | Total:  1d 07h | Avg:  1h 05m | Max:  1h 35m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  3h 30m | Avg:  1h 10m | Max:  1h 31m
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits:   2%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 13m | Hits:   2%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 45m | Avg:  1h 15m | Max:  1h 18m | Hits:   2%/2181  
    🟨 cxx_family
      🟨 Clang              Pass:  93%/63  | Total:  3d 13h | Avg:  1h 21m | Max:  1h 37m
      🟨 GCC                Pass:  92%/64  | Total:  3d 03h | Avg:  1h 10m | Max:  1h 52m
      🟩 Intel              Pass: 100%/3   | Total:  3h 30m | Avg:  1h 10m | Max:  1h 31m
      🟩 MSVC               Pass: 100%/6   | Total:  7h 10m | Avg:  1h 11m | Max:  1h 18m | Hits:   2%/4362  
    🟨 jobs
      🟩 Build              Pass: 100%/103 | Total:  5d 10h | Avg:  1h 16m | Max:  1h 52m | Hits:   2%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  9h 59m | Avg:  1h 14m | Max:  1h 27m
      🟩 GraphCapture       Pass: 100%/8   | Total:  9h 34m | Avg:  1h 11m | Max:  1h 22m
      🟩 HostLaunch         Pass: 100%/8   | Total:  9h 49m | Avg:  1h 13m | Max:  1h 24m
      🟥 SmallGMem          Pass:   0%/1   | Total: 30m 14s | Avg: 30m 14s | Max: 30m 14s
      🟥 TestGPU            Pass:   0%/8   | Total: 10h 43m | Avg:  1h 20m | Max:  1h 31m
    🟨 gpu
      🟨 v100               Pass:  93%/136 | Total:  7d 03h | Avg:  1h 15m | Max:  1h 52m | Hits:   2%/4362  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  5h 20m | Avg:  1h 46m | Max:  1h 52m
      🟩 90a                Pass: 100%/4   | Total:  1h 50m | Avg: 27m 36s | Max: 29m 31s
    🟨 std
      🟨 11                 Pass:  94%/35  | Total:  1d 21h | Avg:  1h 18m | Max:  1h 44m
      🟨 14                 Pass:  94%/38  | Total:  2d 00h | Avg:  1h 16m | Max:  1h 43m | Hits:   2%/2181  
      🟨 17                 Pass:  92%/38  | Total:  1d 20h | Avg:  1h 09m | Max:  1h 52m | Hits:   2%/1454  
      🟨 20                 Pass:  92%/25  | Total:  1d 09h | Avg:  1h 19m | Max:  1h 35m | Hits:   2%/727   
    
  • 🟨 libcudacxx: Pass: 99%/116 | Total: 2d 08h | Avg: 29m 04s | Max: 1h 24m | Hits: 34%/17017

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/108 | Total:  2d 04h | Avg: 29m 16s | Max:  1h 24m | Hits:  34%/17017 
      🟩 arm64              Pass: 100%/8   | Total:  3h 32m | Avg: 26m 33s | Max: 30m 33s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  6h 07m | Avg: 24m 30s | Max: 37m 58s | Hits:  36%/2644  
      🟩 11.8               Pass: 100%/3   | Total:  1h 26m | Avg: 28m 45s | Max: 33m 28s
      🔍 12.6               Pass:  98%/98  | Total:  2d 00h | Avg: 29m 47s | Max:  1h 24m | Hits:  34%/14373 
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 47m 57s | Avg: 23m 58s | Max: 24m 51s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 07m | Avg: 24m 30s | Max: 37m 58s | Hits:  36%/2644  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 26m | Avg: 28m 45s | Max: 33m 28s
      🔍 nvcc12.6           Pass:  98%/96  | Total:  1d 23h | Avg: 29m 54s | Max:  1h 24m | Hits:  34%/14373 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 47m 57s | Avg: 23m 58s | Max: 24m 51s
      🔍 nvcc               Pass:  99%/114 | Total:  2d 07h | Avg: 29m 10s | Max:  1h 24m | Hits:  34%/17017 
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  2h 37m | Avg: 26m 11s | Max: 36m 35s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 26m | Avg: 28m 44s | Max: 36m 11s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 10s | Max: 31m 20s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 36s | Max: 29m 40s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 51m | Avg: 27m 47s | Max: 32m 05s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 35s | Max: 28m 39s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 51m | Avg: 27m 48s | Max: 33m 18s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 44m | Avg: 26m 09s | Max: 30m 33s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 00s | Max: 31m 04s
      🟩 Clang18            Pass: 100%/14  | Total:  7h 13m | Avg: 30m 56s | Max:  1h 15m
      🟩 GCC6               Pass: 100%/2   | Total: 45m 13s | Avg: 22m 36s | Max: 24m 51s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 26m | Avg: 24m 25s | Max: 28m 10s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 20m | Avg: 23m 25s | Max: 27m 07s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 32m | Avg: 25m 27s | Max: 28m 20s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 52m | Avg: 28m 07s | Max: 32m 51s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 19m | Avg: 28m 32s | Max: 34m 03s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 55m | Avg: 28m 46s | Max: 31m 39s
      🔍 GCC13              Pass:  95%/21  | Total: 11h 13m | Avg: 32m 04s | Max:  1h 24m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 37m | Avg: 32m 26s | Max: 39m 10s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 37m 58s | Avg: 37m 58s | Max: 37m 58s | Hits:  36%/2644  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 19m | Avg: 39m 39s | Max: 43m 57s | Hits:  34%/5650  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 19m | Avg: 46m 38s | Max: 52m 56s | Hits:  33%/8723  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/51  | Total: 23h 53m | Avg: 28m 05s | Max:  1h 15m
      🔍 GCC                Pass:  98%/56  | Total:  1d 02h | Avg: 28m 19s | Max:  1h 24m
      🟩 Intel              Pass: 100%/3   | Total:  1h 37m | Avg: 32m 26s | Max: 39m 10s
      🟩 MSVC               Pass: 100%/6   | Total:  4h 17m | Avg: 42m 51s | Max: 52m 56s | Hits:  34%/17017 
    🔍 jobs: Test 🔍
      🟩 Build              Pass: 100%/103 | Total:  1d 23h | Avg: 27m 28s | Max: 52m 56s | Hits:  34%/17017 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 35m | Avg: 23m 56s | Max: 31m 47s
      🔍 Test               Pass:  87%/8   | Total:  7h 26m | Avg: 55m 48s | Max:  1h 24m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 50s | Avg:  1m 50s | Max:  1m 50s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/30  | Total: 12h 07m | Avg: 24m 15s | Max: 49m 49s
      🟩 14                 Pass: 100%/33  | Total: 15h 08m | Avg: 27m 31s | Max:  1h 03m | Hits:  36%/8134  
      🟩 17                 Pass: 100%/32  | Total: 17h 51m | Avg: 33m 28s | Max:  1h 19m | Hits:  33%/5810  
      🔍 20                 Pass:  95%/20  | Total: 11h 04m | Avg: 33m 12s | Max:  1h 24m | Hits:  31%/3073  
    🟨 gpu
      🟨 v100               Pass:  99%/116 | Total:  2d 08h | Avg: 29m 04s | Max:  1h 24m | Hits:  34%/17017 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 26m | Avg: 28m 45s | Max: 33m 28s
      🟩 90a                Pass: 100%/4   | Total:  1h 21m | Avg: 20m 29s | Max: 23m 48s
    
  • 🟩 thrust: Pass: 100%/122 | Total: 3d 01h | Avg: 36m 07s | Max: 1h 33m | Hits: 36%/20070

    🟩 cpu
      🟩 amd64              Pass: 100%/114 | Total:  2d 20h | Avg: 35m 57s | Max:  1h 33m | Hits:  36%/20070 
      🟩 arm64              Pass: 100%/8   | Total:  5h 09m | Avg: 38m 41s | Max: 45m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 09m | Avg: 36m 36s | Max:  1h 07m | Hits:   4%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  2h 27m | Avg: 49m 08s | Max: 50m 28s
      🟩 12.6               Pass: 100%/104 | Total:  2d 13h | Avg: 35m 41s | Max:  1h 33m | Hits:  40%/17840 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 01m | Avg: 30m 55s | Max: 31m 36s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 09m | Avg: 36m 36s | Max:  1h 07m | Hits:   4%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 27m | Avg: 49m 08s | Max: 50m 28s
      🟩 nvcc12.6           Pass: 100%/102 | Total:  2d 12h | Avg: 35m 46s | Max:  1h 33m | Hits:  40%/17840 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 55s | Max: 31m 36s
      🟩 nvcc               Pass: 100%/120 | Total:  3d 00h | Avg: 36m 13s | Max:  1h 33m | Hits:  36%/20070 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 31m | Avg: 35m 13s | Max: 39m 48s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 00m | Avg: 40m 15s | Max: 42m 56s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 31m | Avg: 37m 49s | Max: 40m 22s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 29m | Avg: 37m 15s | Max: 41m 59s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 28m | Avg: 37m 06s | Max: 41m 09s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 33m | Avg: 38m 25s | Max: 41m 43s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 41m | Avg: 40m 17s | Max: 44m 34s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 33m | Avg: 38m 17s | Max: 42m 02s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 35m | Avg: 38m 52s | Max: 43m 14s
      🟩 Clang18            Pass: 100%/18  | Total:  7h 36m | Avg: 25m 20s | Max: 42m 59s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 28s | Max: 36m 49s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 39m | Avg: 36m 31s | Max: 42m 25s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 42m | Avg: 37m 08s | Max: 41m 35s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 44m | Avg: 37m 25s | Max: 44m 54s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 40m | Avg: 40m 12s | Max: 44m 11s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 00m | Avg: 42m 54s | Max: 50m 28s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 45m | Avg: 41m 22s | Max: 46m 14s
      🟩 GCC13              Pass: 100%/20  | Total:  8h 22m | Avg: 25m 06s | Max: 45m 49s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 28m | Avg: 49m 28s | Max: 54m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:   4%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 14m | Hits:   4%/4460  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  5h 22m | Avg: 53m 43s | Max:  1h 33m | Hits:  52%/13380 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  1d 07h | Avg: 33m 49s | Max: 44m 34s
      🟩 GCC                Pass: 100%/55  | Total:  1d 07h | Avg: 33m 51s | Max: 50m 28s
      🟩 Intel              Pass: 100%/3   | Total:  2h 28m | Avg: 49m 28s | Max: 54m 12s
      🟩 MSVC               Pass: 100%/9   | Total:  8h 56m | Avg: 59m 37s | Max:  1h 33m | Hits:  36%/20070 
    🟩 gpu
      🟩 v100               Pass: 100%/122 | Total:  3d 01h | Avg: 36m 07s | Max:  1h 33m | Hits:  36%/20070 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  2d 21h | Avg: 40m 12s | Max:  1h 33m | Hits:   4%/13380 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 09m | Avg: 11m 45s | Max: 24m 46s | Hits:  99%/6690  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 17m | Avg: 17m 09s | Max: 23m 10s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 27m | Avg: 49m 08s | Max: 50m 28s
      🟩 90a                Pass: 100%/4   | Total:  1h 37m | Avg: 24m 19s | Max: 26m 26s
    🟩 std
      🟩 11                 Pass: 100%/31  | Total: 15h 41m | Avg: 30m 21s | Max: 47m 06s
      🟩 14                 Pass: 100%/35  | Total: 22h 29m | Avg: 38m 33s | Max:  1h 22m | Hits:  28%/8920  
      🟩 17                 Pass: 100%/34  | Total: 21h 54m | Avg: 38m 40s | Max:  1h 13m | Hits:  36%/6690  
      🟩 20                 Pass: 100%/22  | Total: 13h 22m | Avg: 36m 29s | Max:  1h 33m | Hits:  52%/4460  
    
  • 🟩 cudax: Pass: 100%/58 | Total: 5h 34m | Avg: 5m 45s | Max: 14m 46s | Hits: 22%/204

    🟩 cpu
      🟩 amd64              Pass: 100%/54  | Total:  5h 10m | Avg:  5m 44s | Max: 14m 46s | Hits:  22%/204   
      🟩 arm64              Pass: 100%/4   | Total: 23m 54s | Avg:  5m 58s | Max:  6m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  2h 09m | Avg:  5m 37s | Max: 14m 46s | Hits:  22%/102   
      🟩 12.6               Pass: 100%/35  | Total:  3h 25m | Avg:  5m 51s | Max: 14m 18s | Hits:  22%/102   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  2h 09m | Avg:  5m 37s | Max: 14m 46s | Hits:  22%/102   
      🟩 nvcc12.6           Pass: 100%/35  | Total:  3h 25m | Avg:  5m 51s | Max: 14m 18s | Hits:  22%/102   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/58  | Total:  5h 34m | Avg:  5m 45s | Max: 14m 46s | Hits:  22%/204   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total: 13m 19s | Avg:  6m 39s | Max:  6m 46s
      🟩 Clang10            Pass: 100%/2   | Total: 14m 17s | Avg:  7m 08s | Max:  7m 26s
      🟩 Clang11            Pass: 100%/4   | Total: 26m 17s | Avg:  6m 34s | Max:  7m 12s
      🟩 Clang12            Pass: 100%/4   | Total: 25m 08s | Avg:  6m 17s | Max:  7m 14s
      🟩 Clang13            Pass: 100%/4   | Total: 23m 30s | Avg:  5m 52s | Max:  7m 02s
      🟩 Clang14            Pass: 100%/6   | Total: 24m 52s | Avg:  4m 08s | Max:  6m 23s
      🟩 Clang15            Pass: 100%/2   | Total: 12m 08s | Avg:  6m 04s | Max:  7m 03s
      🟩 Clang16            Pass: 100%/4   | Total: 25m 13s | Avg:  6m 18s | Max:  6m 49s
      🟩 Clang17            Pass: 100%/2   | Total: 12m 44s | Avg:  6m 22s | Max:  6m 36s
      🟩 Clang18            Pass: 100%/4   | Total: 14m 35s | Avg:  3m 38s | Max:  4m 24s
      🟩 GCC9               Pass: 100%/2   | Total: 12m 11s | Avg:  6m 05s | Max:  6m 27s
      🟩 GCC10              Pass: 100%/4   | Total: 24m 47s | Avg:  6m 11s | Max:  7m 41s
      🟩 GCC11              Pass: 100%/4   | Total: 23m 29s | Avg:  5m 52s | Max:  6m 15s
      🟩 GCC12              Pass: 100%/9   | Total: 35m 26s | Avg:  3m 56s | Max:  6m 36s
      🟩 GCC13              Pass: 100%/3   | Total: 17m 13s | Avg:  5m 44s | Max:  6m 03s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 14m 46s | Avg: 14m 46s | Max: 14m 46s | Hits:  22%/102   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s | Hits:  22%/102   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/34  | Total:  3h 12m | Avg:  5m 38s | Max:  7m 26s
      🟩 GCC                Pass: 100%/22  | Total:  1h 53m | Avg:  5m 08s | Max:  7m 41s
      🟩 MSVC               Pass: 100%/2   | Total: 29m 04s | Avg: 14m 32s | Max: 14m 46s | Hits:  22%/204   
    🟩 gpu
      🟩 v100               Pass: 100%/58  | Total:  5h 34m | Avg:  5m 45s | Max: 14m 46s | Hits:  22%/204   
    🟩 jobs
      🟩 Build              Pass: 100%/50  | Total:  5h 02m | Avg:  6m 02s | Max: 14m 46s | Hits:  22%/204   
      🟩 Test               Pass: 100%/8   | Total: 31m 59s | Avg:  3m 59s | Max:  4m 24s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  6m 36s | Avg:  6m 36s | Max:  6m 36s
      🟩 90a                Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
    🟩 std
      🟩 17                 Pass: 100%/32  | Total:  2h 57m | Avg:  5m 32s | Max:  7m 26s
      🟩 20                 Pass: 100%/26  | Total:  2h 36m | Avg:  6m 01s | Max: 14m 46s | Hits:  22%/204   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 21s | Avg: 15m 21s | Max: 15m 21s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 433)

# Runner
320 linux-amd64-cpu16
62 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 1d 09h: Pass: 97%/433 | Total: 8d 17h | Avg: 29m 03s | Max: 1h 42m | Hits: 52%/41653
  • 🟨 cub: Pass: 92%/136 | Total: 5d 07h | Avg: 56m 26s | Max: 1h 42m | Hits: 40%/4362

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/128 | Total:  4d 22h | Avg: 55m 32s | Max:  1h 42m | Hits:  40%/4362  
      🟩 arm64              Pass: 100%/8   | Total:  9h 25m | Avg:  1h 10m | Max:  1h 29m
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total: 11h 43m | Avg: 46m 53s | Max:  1h 08m | Hits:  40%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 43m | Avg: 54m 22s | Max: 57m 22s
      🔍 12.6               Pass:  91%/118 | Total:  4d 17h | Avg: 57m 42s | Max:  1h 42m | Hits:  40%/3635  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 58m | Avg: 59m 29s | Max:  1h 00m
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 43m | Avg: 46m 53s | Max:  1h 08m | Hits:  40%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 43m | Avg: 54m 22s | Max: 57m 22s
      🔍 nvcc12.6           Pass:  91%/116 | Total:  4d 15h | Avg: 57m 40s | Max:  1h 42m | Hits:  40%/3635  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 58m | Avg: 59m 29s | Max:  1h 00m
      🔍 nvcc               Pass:  92%/134 | Total:  5d 05h | Avg: 56m 23s | Max:  1h 42m | Hits:  40%/4362  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  7h 26m | Avg:  1h 14m | Max:  1h 27m
      🟩 Clang10            Pass: 100%/3   | Total:  4h 03m | Avg:  1h 21m | Max:  1h 27m
      🟩 Clang11            Pass: 100%/4   | Total:  5h 22m | Avg:  1h 20m | Max:  1h 26m
      🟩 Clang12            Pass: 100%/4   | Total:  5h 18m | Avg:  1h 19m | Max:  1h 25m
      🟩 Clang13            Pass: 100%/4   | Total:  5h 15m | Avg:  1h 18m | Max:  1h 22m
      🟩 Clang14            Pass: 100%/4   | Total:  2h 54m | Avg: 43m 32s | Max: 44m 24s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 59m | Avg: 44m 56s | Max: 47m 55s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 58m | Avg: 44m 38s | Max: 45m 53s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 43m 43s
      🟨 Clang18            Pass:  84%/26  | Total:  1d 07h | Avg:  1h 12m | Max:  1h 42m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 20m | Avg: 40m 15s | Max: 40m 41s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 17m | Avg: 42m 57s | Max: 46m 49s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 09m | Avg: 41m 36s | Max: 44m 24s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 55m | Avg: 29m 11s | Max: 43m 27s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 56m | Avg: 44m 01s | Max: 45m 02s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 40m | Avg: 31m 29s | Max: 57m 22s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 28s | Avg:  4m 52s | Max:  5m 21s
      🟨 GCC13              Pass:  79%/29  | Total:  1d 04h | Avg: 58m 13s | Max:  1h 41m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  3h 55m | Avg:  1h 18m | Max:  1h 31m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 49s | Avg: 55m 49s | Max: 55m 49s | Hits:  40%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 05m | Hits:  40%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 19m | Avg:  1h 06m | Max:  1h 11m | Hits:  40%/2181  
    🟨 cxx_family
      🟨 Clang              Pass:  93%/63  | Total:  2d 21h | Avg:  1h 06m | Max:  1h 42m
      🟨 GCC                Pass:  90%/64  | Total:  1d 23h | Avg: 44m 48s | Max:  1h 41m
      🟩 Intel              Pass: 100%/3   | Total:  3h 55m | Avg:  1h 18m | Max:  1h 31m
      🟩 MSVC               Pass: 100%/6   | Total:  6h 23m | Avg:  1h 03m | Max:  1h 11m | Hits:  40%/4362  
    🟨 jobs
      🟩 Build              Pass: 100%/103 | Total:  3d 14h | Avg: 50m 32s | Max:  1h 31m | Hits:  40%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total: 10h 01m | Avg:  1h 15m | Max:  1h 25m
      🟨 GraphCapture       Pass:  87%/8   | Total:  8h 51m | Avg:  1h 06m | Max:  1h 22m
      🟩 HostLaunch         Pass: 100%/8   | Total: 10h 04m | Avg:  1h 15m | Max:  1h 24m
      🟥 SmallGMem          Pass:   0%/1   | Total: 30m 08s | Avg: 30m 08s | Max: 30m 08s
      🟥 TestGPU            Pass:   0%/8   | Total: 11h 42m | Avg:  1h 27m | Max:  1h 42m
    🟨 gpu
      🟨 v100               Pass:  92%/136 | Total:  5d 07h | Avg: 56m 26s | Max:  1h 42m | Hits:  40%/4362  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 43m | Avg: 54m 22s | Max: 57m 22s
      🟩 90a                Pass: 100%/4   | Total:  1h 36m | Avg: 24m 12s | Max: 25m 45s
    🟨 std
      🟨 11                 Pass:  94%/35  | Total:  1d 10h | Avg: 59m 06s | Max:  1h 41m
      🟨 14                 Pass:  92%/38  | Total:  1d 11h | Avg: 55m 18s | Max:  1h 29m | Hits:  40%/2181  
      🟨 17                 Pass:  92%/38  | Total:  1d 09h | Avg: 52m 32s | Max:  1h 39m | Hits:  40%/1454  
      🟨 20                 Pass:  92%/25  | Total:  1d 01h | Avg:  1h 00m | Max:  1h 42m | Hits:  40%/727   
    
  • 🟩 thrust: Pass: 100%/122 | Total: 1d 12h | Avg: 17m 59s | Max: 1h 16m | Hits: 52%/20070

    🟩 cpu
      🟩 amd64              Pass: 100%/114 | Total:  1d 09h | Avg: 17m 47s | Max:  1h 16m | Hits:  52%/20070 
      🟩 arm64              Pass: 100%/8   | Total:  2h 47m | Avg: 20m 55s | Max: 40m 20s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 33m | Avg: 14m 15s | Max:  1h 07m | Hits:  29%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 44m 52s | Avg: 14m 57s | Max: 32m 45s
      🟩 12.6               Pass: 100%/104 | Total:  1d 08h | Avg: 18m 37s | Max:  1h 16m | Hits:  55%/17840 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 33m | Avg: 14m 15s | Max:  1h 07m | Hits:  29%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 44m 52s | Avg: 14m 57s | Max: 32m 45s
      🟩 nvcc12.6           Pass: 100%/102 | Total:  1d 08h | Avg: 18m 53s | Max:  1h 16m | Hits:  55%/17840 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 38s
      🟩 nvcc               Pass: 100%/120 | Total:  1d 12h | Avg: 18m 12s | Max:  1h 16m | Hits:  52%/20070 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 23m | Avg: 33m 53s | Max: 39m 48s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 53m | Avg: 37m 55s | Max: 44m 44s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 11s | Max: 37m 58s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 27m | Avg: 36m 54s | Max: 43m 09s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 00s | Max: 37m 53s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 39s | Avg:  5m 24s | Max:  5m 58s
      🟩 Clang15            Pass: 100%/4   | Total: 22m 16s | Avg:  5m 34s | Max:  5m 51s
      🟩 Clang16            Pass: 100%/4   | Total: 22m 24s | Avg:  5m 36s | Max:  6m 05s
      🟩 Clang17            Pass: 100%/4   | Total: 22m 22s | Avg:  5m 35s | Max:  6m 08s
      🟩 Clang18            Pass: 100%/18  | Total:  2h 50m | Avg:  9m 29s | Max: 29m 00s
      🟩 GCC6               Pass: 100%/2   | Total:  9m 10s | Avg:  4m 35s | Max:  4m 51s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 50s | Avg:  4m 48s | Max:  5m 10s
      🟩 GCC8               Pass: 100%/6   | Total: 30m 19s | Avg:  5m 03s | Max:  5m 20s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 55s | Avg:  4m 59s | Max:  5m 45s
      🟩 GCC10              Pass: 100%/4   | Total: 23m 11s | Avg:  5m 47s | Max:  6m 00s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 06m | Avg:  9m 29s | Max: 32m 45s
      🟩 GCC12              Pass: 100%/4   | Total: 59m 55s | Avg: 14m 58s | Max: 42m 26s
      🟩 GCC13              Pass: 100%/20  | Total:  4h 50m | Avg: 14m 31s | Max: 40m 20s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 25m | Avg: 48m 25s | Max: 51m 47s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  29%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits:  29%/4460  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  4h 51m | Avg: 48m 33s | Max:  1h 16m | Hits:  64%/13380 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total: 16h 45m | Avg: 18m 16s | Max: 44m 44s
      🟩 GCC                Pass: 100%/55  | Total:  8h 58m | Avg:  9m 47s | Max: 42m 26s
      🟩 Intel              Pass: 100%/3   | Total:  2h 25m | Avg: 48m 25s | Max: 51m 47s
      🟩 MSVC               Pass: 100%/9   | Total:  8h 27m | Avg: 56m 21s | Max:  1h 16m | Hits:  52%/20070 
    🟩 gpu
      🟩 v100               Pass: 100%/122 | Total:  1d 12h | Avg: 17m 59s | Max:  1h 16m | Hits:  52%/20070 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  1d 07h | Avg: 18m 31s | Max:  1h 16m | Hits:  29%/13380 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 05m | Avg: 11m 23s | Max: 23m 23s | Hits:  99%/6690  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 43m | Avg: 20m 25s | Max: 29m 00s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 44m 52s | Avg: 14m 57s | Max: 32m 45s
      🟩 90a                Pass: 100%/4   | Total: 18m 29s | Avg:  4m 37s | Max:  4m 42s
    🟩 std
      🟩 11                 Pass: 100%/31  | Total:  7h 17m | Avg: 14m 06s | Max: 43m 29s
      🟩 14                 Pass: 100%/35  | Total: 11h 27m | Avg: 19m 38s | Max:  1h 15m | Hits:  46%/8920  
      🟩 17                 Pass: 100%/34  | Total: 10h 55m | Avg: 19m 16s | Max:  1h 14m | Hits:  52%/6690  
      🟩 20                 Pass: 100%/22  | Total:  6h 56m | Avg: 18m 54s | Max:  1h 16m | Hits:  64%/4460  
    
  • 🟩 libcudacxx: Pass: 100%/116 | Total: 1d 18h | Avg: 21m 46s | Max: 1h 19m | Hits: 55%/17017

    🟩 cpu
      🟩 amd64              Pass: 100%/108 | Total:  1d 15h | Avg: 21m 59s | Max:  1h 19m | Hits:  55%/17017 
      🟩 arm64              Pass: 100%/8   | Total:  2h 30m | Avg: 18m 52s | Max: 26m 34s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  4h 53m | Avg: 19m 32s | Max: 33m 37s | Hits:  47%/2644  
      🟩 11.8               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 54s | Max: 27m 14s
      🟩 12.6               Pass: 100%/98  | Total:  1d 12h | Avg: 22m 03s | Max:  1h 19m | Hits:  56%/14373 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 35m 52s | Avg: 17m 56s | Max: 18m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 53m | Avg: 19m 32s | Max: 33m 37s | Hits:  47%/2644  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 11m | Avg: 23m 54s | Max: 27m 14s
      🟩 nvcc12.6           Pass: 100%/96  | Total:  1d 11h | Avg: 22m 08s | Max:  1h 19m | Hits:  56%/14373 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 35m 52s | Avg: 17m 56s | Max: 18m 38s
      🟩 nvcc               Pass: 100%/114 | Total:  1d 17h | Avg: 21m 51s | Max:  1h 19m | Hits:  55%/17017 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 52m | Avg: 18m 42s | Max: 22m 59s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 18s | Max: 29m 16s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 14m | Avg: 18m 32s | Max: 27m 12s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 34m | Avg: 23m 30s | Max: 26m 07s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 14m | Avg: 18m 43s | Max: 26m 33s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 35m | Avg: 23m 47s | Max: 26m 57s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 34m | Avg: 23m 42s | Max: 27m 57s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 37m | Avg: 24m 22s | Max: 29m 13s
      🟩 Clang17            Pass: 100%/4   | Total: 56m 00s | Avg: 14m 00s | Max: 27m 16s
      🟩 Clang18            Pass: 100%/14  | Total:  4h 07m | Avg: 17m 41s | Max: 49m 13s
      🟩 GCC6               Pass: 100%/2   | Total: 25m 26s | Avg: 12m 43s | Max: 22m 35s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 08m | Avg: 21m 20s | Max: 25m 13s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 04m | Avg: 20m 46s | Max: 24m 03s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 57m | Avg: 19m 39s | Max: 26m 06s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 10m | Avg: 17m 44s | Max: 26m 59s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 48m | Avg: 24m 03s | Max: 27m 24s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 38m | Avg: 24m 37s | Max: 29m 35s
      🟩 GCC13              Pass: 100%/21  | Total:  8h 11m | Avg: 23m 25s | Max:  1h 19m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 21m | Avg: 27m 19s | Max: 31m 43s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 33m 37s | Avg: 33m 37s | Max: 33m 37s | Hits:  47%/2644  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 12m | Avg: 36m 11s | Max: 38m 13s | Hits:  44%/5650  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 27m | Avg: 29m 12s | Max: 37m 50s | Hits:  64%/8723  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 17h 05m | Avg: 20m 06s | Max: 49m 13s
      🟩 GCC                Pass: 100%/56  | Total: 20h 25m | Avg: 21m 53s | Max:  1h 19m
      🟩 Intel              Pass: 100%/3   | Total:  1h 21m | Avg: 27m 19s | Max: 31m 43s
      🟩 MSVC               Pass: 100%/6   | Total:  3h 13m | Avg: 32m 15s | Max: 38m 13s | Hits:  55%/17017 
    🟩 gpu
      🟩 v100               Pass: 100%/116 | Total:  1d 18h | Avg: 21m 46s | Max:  1h 19m | Hits:  55%/17017 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  1d 12h | Avg: 21m 04s | Max: 38m 13s | Hits:  55%/17017 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 53m | Avg: 28m 21s | Max: 33m 21s
      🟩 Test               Pass: 100%/8   | Total:  4h 00m | Avg: 30m 06s | Max:  1h 19m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 13s | Avg:  2m 13s | Max:  2m 13s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 11m | Avg: 23m 54s | Max: 27m 14s
      🟩 90a                Pass: 100%/4   | Total:  1h 05m | Avg: 16m 21s | Max: 19m 13s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 10h 33m | Avg: 21m 07s | Max: 49m 13s
      🟩 14                 Pass: 100%/33  | Total: 11h 13m | Avg: 20m 24s | Max: 34m 41s | Hits:  46%/8134  
      🟩 17                 Pass: 100%/32  | Total: 12h 24m | Avg: 23m 16s | Max: 38m 13s | Hits:  44%/5810  
      🟩 20                 Pass: 100%/20  | Total:  7h 52m | Avg: 23m 38s | Max:  1h 19m | Hits:  98%/3073  
    
  • 🟩 cudax: Pass: 100%/58 | Total: 2h 49m | Avg: 2m 55s | Max: 11m 08s | Hits: 89%/204

    🟩 cpu
      🟩 amd64              Pass: 100%/54  | Total:  2h 40m | Avg:  2m 58s | Max: 11m 08s | Hits:  89%/204   
      🟩 arm64              Pass: 100%/4   | Total:  9m 19s | Avg:  2m 19s | Max:  2m 49s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 10m | Avg:  3m 03s | Max: 10m 34s | Hits:  89%/102   
      🟩 12.6               Pass: 100%/35  | Total:  1h 39m | Avg:  2m 50s | Max: 11m 08s | Hits:  89%/102   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 10m | Avg:  3m 03s | Max: 10m 34s | Hits:  89%/102   
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1h 39m | Avg:  2m 50s | Max: 11m 08s | Hits:  89%/102   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/58  | Total:  2h 49m | Avg:  2m 55s | Max: 11m 08s | Hits:  89%/204   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 54s | Avg:  2m 27s | Max:  2m 32s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 43s | Avg:  2m 21s | Max:  2m 24s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 52s | Avg:  2m 28s | Max:  2m 37s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 56s | Avg:  2m 29s | Max:  2m 46s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 51s | Avg:  2m 27s | Max:  2m 39s
      🟩 Clang14            Pass: 100%/6   | Total: 16m 12s | Avg:  2m 42s | Max:  4m 02s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 11s | Avg:  2m 05s | Max:  2m 07s
      🟩 Clang16            Pass: 100%/4   | Total:  8m 17s | Avg:  2m 04s | Max:  2m 10s
      🟩 Clang17            Pass: 100%/2   | Total:  4m 46s | Avg:  2m 23s | Max:  2m 30s
      🟩 Clang18            Pass: 100%/4   | Total: 16m 55s | Avg:  4m 13s | Max:  7m 14s
      🟩 GCC9               Pass: 100%/2   | Total:  3m 44s | Avg:  1m 52s | Max:  1m 53s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 17s | Avg:  2m 04s | Max:  2m 11s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 17s | Avg:  2m 04s | Max:  2m 21s
      🟩 GCC12              Pass: 100%/9   | Total: 29m 54s | Avg:  3m 19s | Max:  5m 45s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 03s | Avg:  2m 41s | Max:  2m 49s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 34s | Avg: 10m 34s | Max: 10m 34s | Hits:  89%/102   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 08s | Avg: 11m 08s | Max: 11m 08s | Hits:  89%/102   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/34  | Total:  1h 29m | Avg:  2m 38s | Max:  7m 14s
      🟩 GCC                Pass: 100%/22  | Total: 58m 15s | Avg:  2m 38s | Max:  5m 45s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 42s | Avg: 10m 51s | Max: 11m 08s | Hits:  89%/204   
    🟩 gpu
      🟩 v100               Pass: 100%/58  | Total:  2h 49m | Avg:  2m 55s | Max: 11m 08s | Hits:  89%/204   
    🟩 jobs
      🟩 Build              Pass: 100%/50  | Total:  2h 10m | Avg:  2m 36s | Max: 11m 08s | Hits:  89%/204   
      🟩 Test               Pass: 100%/8   | Total: 39m 32s | Avg:  4m 56s | Max:  7m 14s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
      🟩 90a                Pass: 100%/1   | Total:  2m 44s | Avg:  2m 44s | Max:  2m 44s
    🟩 std
      🟩 17                 Pass: 100%/32  | Total:  1h 23m | Avg:  2m 35s | Max:  7m 14s
      🟩 20                 Pass: 100%/26  | Total:  1h 26m | Avg:  3m 19s | Max: 11m 08s | Hits:  89%/204   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 39s | Avg: 16m 39s | Max: 16m 39s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 433)

# Runner
320 linux-amd64-cpu16
62 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

@miscco miscco enabled auto-merge (squash) September 19, 2024 10:50
Copy link
Contributor

🟨 CI finished in 5h 15m: Pass: 99%/433 | Total: 5d 00h | Avg: 16m 44s | Max: 1h 57m | Hits: 84%/41657
  • 🟨 libcudacxx: Pass: 98%/116 | Total: 1d 14h | Avg: 19m 52s | Max: 1h 23m | Hits: 62%/17017

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  98%/108 | Total:  1d 11h | Avg: 19m 55s | Max:  1h 23m | Hits:  62%/17017 
      🟩 arm64              Pass: 100%/8   | Total:  2h 33m | Avg: 19m 09s | Max: 27m 15s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  4h 07m | Avg: 16m 28s | Max: 24m 29s | Hits:  99%/2644  
      🟩 11.8               Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 30m 32s
      🔍 12.6               Pass:  97%/98  | Total:  1d 09h | Avg: 20m 12s | Max:  1h 23m | Hits:  56%/14373 
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 38m 07s | Avg: 19m 03s | Max: 19m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 07m | Avg: 16m 28s | Max: 24m 29s | Hits:  99%/2644  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 30m 32s
      🔍 nvcc12.6           Pass:  97%/96  | Total:  1d 08h | Avg: 20m 14s | Max:  1h 23m | Hits:  56%/14373 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 38m 07s | Avg: 19m 03s | Max: 19m 13s
      🔍 nvcc               Pass:  98%/114 | Total:  1d 13h | Avg: 19m 53s | Max:  1h 23m | Hits:  62%/17017 
    🔍 cxx: Clang18 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  2h 01m | Avg: 20m 17s | Max: 28m 37s
      🟩 Clang10            Pass: 100%/3   | Total: 34m 29s | Avg: 11m 29s | Max: 24m 13s
      🟩 Clang11            Pass: 100%/4   | Total: 55m 35s | Avg: 13m 53s | Max: 26m 49s
      🟩 Clang12            Pass: 100%/4   | Total: 58m 33s | Avg: 14m 38s | Max: 29m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 11m | Avg: 17m 54s | Max: 25m 18s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 38m | Avg: 24m 37s | Max: 30m 11s
      🟩 Clang15            Pass: 100%/4   | Total: 33m 32s | Avg:  8m 23s | Max: 20m 28s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 34m | Avg: 23m 41s | Max: 27m 09s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 15m | Avg: 18m 58s | Max: 26m 37s
      🔍 Clang18            Pass:  85%/14  | Total:  7h 28m | Avg: 32m 00s | Max:  1h 23m
      🟩 GCC6               Pass: 100%/2   | Total: 24m 49s | Avg: 12m 24s | Max: 22m 14s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 34m | Avg: 15m 42s | Max: 25m 35s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 13m | Avg: 12m 19s | Max: 22m 29s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 52m | Avg: 18m 49s | Max: 25m 56s
      🟩 GCC10              Pass: 100%/4   | Total: 54m 59s | Avg: 13m 44s | Max: 26m 03s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 12m | Avg: 18m 58s | Max: 30m 32s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 16m | Avg: 19m 08s | Max: 30m 47s
      🟩 GCC13              Pass: 100%/21  | Total:  6h 22m | Avg: 18m 12s | Max:  1h 22m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 18m | Avg: 26m 03s | Max: 30m 17s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s | Hits:  99%/2644  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 40s | Max: 38m 40s | Hits:  45%/5650  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 31m | Avg: 30m 28s | Max: 42m 05s | Hits:  63%/8723  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  96%/51  | Total: 18h 12m | Avg: 21m 25s | Max:  1h 23m
      🟩 GCC                Pass: 100%/56  | Total: 15h 52m | Avg: 17m 00s | Max:  1h 22m
      🟩 Intel              Pass: 100%/3   | Total:  1h 18m | Avg: 26m 03s | Max: 30m 17s
      🟩 MSVC               Pass: 100%/6   | Total:  3h 02m | Avg: 30m 23s | Max: 42m 05s | Hits:  62%/17017 
    🔍 jobs: Test 🔍
      🟩 Build              Pass: 100%/103 | Total:  1d 05h | Avg: 17m 10s | Max: 42m 05s | Hits:  62%/17017 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 46m | Avg: 26m 42s | Max: 43m 49s
      🔍 Test               Pass:  75%/8   | Total:  7h 07m | Avg: 53m 25s | Max:  1h 23m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🟨 std
      🟩 11                 Pass: 100%/30  | Total:  9h 04m | Avg: 18m 09s | Max: 55m 03s
      🟩 14                 Pass: 100%/33  | Total:  9h 12m | Avg: 16m 45s | Max:  1h 03m | Hits:  63%/8134  
      🟨 17                 Pass:  96%/32  | Total: 10h 42m | Avg: 20m 04s | Max:  1h 23m | Hits:  71%/5810  
      🟨 20                 Pass:  95%/20  | Total:  9h 23m | Avg: 28m 11s | Max:  1h 23m | Hits:  43%/3073  
    🟨 gpu
      🟨 v100               Pass:  98%/116 | Total:  1d 14h | Avg: 19m 52s | Max:  1h 23m | Hits:  62%/17017 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 30m 32s
      🟩 90a                Pass: 100%/4   | Total: 15m 12s | Avg:  3m 48s | Max:  4m 19s
    
  • 🟩 cub: Pass: 100%/136 | Total: 2d 16h | Avg: 28m 15s | Max: 1h 57m | Hits: 99%/4362

    🟩 cpu
      🟩 amd64              Pass: 100%/128 | Total:  2d 10h | Avg: 27m 25s | Max:  1h 57m | Hits:  99%/4362  
      🟩 arm64              Pass: 100%/8   | Total:  5h 32m | Avg: 41m 31s | Max: 45m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 11m | Avg: 28m 44s | Max: 32m 46s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 05m | Avg: 41m 45s | Max: 42m 05s
      🟩 12.6               Pass: 100%/118 | Total:  2d 06h | Avg: 27m 51s | Max:  1h 57m | Hits:  99%/3635  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 39m | Avg: 49m 58s | Max: 50m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 11m | Avg: 28m 44s | Max: 32m 46s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 05m | Avg: 41m 45s | Max: 42m 05s
      🟩 nvcc12.6           Pass: 100%/116 | Total:  2d 05h | Avg: 27m 28s | Max:  1h 57m | Hits:  99%/3635  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 39m | Avg: 49m 58s | Max: 50m 23s
      🟩 nvcc               Pass: 100%/134 | Total:  2d 14h | Avg: 27m 55s | Max:  1h 57m | Hits:  99%/4362  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 08m | Avg: 31m 23s | Max: 33m 58s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 39m | Avg: 33m 02s | Max: 33m 33s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 08s | Max: 35m 54s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 09m | Avg: 32m 15s | Max: 32m 19s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 13m | Avg: 33m 29s | Max: 35m 02s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 12s | Max: 32m 29s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 35m 39s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 36m 49s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 37s | Max: 36m 44s
      🟩 Clang18            Pass: 100%/26  | Total: 13h 01m | Avg: 30m 03s | Max: 50m 23s
      🟩 GCC6               Pass: 100%/2   | Total: 59m 18s | Avg: 29m 39s | Max: 31m 02s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 07m | Avg: 31m 15s | Max: 34m 17s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 06m | Avg: 31m 02s | Max: 34m 00s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 15m | Avg: 22m 38s | Max: 34m 49s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 12m | Avg: 33m 03s | Max: 36m 00s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 51m | Avg: 24m 31s | Max: 42m 05s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 09s | Avg:  4m 47s | Max:  5m 08s
      🟩 GCC13              Pass: 100%/29  | Total: 13h 37m | Avg: 28m 11s | Max:  1h 57m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 13m | Avg: 24m 24s | Max: 33m 52s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 16s | Avg: 13m 08s | Max: 13m 46s | Hits:  98%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 41m 39s | Avg: 13m 53s | Max: 14m 29s | Hits:  99%/2181  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/63  | Total:  1d 08h | Avg: 31m 22s | Max: 50m 23s
      🟩 GCC                Pass: 100%/64  | Total:  1d 04h | Avg: 26m 42s | Max:  1h 57m
      🟩 Intel              Pass: 100%/3   | Total:  1h 13m | Avg: 24m 24s | Max: 33m 52s
      🟩 MSVC               Pass: 100%/6   | Total:  1h 23m | Avg: 13m 51s | Max: 15m 16s | Hits:  99%/4362  
    🟩 gpu
      🟩 v100               Pass: 100%/136 | Total:  2d 16h | Avg: 28m 15s | Max:  1h 57m | Hits:  99%/4362  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  2d 00h | Avg: 28m 25s | Max: 50m 23s | Hits:  99%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 48m | Avg: 21m 02s | Max: 32m 13s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 42m | Avg: 20m 19s | Max: 27m 11s
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 36m | Avg: 27m 06s | Max: 39m 22s
      🟩 SmallGMem          Pass: 100%/1   | Total: 35m 51s | Avg: 35m 51s | Max: 35m 51s
      🟩 TestGPU            Pass: 100%/8   | Total:  5h 31m | Avg: 41m 25s | Max:  1h 57m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 05m | Avg: 41m 45s | Max: 42m 05s
      🟩 90a                Pass: 100%/4   | Total: 14m 22s | Avg:  3m 35s | Max:  3m 40s
    🟩 std
      🟩 11                 Pass: 100%/35  | Total: 17h 19m | Avg: 29m 41s | Max: 41m 18s
      🟩 14                 Pass: 100%/38  | Total: 17h 51m | Avg: 28m 12s | Max:  1h 57m | Hits:  98%/2181  
      🟩 17                 Pass: 100%/38  | Total: 17h 38m | Avg: 27m 51s | Max: 50m 23s | Hits:  99%/1454  
      🟩 20                 Pass: 100%/25  | Total: 11h 13m | Avg: 26m 56s | Max: 49m 34s | Hits:  99%/727   
    
  • 🟩 thrust: Pass: 100%/122 | Total: 15h 09m | Avg: 7m 27s | Max: 39m 30s | Hits: 99%/20070

    🟩 cpu
      🟩 amd64              Pass: 100%/114 | Total: 14h 32m | Avg:  7m 39s | Max: 39m 30s | Hits:  99%/20070 
      🟩 arm64              Pass: 100%/8   | Total: 36m 12s | Avg:  4m 31s | Max:  4m 53s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 19m 12s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 10s | Avg:  5m 03s | Max:  5m 25s
      🟩 12.6               Pass: 100%/104 | Total: 13h 39m | Avg:  7m 52s | Max: 39m 30s | Hits:  99%/17840 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 46s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 19m 12s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 10s | Avg:  5m 03s | Max:  5m 25s
      🟩 nvcc12.6           Pass: 100%/102 | Total: 13h 30m | Avg:  7m 56s | Max: 39m 30s | Hits:  99%/17840 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 46s
      🟩 nvcc               Pass: 100%/120 | Total: 14h 59m | Avg:  7m 29s | Max: 39m 30s | Hits:  99%/20070 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 51s | Avg:  4m 48s | Max:  6m 14s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 42s | Avg:  5m 54s | Max:  6m 15s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  5m 08s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  4m 42s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 17s | Avg:  4m 34s | Max:  5m 00s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 23s | Avg:  4m 50s | Max:  5m 15s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 01s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  4m 59s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 09s | Avg:  4m 47s | Max:  4m 55s
      🟩 Clang18            Pass: 100%/18  | Total:  2h 34m | Avg:  8m 34s | Max: 30m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 49s | Avg:  3m 54s | Max:  4m 21s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 18s | Avg:  4m 23s | Max:  5m 32s
      🟩 GCC8               Pass: 100%/6   | Total: 26m 35s | Avg:  4m 25s | Max:  5m 02s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 49s | Avg:  4m 18s | Max:  5m 14s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 37s | Avg:  4m 54s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/7   | Total: 35m 00s | Avg:  5m 00s | Max:  5m 26s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 50s | Avg:  5m 12s | Max:  5m 37s
      🟩 GCC13              Pass: 100%/20  | Total:  3h 33m | Avg: 10m 39s | Max: 39m 30s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 12s | Avg:  6m 04s | Max:  6m 13s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 33m 31s | Avg: 16m 45s | Max: 16m 53s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  2h 10m | Avg: 21m 46s | Max: 26m 01s | Hits:  99%/13380 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  5h 32m | Avg:  6m 02s | Max: 30m 34s
      🟩 GCC                Pass: 100%/55  | Total:  6h 14m | Avg:  6m 49s | Max: 39m 30s
      🟩 Intel              Pass: 100%/3   | Total: 18m 12s | Avg:  6m 04s | Max:  6m 13s
      🟩 MSVC               Pass: 100%/9   | Total:  3h 03m | Avg: 20m 22s | Max: 26m 01s | Hits:  99%/20070 
    🟩 gpu
      🟩 v100               Pass: 100%/122 | Total: 15h 09m | Avg:  7m 27s | Max: 39m 30s | Hits:  99%/20070 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  9h 28m | Avg:  5m 31s | Max: 19m 14s | Hits:  99%/13380 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 42m | Avg: 14m 48s | Max: 39m 30s | Hits:  99%/6690  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 57m | Avg: 22m 14s | Max: 30m 34s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 10s | Avg:  5m 03s | Max:  5m 25s
      🟩 90a                Pass: 100%/4   | Total: 16m 37s | Avg:  4m 09s | Max:  4m 18s
    🟩 std
      🟩 11                 Pass: 100%/31  | Total:  3h 10m | Avg:  6m 09s | Max: 30m 34s
      🟩 14                 Pass: 100%/35  | Total:  4h 12m | Avg:  7m 12s | Max: 24m 12s | Hits:  99%/8920  
      🟩 17                 Pass: 100%/34  | Total:  4h 43m | Avg:  8m 20s | Max: 39m 30s | Hits:  99%/6690  
      🟩 20                 Pass: 100%/22  | Total:  3h 02m | Avg:  8m 17s | Max: 26m 01s | Hits:  99%/4460  
    
  • 🟩 cudax: Pass: 100%/58 | Total: 2h 57m | Avg: 3m 03s | Max: 10m 55s | Hits: 80%/208

    🟩 cpu
      🟩 amd64              Pass: 100%/54  | Total:  2h 47m | Avg:  3m 05s | Max: 10m 55s | Hits:  80%/208   
      🟩 arm64              Pass: 100%/4   | Total: 10m 24s | Avg:  2m 36s | Max:  2m 51s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 11m | Avg:  3m 06s | Max: 10m 55s | Hits:  80%/104   
      🟩 12.6               Pass: 100%/35  | Total:  1h 45m | Avg:  3m 01s | Max: 10m 34s | Hits:  80%/104   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 11m | Avg:  3m 06s | Max: 10m 55s | Hits:  80%/104   
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1h 45m | Avg:  3m 01s | Max: 10m 34s | Hits:  80%/104   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/58  | Total:  2h 57m | Avg:  3m 03s | Max: 10m 55s | Hits:  80%/208   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 07s | Avg:  2m 33s | Max:  2m 34s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 13s | Avg:  2m 36s | Max:  2m 43s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 53s | Avg:  2m 28s | Max:  2m 40s
      🟩 Clang12            Pass: 100%/4   | Total: 10m 11s | Avg:  2m 32s | Max:  2m 46s
      🟩 Clang13            Pass: 100%/4   | Total: 10m 34s | Avg:  2m 38s | Max:  3m 15s
      🟩 Clang14            Pass: 100%/6   | Total: 18m 46s | Avg:  3m 07s | Max:  4m 33s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 05s | Avg:  2m 32s | Max:  2m 37s
      🟩 Clang16            Pass: 100%/4   | Total: 10m 24s | Avg:  2m 36s | Max:  2m 51s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 33s | Avg:  2m 46s | Max:  2m 47s
      🟩 Clang18            Pass: 100%/4   | Total: 13m 00s | Avg:  3m 15s | Max:  4m 07s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 28s | Avg:  2m 14s | Max:  2m 18s
      🟩 GCC10              Pass: 100%/4   | Total:  9m 30s | Avg:  2m 22s | Max:  2m 33s
      🟩 GCC11              Pass: 100%/4   | Total:  9m 05s | Avg:  2m 16s | Max:  2m 19s
      🟩 GCC12              Pass: 100%/9   | Total: 31m 44s | Avg:  3m 31s | Max:  5m 38s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 24s | Avg:  2m 28s | Max:  2m 32s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 55s | Avg: 10m 55s | Max: 10m 55s | Hits:  80%/104   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 34s | Avg: 10m 34s | Max: 10m 34s | Hits:  80%/104   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/34  | Total:  1h 33m | Avg:  2m 45s | Max:  4m 33s
      🟩 GCC                Pass: 100%/22  | Total:  1h 02m | Avg:  2m 49s | Max:  5m 38s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 29s | Avg: 10m 44s | Max: 10m 55s | Hits:  80%/208   
    🟩 gpu
      🟩 v100               Pass: 100%/58  | Total:  2h 57m | Avg:  3m 03s | Max: 10m 55s | Hits:  80%/208   
    🟩 jobs
      🟩 Build              Pass: 100%/50  | Total:  2h 22m | Avg:  2m 51s | Max: 10m 55s | Hits:  80%/208   
      🟩 Test               Pass: 100%/8   | Total: 34m 51s | Avg:  4m 21s | Max:  5m 38s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 90a                Pass: 100%/1   | Total:  2m 25s | Avg:  2m 25s | Max:  2m 25s
    🟩 std
      🟩 17                 Pass: 100%/32  | Total:  1h 26m | Avg:  2m 42s | Max:  4m 25s
      🟩 20                 Pass: 100%/26  | Total:  1h 30m | Avg:  3m 28s | Max: 10m 55s | Hits:  80%/208   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 433)

# Runner
320 linux-amd64-cpu16
62 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 7h 47m: Pass: 100%/433 | Total: 4d 23h | Avg: 16m 35s | Max: 1h 57m | Hits: 84%/41657
  • 🟩 cub: Pass: 100%/136 | Total: 2d 16h | Avg: 28m 15s | Max: 1h 57m | Hits: 99%/4362

    🟩 cpu
      🟩 amd64              Pass: 100%/128 | Total:  2d 10h | Avg: 27m 25s | Max:  1h 57m | Hits:  99%/4362  
      🟩 arm64              Pass: 100%/8   | Total:  5h 32m | Avg: 41m 31s | Max: 45m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 11m | Avg: 28m 44s | Max: 32m 46s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 05m | Avg: 41m 45s | Max: 42m 05s
      🟩 12.6               Pass: 100%/118 | Total:  2d 06h | Avg: 27m 51s | Max:  1h 57m | Hits:  99%/3635  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 39m | Avg: 49m 58s | Max: 50m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 11m | Avg: 28m 44s | Max: 32m 46s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 05m | Avg: 41m 45s | Max: 42m 05s
      🟩 nvcc12.6           Pass: 100%/116 | Total:  2d 05h | Avg: 27m 28s | Max:  1h 57m | Hits:  99%/3635  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 39m | Avg: 49m 58s | Max: 50m 23s
      🟩 nvcc               Pass: 100%/134 | Total:  2d 14h | Avg: 27m 55s | Max:  1h 57m | Hits:  99%/4362  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 08m | Avg: 31m 23s | Max: 33m 58s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 39m | Avg: 33m 02s | Max: 33m 33s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 16m | Avg: 34m 08s | Max: 35m 54s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 09m | Avg: 32m 15s | Max: 32m 19s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 13m | Avg: 33m 29s | Max: 35m 02s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 12s | Max: 32m 29s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 35m 39s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 36m 49s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 37s | Max: 36m 44s
      🟩 Clang18            Pass: 100%/26  | Total: 13h 01m | Avg: 30m 03s | Max: 50m 23s
      🟩 GCC6               Pass: 100%/2   | Total: 59m 18s | Avg: 29m 39s | Max: 31m 02s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 07m | Avg: 31m 15s | Max: 34m 17s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 06m | Avg: 31m 02s | Max: 34m 00s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 15m | Avg: 22m 38s | Max: 34m 49s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 12m | Avg: 33m 03s | Max: 36m 00s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 51m | Avg: 24m 31s | Max: 42m 05s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 09s | Avg:  4m 47s | Max:  5m 08s
      🟩 GCC13              Pass: 100%/29  | Total: 13h 37m | Avg: 28m 11s | Max:  1h 57m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 13m | Avg: 24m 24s | Max: 33m 52s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 16s | Avg: 13m 08s | Max: 13m 46s | Hits:  98%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 41m 39s | Avg: 13m 53s | Max: 14m 29s | Hits:  99%/2181  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/63  | Total:  1d 08h | Avg: 31m 22s | Max: 50m 23s
      🟩 GCC                Pass: 100%/64  | Total:  1d 04h | Avg: 26m 42s | Max:  1h 57m
      🟩 Intel              Pass: 100%/3   | Total:  1h 13m | Avg: 24m 24s | Max: 33m 52s
      🟩 MSVC               Pass: 100%/6   | Total:  1h 23m | Avg: 13m 51s | Max: 15m 16s | Hits:  99%/4362  
    🟩 gpu
      🟩 v100               Pass: 100%/136 | Total:  2d 16h | Avg: 28m 15s | Max:  1h 57m | Hits:  99%/4362  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  2d 00h | Avg: 28m 25s | Max: 50m 23s | Hits:  99%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 48m | Avg: 21m 02s | Max: 32m 13s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 42m | Avg: 20m 19s | Max: 27m 11s
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 36m | Avg: 27m 06s | Max: 39m 22s
      🟩 SmallGMem          Pass: 100%/1   | Total: 35m 51s | Avg: 35m 51s | Max: 35m 51s
      🟩 TestGPU            Pass: 100%/8   | Total:  5h 31m | Avg: 41m 25s | Max:  1h 57m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 05m | Avg: 41m 45s | Max: 42m 05s
      🟩 90a                Pass: 100%/4   | Total: 14m 22s | Avg:  3m 35s | Max:  3m 40s
    🟩 std
      🟩 11                 Pass: 100%/35  | Total: 17h 19m | Avg: 29m 41s | Max: 41m 18s
      🟩 14                 Pass: 100%/38  | Total: 17h 51m | Avg: 28m 12s | Max:  1h 57m | Hits:  98%/2181  
      🟩 17                 Pass: 100%/38  | Total: 17h 38m | Avg: 27m 51s | Max: 50m 23s | Hits:  99%/1454  
      🟩 20                 Pass: 100%/25  | Total: 11h 13m | Avg: 26m 56s | Max: 49m 34s | Hits:  99%/727   
    
  • 🟩 thrust: Pass: 100%/122 | Total: 15h 09m | Avg: 7m 27s | Max: 39m 30s | Hits: 99%/20070

    🟩 cpu
      🟩 amd64              Pass: 100%/114 | Total: 14h 32m | Avg:  7m 39s | Max: 39m 30s | Hits:  99%/20070 
      🟩 arm64              Pass: 100%/8   | Total: 36m 12s | Avg:  4m 31s | Max:  4m 53s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 19m 12s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 10s | Avg:  5m 03s | Max:  5m 25s
      🟩 12.6               Pass: 100%/104 | Total: 13h 39m | Avg:  7m 52s | Max: 39m 30s | Hits:  99%/17840 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 46s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 19m 12s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 10s | Avg:  5m 03s | Max:  5m 25s
      🟩 nvcc12.6           Pass: 100%/102 | Total: 13h 30m | Avg:  7m 56s | Max: 39m 30s | Hits:  99%/17840 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 46s
      🟩 nvcc               Pass: 100%/120 | Total: 14h 59m | Avg:  7m 29s | Max: 39m 30s | Hits:  99%/20070 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 51s | Avg:  4m 48s | Max:  6m 14s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 42s | Avg:  5m 54s | Max:  6m 15s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  5m 08s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  4m 42s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 17s | Avg:  4m 34s | Max:  5m 00s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 23s | Avg:  4m 50s | Max:  5m 15s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 01s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  4m 59s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 09s | Avg:  4m 47s | Max:  4m 55s
      🟩 Clang18            Pass: 100%/18  | Total:  2h 34m | Avg:  8m 34s | Max: 30m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 49s | Avg:  3m 54s | Max:  4m 21s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 18s | Avg:  4m 23s | Max:  5m 32s
      🟩 GCC8               Pass: 100%/6   | Total: 26m 35s | Avg:  4m 25s | Max:  5m 02s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 49s | Avg:  4m 18s | Max:  5m 14s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 37s | Avg:  4m 54s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/7   | Total: 35m 00s | Avg:  5m 00s | Max:  5m 26s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 50s | Avg:  5m 12s | Max:  5m 37s
      🟩 GCC13              Pass: 100%/20  | Total:  3h 33m | Avg: 10m 39s | Max: 39m 30s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 12s | Avg:  6m 04s | Max:  6m 13s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 33m 31s | Avg: 16m 45s | Max: 16m 53s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  2h 10m | Avg: 21m 46s | Max: 26m 01s | Hits:  99%/13380 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  5h 32m | Avg:  6m 02s | Max: 30m 34s
      🟩 GCC                Pass: 100%/55  | Total:  6h 14m | Avg:  6m 49s | Max: 39m 30s
      🟩 Intel              Pass: 100%/3   | Total: 18m 12s | Avg:  6m 04s | Max:  6m 13s
      🟩 MSVC               Pass: 100%/9   | Total:  3h 03m | Avg: 20m 22s | Max: 26m 01s | Hits:  99%/20070 
    🟩 gpu
      🟩 v100               Pass: 100%/122 | Total: 15h 09m | Avg:  7m 27s | Max: 39m 30s | Hits:  99%/20070 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  9h 28m | Avg:  5m 31s | Max: 19m 14s | Hits:  99%/13380 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 42m | Avg: 14m 48s | Max: 39m 30s | Hits:  99%/6690  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 57m | Avg: 22m 14s | Max: 30m 34s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 10s | Avg:  5m 03s | Max:  5m 25s
      🟩 90a                Pass: 100%/4   | Total: 16m 37s | Avg:  4m 09s | Max:  4m 18s
    🟩 std
      🟩 11                 Pass: 100%/31  | Total:  3h 10m | Avg:  6m 09s | Max: 30m 34s
      🟩 14                 Pass: 100%/35  | Total:  4h 12m | Avg:  7m 12s | Max: 24m 12s | Hits:  99%/8920  
      🟩 17                 Pass: 100%/34  | Total:  4h 43m | Avg:  8m 20s | Max: 39m 30s | Hits:  99%/6690  
      🟩 20                 Pass: 100%/22  | Total:  3h 02m | Avg:  8m 17s | Max: 26m 01s | Hits:  99%/4460  
    
  • 🟩 libcudacxx: Pass: 100%/116 | Total: 1d 13h | Avg: 19m 19s | Max: 1h 22m | Hits: 62%/17017

    🟩 cpu
      🟩 amd64              Pass: 100%/108 | Total:  1d 10h | Avg: 19m 20s | Max:  1h 22m | Hits:  62%/17017 
      🟩 arm64              Pass: 100%/8   | Total:  2h 33m | Avg: 19m 09s | Max: 27m 15s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  4h 07m | Avg: 16m 28s | Max: 24m 29s | Hits:  99%/2644  
      🟩 11.8               Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 30m 32s
      🟩 12.6               Pass: 100%/98  | Total:  1d 07h | Avg: 19m 33s | Max:  1h 22m | Hits:  56%/14373 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 38m 07s | Avg: 19m 03s | Max: 19m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 07m | Avg: 16m 28s | Max: 24m 29s | Hits:  99%/2644  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 30m 32s
      🟩 nvcc12.6           Pass: 100%/96  | Total:  1d 07h | Avg: 19m 34s | Max:  1h 22m | Hits:  56%/14373 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 38m 07s | Avg: 19m 03s | Max: 19m 13s
      🟩 nvcc               Pass: 100%/114 | Total:  1d 12h | Avg: 19m 19s | Max:  1h 22m | Hits:  62%/17017 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 01m | Avg: 20m 17s | Max: 28m 37s
      🟩 Clang10            Pass: 100%/3   | Total: 34m 29s | Avg: 11m 29s | Max: 24m 13s
      🟩 Clang11            Pass: 100%/4   | Total: 55m 35s | Avg: 13m 53s | Max: 26m 49s
      🟩 Clang12            Pass: 100%/4   | Total: 58m 33s | Avg: 14m 38s | Max: 29m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 11m | Avg: 17m 54s | Max: 25m 18s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 38m | Avg: 24m 37s | Max: 30m 11s
      🟩 Clang15            Pass: 100%/4   | Total: 33m 32s | Avg:  8m 23s | Max: 20m 28s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 34m | Avg: 23m 41s | Max: 27m 09s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 15m | Avg: 18m 58s | Max: 26m 37s
      🟩 Clang18            Pass: 100%/14  | Total:  6h 23m | Avg: 27m 24s | Max:  1h 03m
      🟩 GCC6               Pass: 100%/2   | Total: 24m 49s | Avg: 12m 24s | Max: 22m 14s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 34m | Avg: 15m 42s | Max: 25m 35s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 13m | Avg: 12m 19s | Max: 22m 29s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 52m | Avg: 18m 49s | Max: 25m 56s
      🟩 GCC10              Pass: 100%/4   | Total: 54m 59s | Avg: 13m 44s | Max: 26m 03s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 12m | Avg: 18m 58s | Max: 30m 32s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 16m | Avg: 19m 08s | Max: 30m 47s
      🟩 GCC13              Pass: 100%/21  | Total:  6h 22m | Avg: 18m 12s | Max:  1h 22m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 18m | Avg: 26m 03s | Max: 30m 17s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 33s | Avg: 19m 33s | Max: 19m 33s | Hits:  99%/2644  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 40s | Max: 38m 40s | Hits:  45%/5650  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 31m | Avg: 30m 28s | Max: 42m 05s | Hits:  63%/8723  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 17h 08m | Avg: 20m 09s | Max:  1h 03m
      🟩 GCC                Pass: 100%/56  | Total: 15h 52m | Avg: 17m 00s | Max:  1h 22m
      🟩 Intel              Pass: 100%/3   | Total:  1h 18m | Avg: 26m 03s | Max: 30m 17s
      🟩 MSVC               Pass: 100%/6   | Total:  3h 02m | Avg: 30m 23s | Max: 42m 05s | Hits:  62%/17017 
    🟩 gpu
      🟩 v100               Pass: 100%/116 | Total:  1d 13h | Avg: 19m 19s | Max:  1h 22m | Hits:  62%/17017 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  1d 05h | Avg: 17m 10s | Max: 42m 05s | Hits:  62%/17017 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 46m | Avg: 26m 42s | Max: 43m 49s
      🟩 Test               Pass: 100%/8   | Total:  6h 03m | Avg: 45m 23s | Max:  1h 22m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 17m | Avg: 25m 54s | Max: 30m 32s
      🟩 90a                Pass: 100%/4   | Total: 15m 12s | Avg:  3m 48s | Max:  4m 19s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  9h 04m | Avg: 18m 09s | Max: 55m 03s
      🟩 14                 Pass: 100%/33  | Total:  9h 12m | Avg: 16m 45s | Max:  1h 03m | Hits:  63%/8134  
      🟩 17                 Pass: 100%/32  | Total: 10h 04m | Avg: 18m 53s | Max: 46m 12s | Hits:  71%/5810  
      🟩 20                 Pass: 100%/20  | Total:  8h 57m | Avg: 26m 52s | Max:  1h 22m | Hits:  43%/3073  
    
  • 🟩 cudax: Pass: 100%/58 | Total: 2h 57m | Avg: 3m 03s | Max: 10m 55s | Hits: 80%/208

    🟩 cpu
      🟩 amd64              Pass: 100%/54  | Total:  2h 47m | Avg:  3m 05s | Max: 10m 55s | Hits:  80%/208   
      🟩 arm64              Pass: 100%/4   | Total: 10m 24s | Avg:  2m 36s | Max:  2m 51s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 11m | Avg:  3m 06s | Max: 10m 55s | Hits:  80%/104   
      🟩 12.6               Pass: 100%/35  | Total:  1h 45m | Avg:  3m 01s | Max: 10m 34s | Hits:  80%/104   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 11m | Avg:  3m 06s | Max: 10m 55s | Hits:  80%/104   
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1h 45m | Avg:  3m 01s | Max: 10m 34s | Hits:  80%/104   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/58  | Total:  2h 57m | Avg:  3m 03s | Max: 10m 55s | Hits:  80%/208   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  5m 07s | Avg:  2m 33s | Max:  2m 34s
      🟩 Clang10            Pass: 100%/2   | Total:  5m 13s | Avg:  2m 36s | Max:  2m 43s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 53s | Avg:  2m 28s | Max:  2m 40s
      🟩 Clang12            Pass: 100%/4   | Total: 10m 11s | Avg:  2m 32s | Max:  2m 46s
      🟩 Clang13            Pass: 100%/4   | Total: 10m 34s | Avg:  2m 38s | Max:  3m 15s
      🟩 Clang14            Pass: 100%/6   | Total: 18m 46s | Avg:  3m 07s | Max:  4m 33s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 05s | Avg:  2m 32s | Max:  2m 37s
      🟩 Clang16            Pass: 100%/4   | Total: 10m 24s | Avg:  2m 36s | Max:  2m 51s
      🟩 Clang17            Pass: 100%/2   | Total:  5m 33s | Avg:  2m 46s | Max:  2m 47s
      🟩 Clang18            Pass: 100%/4   | Total: 13m 00s | Avg:  3m 15s | Max:  4m 07s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 28s | Avg:  2m 14s | Max:  2m 18s
      🟩 GCC10              Pass: 100%/4   | Total:  9m 30s | Avg:  2m 22s | Max:  2m 33s
      🟩 GCC11              Pass: 100%/4   | Total:  9m 05s | Avg:  2m 16s | Max:  2m 19s
      🟩 GCC12              Pass: 100%/9   | Total: 31m 44s | Avg:  3m 31s | Max:  5m 38s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 24s | Avg:  2m 28s | Max:  2m 32s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 55s | Avg: 10m 55s | Max: 10m 55s | Hits:  80%/104   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 34s | Avg: 10m 34s | Max: 10m 34s | Hits:  80%/104   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/34  | Total:  1h 33m | Avg:  2m 45s | Max:  4m 33s
      🟩 GCC                Pass: 100%/22  | Total:  1h 02m | Avg:  2m 49s | Max:  5m 38s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 29s | Avg: 10m 44s | Max: 10m 55s | Hits:  80%/208   
    🟩 gpu
      🟩 v100               Pass: 100%/58  | Total:  2h 57m | Avg:  3m 03s | Max: 10m 55s | Hits:  80%/208   
    🟩 jobs
      🟩 Build              Pass: 100%/50  | Total:  2h 22m | Avg:  2m 51s | Max: 10m 55s | Hits:  80%/208   
      🟩 Test               Pass: 100%/8   | Total: 34m 51s | Avg:  4m 21s | Max:  5m 38s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 90a                Pass: 100%/1   | Total:  2m 25s | Avg:  2m 25s | Max:  2m 25s
    🟩 std
      🟩 17                 Pass: 100%/32  | Total:  1h 26m | Avg:  2m 42s | Max:  4m 25s
      🟩 20                 Pass: 100%/26  | Total:  1h 30m | Avg:  3m 28s | Max: 10m 55s | Hits:  80%/208   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 59s | Avg: 14m 59s | Max: 14m 59s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 433)

# Runner
320 linux-amd64-cpu16
62 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

@miscco miscco merged commit b07f036 into NVIDIA:main Sep 19, 2024
446 checks passed
@miscco miscco deleted the expand_ceil_div branch September 19, 2024 15:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

Replace cub::DivideAndRoundUp by cuda::ceil_div
3 participants