As we kept simplifying the reproducible reduction kernel, we removed the code path that processes incomplete tiles. As a result, the current code returns incorrect results for inputs whose size is not a multiple of four (the vector size).
This issue can be closed by:
a passing test case that runs the reproducible reduction on an odd number of elements
an NVBench result illustrating that this change hasn't introduced performance regressions
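
To make the failure mode concrete, here is a minimal host-side C++ sketch, not the actual kernel. It assumes the problem reduces to a loop that consumes four elements at a time; the names `vector_size`, `sum_full_vectors_only`, and `sum_with_tail` are hypothetical and exist only for illustration. The first function drops the elements past the last complete group of four, which is roughly the behavior described above; the second adds a scalar pass over the remainder.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical constant mirroring the vector size mentioned in the issue.
constexpr std::size_t vector_size = 4;

// Sums only complete groups of four elements and ignores any remainder.
// This models the missing incomplete-tile path; it is not the real kernel.
float sum_full_vectors_only(const std::vector<float>& in) {
  const std::size_t full = in.size() / vector_size * vector_size;
  float acc = 0.0f;
  for (std::size_t i = 0; i < full; i += vector_size) {
    // Fixed association order within each group of four.
    acc += (in[i] + in[i + 1]) + (in[i + 2] + in[i + 3]);
  }
  return acc;
}

// Same as above, followed by a scalar loop over the leftover elements.
float sum_with_tail(const std::vector<float>& in) {
  const std::size_t full = in.size() / vector_size * vector_size;
  float acc = sum_full_vectors_only(in);
  for (std::size_t i = full; i < in.size(); ++i) {
    acc += in[i];
  }
  return acc;
}
```

With seven input elements, `sum_full_vectors_only` covers only the first four, while `sum_with_tail` covers all seven. A test on an odd number of elements, as requested above, would catch this kind of difference.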