Skip to content

Commit

Permalink
Update docs
Browse files Browse the repository at this point in the history
  • Loading branch information
ahendriksen committed Sep 12, 2023
1 parent 1443a90 commit 331ed7c
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 5 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -63,13 +63,16 @@ Under CUDA Compute Capability 6 (Pascal) or prior, an object of type

## Shared memory barriers with transaction count

Starting with the Hopper architecture (CUDA Compute Capability 9), a
`cuda::barrier` object located in shared memory supports a new count, called
In addition to the arrival count, a `cuda::barrier<thread_scope_block>` object
located in shared memory supports a
[tx-count](https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#tracking-asynchronous-operations-by-the-mbarrier-object),
which is used for tracking the completion of asynchronous memory operations or
transactions. The tx-count tracks the number of asynchronous transactions, in
which is used for tracking the completion of some asynchronous memory operations or
transactions.
The tx-count tracks the number of asynchronous transactions, in
units specified by the asynchronous memory operation (typically bytes), that are
outstanding and yet to be complete.
This capability is exposed, starting with the Hopper architecture (CUDA Compute
Capability 9).

The tx-count of `cuda::barrier` must be set to the total amount of asynchronous
memory operations, in units as specified by the asynchronous operations, to be
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ transaction count.
## Preconditions

* `__isShared(&bar) == true`
* `1 <= arrive_count_update`
* `1 <= arrive_count_update && transaction_count_update <= (1 << 20) - 1`
* `0 <= transaction_count_update && transaction_count_update <= (1 << 20) - 1`


Expand Down

0 comments on commit 331ed7c

Please sign in to comment.