CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess > Struct Template Reference
#include <pitch_linear_thread_map.h>
Classes
| struct | Detail |
| | Internal details made public to facilitate introspection.
Public Types
| using | TensorCoord = layout::PitchLinearCoord |
| | Tensor coordinate.
| using | Shape = Shape_ |
| | Tile shape.
| using | ThreadAccessShape = layout::PitchLinearShape< kElementsPerAccess, 1 > |
| | Shape of access by each thread.
| using | Iterations = layout::PitchLinearShape< Detail::WarpAccessIterations::kContiguous / Detail::kWarpsContiguous, Detail::WarpAccessIterations::kStrided / Detail::kWarpsStrided > |
| | Iterations along each dimension (concept: PitchLinearShape).
| using | Delta = layout::PitchLinearShape< Detail::WarpThreadArrangement::kContiguous * kElementsPerAccess, Detail::WarpThreadArrangement::kStrided * Detail::WarpArrangement::kStrided > |
| | Delta between accesses (units of elements, concept: PitchLinearShape).
Static Public Member Functions
| static CUTLASS_HOST_DEVICE TensorCoord | initial_offset (int thread_id) |
| | Maps thread ID to a coordinate offset within the tensor's logical coordinate space.
Static Public Attributes
| static int const | kThreads = Threads |
| | Total number of threads.
| static int const | kElementsPerAccess = ElementsPerAccess |
| | Vector length: number of elements accessed by each thread per memory operation.
Policy defining a warp-striped arrangement. This partitions a tile into vectorized memory accesses performed by each warp, then distributes warps across them. Warps are striped in the strided dimension and raked across the contiguous dimension.
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
using cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::Delta = layout::PitchLinearShape< Detail::WarpThreadArrangement::kContiguous * kElementsPerAccess, Detail::WarpThreadArrangement::kStrided * Detail::WarpArrangement::kStrided >
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
using cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::Iterations = layout::PitchLinearShape< Detail::WarpAccessIterations::kContiguous / Detail::kWarpsContiguous, Detail::WarpAccessIterations::kStrided / Detail::kWarpsStrided >
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
using cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::Shape = Shape_
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
using cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::TensorCoord = layout::PitchLinearCoord
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
using cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::ThreadAccessShape = layout::PitchLinearShape< kElementsPerAccess, 1 >
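The Iterations and Delta typedefs are plain compile-time arithmetic over members of Detail, so they can be checked by hand. The following is a minimal sketch, not CUTLASS code: PitchLinearShape here is a stand-in for layout::PitchLinearShape, and every value inside Detail is a hand-picked assumption (a 64 x 8 element tile, two 32-thread warps arranged 8 x 4, 2 elements per access), because this page does not show how Detail derives them. Only the two expressions for Iterations and Delta are taken from the documentation.

```cpp
#include <cstdio>

// Minimal stand-in for cutlass::layout::PitchLinearShape (illustration only).
template <int Contiguous, int Strided>
struct PitchLinearShape {
  static constexpr int kContiguous = Contiguous;
  static constexpr int kStrided = Strided;
};

// Hypothetical, hand-picked Detail values. In the real struct these are
// derived from the template parameters; here they are assumed inputs.
struct Detail {
  using WarpThreadArrangement = PitchLinearShape<8, 4>;  // lanes of one warp: 8 x 4
  using WarpAccessIterations = PitchLinearShape<4, 2>;   // warp-level accesses per tile
  static constexpr int kWarpsContiguous = 1;
  static constexpr int kWarpsStrided = 2;
  using WarpArrangement = PitchLinearShape<kWarpsContiguous, kWarpsStrided>;
};

// Assumed vector length (ElementsPerAccess).
constexpr int kElementsPerAccess = 2;

// Same expressions as the Iterations and Delta typedefs documented above.
using Iterations = PitchLinearShape<
    Detail::WarpAccessIterations::kContiguous / Detail::kWarpsContiguous,
    Detail::WarpAccessIterations::kStrided / Detail::kWarpsStrided>;

using Delta = PitchLinearShape<
    Detail::WarpThreadArrangement::kContiguous * kElementsPerAccess,
    Detail::WarpThreadArrangement::kStrided * Detail::WarpArrangement::kStrided>;

// Compile-time checks of the arithmetic for the assumed inputs.
static_assert(Iterations::kContiguous == 4 && Iterations::kStrided == 1,
              "unexpected Iterations for the assumed Detail values");
static_assert(Delta::kContiguous == 16 && Delta::kStrided == 8,
              "unexpected Delta for the assumed Detail values");

// Prints the derived shapes; call from main() to inspect them.
inline void print_shapes() {
  std::printf("Iterations = <%d, %d>\n", Iterations::kContiguous, Iterations::kStrided);
  std::printf("Delta      = <%d, %d>\n", Delta::kContiguous, Delta::kStrided);
}
```

With these assumed inputs, each thread performs 4 accesses along the contiguous dimension, spaced 16 elements apart, and 1 access along the strided dimension.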
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
static CUTLASS_HOST_DEVICE TensorCoord cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::initial_offset (int thread_id) [inline, static]
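A thread map of this kind is typically consumed by starting at initial_offset(thread_id) and stepping Iterations times in each dimension, Delta elements at a time. The sketch below illustrates that pattern with a self-contained toy map. Everything in ToyThreadMap is hypothetical: the constants are hand-picked for a 64 x 8 tile, and the body of initial_offset is an assumed mapping chosen to match the description of warps striped along the strided dimension. It is not copied from pitch_linear_thread_map.h.

```cpp
#include <array>
#include <cassert>

// Stand-in for layout::PitchLinearCoord (illustration only).
struct Coord {
  int contiguous;
  int strided;
};

// Toy thread map with hand-picked values; NOT the CUTLASS definition.
struct ToyThreadMap {
  static constexpr int kThreads = 64;
  static constexpr int kElementsPerAccess = 2;
  static constexpr int kWarpSize = 32;        // assumed warp size
  static constexpr int kLanesContiguous = 8;  // assumed 8 x 4 lane arrangement
  static constexpr int kLanesStrided = 4;

  struct Iterations {
    static constexpr int kContiguous = 4;
    static constexpr int kStrided = 1;
  };
  struct Delta {
    static constexpr int kContiguous = 16;
    static constexpr int kStrided = 8;
  };

  // Assumed mapping: lanes fill an 8 x 4 patch of vectors, and each warp
  // takes its own stripe of rows along the strided dimension.
  static Coord initial_offset(int thread_id) {
    int warp_id = thread_id / kWarpSize;
    int lane_id = thread_id % kWarpSize;
    return Coord{(lane_id % kLanesContiguous) * kElementsPerAccess,
                 warp_id * kLanesStrided + lane_id / kLanesContiguous};
  }
};

// Counts how often each element of a TileContiguous x TileStrided tile is
// touched when every thread walks its Iterations x Delta grid.
template <typename ThreadMap, int TileContiguous, int TileStrided>
std::array<int, TileContiguous * TileStrided> count_accesses() {
  std::array<int, TileContiguous * TileStrided> hits{};
  for (int t = 0; t < ThreadMap::kThreads; ++t) {
    Coord base = ThreadMap::initial_offset(t);
    for (int s = 0; s < ThreadMap::Iterations::kStrided; ++s) {
      for (int c = 0; c < ThreadMap::Iterations::kContiguous; ++c) {
        int row = base.strided + s * ThreadMap::Delta::kStrided;
        int col = base.contiguous + c * ThreadMap::Delta::kContiguous;
        // Each access covers kElementsPerAccess consecutive elements.
        for (int v = 0; v < ThreadMap::kElementsPerAccess; ++v) {
          assert(row < TileStrided && col + v < TileContiguous);
          ++hits[row * TileContiguous + col + v];
        }
      }
    }
  }
  return hits;
}
```

Calling count_accesses<ToyThreadMap, 64, 8>() yields a count of 1 for every element, meaning the toy map covers the tile exactly once with no overlap. That is a property of the values assumed here, not a statement about every instantiation of the real template.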
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
static int const cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::kElementsPerAccess = ElementsPerAccess
template<typename Shape_, int Threads, typename WarpThreadArrangement_, int ElementsPerAccess = 1>
static int const cutlass::transform::PitchLinearWarpStripedThreadMap< Shape_, Threads, WarpThreadArrangement_, ElementsPerAccess >::kThreads = Threads
The documentation for this struct was generated from the following file: pitch_linear_thread_map.h