CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
cutlass::gemm::threadblock::GemmSplitKIdentityThreadblockSwizzle Struct Reference
Threadblock swizzling function for split-K GEMMs.
#include <threadblock_swizzle.h>
Public Member Functions

| CUTLASS_HOST_DEVICE GemmCoord | get_tiled_shape (GemmCoord problem_size, GemmCoord tile_size, int partitions) const |
| | Returns the shape of the problem in units of logical tiles. |
| CUTLASS_HOST_DEVICE dim3 | get_grid_shape (GemmCoord tiled_shape) const |
| | Computes CUDA grid dimensions given a size in units of logical tiles. |
| CUTLASS_DEVICE GemmCoord | get_tile_offset () const |
| | Obtains the threadblock offset (in units of threadblock-scoped tiles). |

CUTLASS_HOST_DEVICE dim3 cutlass::gemm::threadblock::GemmSplitKIdentityThreadblockSwizzle::get_grid_shape (GemmCoord tiled_shape) const [inline]

CUTLASS_DEVICE GemmCoord cutlass::gemm::threadblock::GemmSplitKIdentityThreadblockSwizzle::get_tile_offset () const [inline]

CUTLASS_HOST_DEVICE GemmCoord cutlass::gemm::threadblock::GemmSplitKIdentityThreadblockSwizzle::get_tiled_shape (GemmCoord problem_size, GemmCoord tile_size, int partitions) const [inline]
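
The member descriptions above are terse, so the following host-only sketch illustrates how the three functions might fit together. It is not the CUTLASS source. `Coord3` and `Grid3` are hypothetical stand-ins for `GemmCoord` and `dim3`, and the `sketch_*` functions are illustrative names. The sketch assumes that the tiled shape is the ceiling of the problem extent divided by the tile extent along M and N, that `partitions` is carried in the K extent, and that the "identity" mapping places tiles on the grid one-to-one (M to x, N to y, K partition to z). None of these assumptions is confirmed by this page.

```cpp
#include <cassert>

// Hypothetical stand-in for cutlass::gemm::GemmCoord (extents along M, N, K).
struct Coord3 {
  int m, n, k;
};

// Hypothetical stand-in for CUDA's dim3.
struct Grid3 {
  unsigned x, y, z;
};

// Ceiling division for positive integers.
constexpr int ceil_div(int a, int b) { return (a + b - 1) / b; }

// Sketch of get_tiled_shape: tiles needed along M and N, with the number of
// split-K partitions stored in the K extent (assumption).
constexpr Coord3 sketch_tiled_shape(Coord3 problem, Coord3 tile, int partitions) {
  return Coord3{ceil_div(problem.m, tile.m), ceil_div(problem.n, tile.n), partitions};
}

// Sketch of get_grid_shape: one threadblock per logical tile, mapped
// M -> x, N -> y, K partition -> z (assumed identity mapping).
constexpr Grid3 sketch_grid_shape(Coord3 tiled) {
  return Grid3{static_cast<unsigned>(tiled.m),
               static_cast<unsigned>(tiled.n),
               static_cast<unsigned>(tiled.k)};
}

// Sketch of get_tile_offset: on the device this would presumably read the
// built-in block index; here the block index is passed in explicitly.
constexpr Coord3 sketch_tile_offset(Grid3 block_idx) {
  return Coord3{static_cast<int>(block_idx.x),
                static_cast<int>(block_idx.y),
                static_cast<int>(block_idx.z)};
}

// Runtime check of a worked example: a 1000 x 500 x 4096 problem with
// 128 x 128 x 8 tiles and 4 split-K partitions.
inline void check_example() {
  const Coord3 tiled = sketch_tiled_shape({1000, 500, 4096}, {128, 128, 8}, 4);
  const Grid3 grid = sketch_grid_shape(tiled);
  assert(tiled.m == 8 && tiled.n == 4 && tiled.k == 4);
  assert(grid.x == 8u && grid.y == 4u && grid.z == 4u);
}
```

Under these assumptions, a host would call the equivalent of `get_tiled_shape` and then `get_grid_shape` to size the kernel launch, and each threadblock would call `get_tile_offset` to find which output tile and which K partition it owns.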
The documentation for this struct was generated from the following file: