Back to Cutlass

CUTLASS: default_mma_core_sm50.h File Reference

docs/default__mma__core__sm50_8h.html

4.4.22.0 KB
Original Source

| | CUTLASS

CUDA Templates for Linear Algebra Subroutines and Solvers |

Classes | Namespaces

default_mma_core_sm50.h File Reference

Defines basic properties needed by CTA-level GEMMs assuming expectations about data layout of the global memory fragments, data types, and internal tile sizes. More...

#include "cutlass/cutlass.h"
#include "cutlass/array.h"
#include "cutlass/numeric_types.h"
#include "cutlass/matrix_shape.h"
#include "cutlass/layout/matrix.h"
#include "cutlass/transform/pitch_linear_thread_map.h"
#include "cutlass/transform/threadblock/regular_tile_iterator.h"
#include "cutlass/gemm/warp/mma_simt.h"
#include "cutlass/gemm/threadblock/default_mma_core.h"

Include dependency graph for default_mma_core_sm50.h:

[Go to the source code of this file.](default mma core__sm50_8h_source.html)

|

Classes

| | struct | [cutlass::gemm::threadblock::DefaultMmaCore< Shape_, WarpShape_, GemmShape< 1, 1, 1 >, ElementA_, layout::ColumnMajor, ElementB_, layout::RowMajor, ElementC_, LayoutC_, arch::OpClassSimt, 2, Operator_, >](structcutlass_1_1gemm_1_1threadblock_1_1DefaultMmaCore_3_01Shape 00_01WarpShape 00_01GemmSha46446d1e3871e31d2e728f710d78c8c1.html) | | |

|

Namespaces

| | | cutlass | | | | | cutlass::gemm | | | | | cutlass::gemm::threadblock | | |

Detailed Description

Partial specializations for threadblock::Mma operations targeting TensorOp instructions.


Generated by 1.8.11