Back to Cutlass

CUTLASS: cutlass::gemm::device Namespace Reference

docs/namespacecutlass_1_1gemm_1_1device.html

4.4.27.0 KB
Original Source

| | CUTLASS

CUDA Templates for Linear Algebra Subroutines and Solvers |

Classes

cutlass::gemm::device Namespace Reference

|

Classes

| | struct | DefaultGemmConfiguration | | | | struct | DefaultGemmConfiguration< arch::OpClassSimt, ArchTag, ElementA, ElementB, ElementC, ElementAccumulator > | | | | struct | DefaultGemmConfiguration< arch::OpClassSimt, ArchTag, int8_t, int8_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm70, ElementA, ElementB, ElementC, ElementAccumulator > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, ElementA, ElementB, ElementC, ElementAccumulator > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, int4b_t, int4b_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, int4b_t, uint4b_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, int8_t, int8_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, int8_t, uint8_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, uint4b_t, int4b_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, uint4b_t, uint4b_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, uint8_t, int8_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassTensorOp, arch::Sm75, uint8_t, uint8_t, ElementC, int32_t > | | | | struct | DefaultGemmConfiguration< arch::OpClassWmmaTensorOp, ArchTag, ElementA, ElementB, ElementC, ElementAccumulator > | | | | class | Gemm | | | | class | [Gemm< ElementA_, LayoutA_, ElementB_, LayoutB_, ElementC_, layout::ColumnMajor, ElementAccumulator_, OperatorClass_, ArchTag_, ThreadblockShape_, WarpShape_, InstructionShape_, EpilogueOutputOp_, ThreadblockSwizzle_, Stages, AlignmentA, AlignmentB, SplitKSerial, Operator_, IsBetaZero >](classcutlass_1_1gemm_1_1device_1_1Gemm_3_01ElementA 00_01LayoutA 00_01ElementB___00_01Layout4d0960ae6b1d1bf19e6239dbd002249c.html) | | | Partial specialization for column-major output exchanges problem size and operand. [More...](classcutlass_1_1gemm_1_1device_1_1Gemm_3_01ElementA 00_01LayoutA 00_01ElementB___00_01Layout4d0960ae6b1d1bf19e6239dbd002249c.html#details)
| | | | class | GemmBatched | | | | class | [GemmBatched< ElementA_, LayoutA_, ElementB_, LayoutB_, ElementC_, layout::ColumnMajor, ElementAccumulator_, OperatorClass_, ArchTag_, ThreadblockShape_, WarpShape_, InstructionShape_, EpilogueOutputOp_, ThreadblockSwizzle_, Stages, AlignmentA, AlignmentB, Operator_ >](classcutlass_1_1gemm_1_1device_1_1GemmBatched_3_01ElementA 00_01LayoutA 00_01ElementB___00_0c9bb6f4463ab6085e6008b5d5ad6abfd.html) | | | Partial specialization for column-major output exchanges problem size and operand. [More...](classcutlass_1_1gemm_1_1device_1_1GemmBatched_3_01ElementA 00_01LayoutA 00_01ElementB___00_0c9bb6f4463ab6085e6008b5d5ad6abfd.html#details)
| | | | class | GemmComplex | | | | class | [GemmComplex< ElementA_, LayoutA_, ElementB_, LayoutB_, ElementC_, layout::ColumnMajor, ElementAccumulator_, OperatorClass_, ArchTag_, ThreadblockShape_, WarpShape_, InstructionShape_, EpilogueOutputOp_, ThreadblockSwizzle_, Stages, TransformA, TransformB, SplitKSerial >](classcutlass_1_1gemm_1_1device_1_1GemmComplex_3_01ElementA 00_01LayoutA 00_01ElementB___00_07c56401b4df75709ae636675d9980a9a.html) | | | Partial specialization for column-major output exchanges problem size and operand. [More...](classcutlass_1_1gemm_1_1device_1_1GemmComplex_3_01ElementA 00_01LayoutA 00_01ElementB___00_07c56401b4df75709ae636675d9980a9a.html#details)
| | | | class | GemmSplitKParallel | | | | class | [GemmSplitKParallel< ElementA_, LayoutA_, ElementB_, LayoutB_, ElementC_, layout::ColumnMajor, ElementAccumulator_, OperatorClass_, ArchTag_, ThreadblockShape_, WarpShape_, InstructionShape_, EpilogueOutputOp_, ConvertScaledOp_, ReductionOp_, ThreadblockSwizzle_, Stages, kAlignmentA, kAlignmentB, Operator_ >](classcutlass_1_1gemm_1_1device_1_1GemmSplitKParallel_3_01ElementA 00_01LayoutA 00_01ElementBbe7c1f7154ad5b5bf9d4d28301e2b457.html) | | | Partial specialization for column-major output. [More...](classcutlass_1_1gemm_1_1device_1_1GemmSplitKParallel_3_01ElementA 00_01LayoutA 00_01ElementBbe7c1f7154ad5b5bf9d4d28301e2b457.html#details)
| | |


Generated by 1.8.11