CUTLASS: linear_combination_clamp.h Source File - Cutlass

CUTLASS_HOST_DEVICE FragmentOutput operator()(FragmentAccumulator const &accumulator, FragmentOutput const &source, ElementCompute uniform=ElementCompute(0)) const

Computes linear scaling: D = alpha * accumulator + beta * source.

Definition: linear_combination_clamp.h:144

cutlass::epilogue::thread::LinearCombinationClamp::Params::Params

CUTLASS_HOST_DEVICE Params()

Definition: linear_combination_clamp.h:86

array.h

Statically sized array of elements that accommodates all CUTLASS-supported numeric types and is safe ...

CUTLASS_PRAGMA_UNROLL

#define CUTLASS_PRAGMA_UNROLL

Definition: cutlass.h:110

numeric_conversion.h

Boost-like numeric conversion operator for CUTLASS numeric types.

cutlass::sizeof_bits

Defines the size of an element in bits.

Definition: numeric_types.h:42

nullptr

#define nullptr

nullptr

Definition: platform.h:144

cutlass::epilogue::thread::LinearCombinationClamp::LinearCombinationClamp

CUTLASS_HOST_DEVICE LinearCombinationClamp(Params const &params)

Constructs the function object, possibly loading from pointers in host memory.

Definition: linear_combination_clamp.h:122

cutlass::multiplies

Definition: functional.h:64

CUTLASS_HOST_DEVICE

#define CUTLASS_HOST_DEVICE

Definition: cutlass.h:89

numeric_types.h

Top-level include for all CUTLASS numeric types.

cutlass::epilogue::thread::LinearCombinationClamp::ComputeFragment

Array< ElementCompute, kCount > ComputeFragment

Definition: linear_combination_clamp.h:69

cutlass::epilogue::thread::LinearCombinationClamp::ElementOutput

ElementOutput_ ElementOutput

Definition: linear_combination_clamp.h:61

cutlass::epilogue::thread::LinearCombinationClamp::FragmentOutput

Array< ElementOutput, kCount > FragmentOutput

Definition: linear_combination_clamp.h:67

cutlass::epilogue::thread::LinearCombinationClamp::ElementAccumulator

ElementAccumulator_ ElementAccumulator

Definition: linear_combination_clamp.h:62

cutlass::FloatRoundStyle::round_to_nearest

round to nearest even

cutlass::epilogue::thread::LinearCombinationClamp::Params::beta_ptr

ElementCompute const * beta_ptr

pointer to source scalar - if not null, loads it from memory

Definition: linear_combination_clamp.h:79

cutlass::epilogue::thread::LinearCombinationClamp::set_k_partition

CUTLASS_HOST_DEVICE void set_k_partition(int k_partition)

Functionally required for serial reduction in the epilogue.

Definition: linear_combination_clamp.h:136

cutlass::FloatRoundStyle

FloatRoundStyle

Definition: numeric_conversion.h:43

cutlass::NumericArrayConverter

Conversion operator for Array.

Definition: numeric_conversion.h:294

cutlass::epilogue::thread::LinearCombinationClamp::Params

Host-constructable parameters structure.

Definition: linear_combination_clamp.h:74

cutlass::epilogue::thread::LinearCombinationClamp::kRound

static FloatRoundStyle const kRound

Definition: linear_combination_clamp.h:71

cutlass::epilogue::thread::LinearCombinationClamp::is_source_needed

CUTLASS_HOST_DEVICE bool is_source_needed() const

Returns true if source is needed.

Definition: linear_combination_clamp.h:130

cutlass.h

Basic include for CUTLASS.

cutlass::epilogue::thread::LinearCombinationClamp::Params::alpha_ptr

ElementCompute const * alpha_ptr

pointer to accumulator scalar - if not null, loads it from memory

Definition: linear_combination_clamp.h:78

functional.h

Define basic numeric operators with specializations for Array<T, N>. SIMD-ize where possible...

cutlass::epilogue::thread::LinearCombinationClamp::Params::alpha

ElementCompute alpha

scales accumulators

Definition: linear_combination_clamp.h:76

cutlass::epilogue::thread::LinearCombinationClamp::FragmentAccumulator

Array< ElementAccumulator, kCount > FragmentAccumulator

Definition: linear_combination_clamp.h:68

Generated by 1.8.11