docs/batched__reduction_8h.html
| | CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers |
Classes | Namespaces | Functions
batched_reduction.h File Reference
Implements a software-pipelined efficient batched reduction. D = alpha * Reduction(A) + beta * C. More...
#include <cuda.h>
#include "cutlass/coord.h"
#include "cutlass/util/platform.h"
#include "cutlass/fragment.h"
Include dependency graph for batched_reduction.h:
This graph shows which files directly or indirectly include this file:
Go to the source code of this file.
|
| | struct | cutlass::reduction::BatchedReduction< BatchedReductionTraits_ > | | |
|
| | | cutlass | | | | | cutlass::reduction | | |
|
| | template<typename batched_reduction_ > | | __global__ | cutlass::reduction::__launch_bounds__ (batched_reduction_::Traits::kThreads, 1) void batched_reduction_kernel(typename batched_reduction_ | | |
Generated by 1.8.11