Back to Cutlass

CUTLASS: regular_tile_iterator_pitch_linear.h File Reference

docs/regular__tile__iterator__pitch__linear_8h.html

4.4.23.3 KB
Original Source

| | CUTLASS

CUDA Templates for Linear Algebra Subroutines and Solvers |

Classes | Namespaces

regular_tile_iterator_pitch_linear.h File Reference

Templates implementing loading of tiles from pitch-linear rank=2 tensors. More...

#include "cutlass/cutlass.h"
#include "cutlass/tensor_ref.h"
#include "cutlass/layout/matrix.h"
#include "cutlass/layout/pitch_linear.h"
#include "regular_tile_iterator.h"

Include dependency graph for regular_tile_iterator_pitch_linear.h:

This graph shows which files directly or indirectly include this file:

[Go to the source code of this file.](regular tile iterator pitch linear_8h_source.html)

|

Classes

| | class | [cutlass::transform::threadblock::RegularTileIterator< Shape_, Element_, layout::PitchLinear, AdvanceRank, ThreadMap_, Alignment >](classcutlass_1_1transform_1_1threadblock_1_1RegularTileIterator_3_01Shape 00_01Element 00_0184a89653916f5d51ab59d1b386989a17.html) | | | Regular tile iterator specialized for pitch-linear. [More...](classcutlass_1_1transform_1_1threadblock_1_1RegularTileIterator_3_01Shape 00_01Element 00_0184a89653916f5d51ab59d1b386989a17.html#details)
| | | | class | [cutlass::transform::threadblock::RegularTileIterator< Shape_, Element_, layout::RowMajor, AdvanceRank, ThreadMap_, Alignment >](classcutlass_1_1transform_1_1threadblock_1_1RegularTileIterator_3_01Shape 00_01Element 00_0149454d361ea5885cf5166a920b5145df.html) | | | Regular tile iterator specialized for pitch-linear. [More...](classcutlass_1_1transform_1_1threadblock_1_1RegularTileIterator_3_01Shape 00_01Element 00_0149454d361ea5885cf5166a920b5145df.html#details)
| | | | class | [cutlass::transform::threadblock::RegularTileIterator< Shape_, Element_, layout::ColumnMajor, AdvanceRank, ThreadMap_, Alignment >](classcutlass_1_1transform_1_1threadblock_1_1RegularTileIterator_3_01Shape 00_01Element 00_011d3637dbd8bc58bcb020b51bf57fbfc0.html) | | | Regular tile iterator specialized for pitch-linear. [More...](classcutlass_1_1transform_1_1threadblock_1_1RegularTileIterator_3_01Shape 00_01Element 00_011d3637dbd8bc58bcb020b51bf57fbfc0.html#details)
| | |

|

Namespaces

| | | cutlass | | | | | cutlass::transform | | | | | cutlass::transform::threadblock | | |

Detailed Description

This iterator uses masks to guard out-of-bounds accesses and visits the last "residue" tile first, with the objective of minimizing predicate mask updates during steady-state operation.

A precomputed "Params" object minimizes the amount of state that must be stored in registers, and integer addition is used to advance the pointer through memory.


Generated by 1.8.11