CUTLASS: predicated_tile_iterator_2dthreadtile.h File Reference


CUTLASS: CUDA Templates for Linear Algebra Subroutines and Solvers



Templates implementing loading of tiles from pitch-linear rank=2 tensors.

#include "cutlass/transform/threadblock/predicated_tile_access_iterator_2dthreadtile.h"
#include "cutlass/transform/thread/transpose.h"


[Go to the source code of this file.](predicated__tile__iterator__2dthreadtile_8h_source.html)


Classes

| Kind | Name | Description |
| --- | --- | --- |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape, Element, Layout, AdvanceRank, ThreadMap, Transpose > | |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::PitchLinear, AdvanceRank, ThreadMap_, Transpose_ > | |
| struct | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::PitchLinear, AdvanceRank, ThreadMap_, Transpose_ >::AccessType | |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::PitchLinear, AdvanceRank, ThreadMap_, Transpose_ >::Params | Parameters object is precomputed state and is host-constructible. |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::ColumnMajor, AdvanceRank, ThreadMap_, Transpose_ > | |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::ColumnMajor, AdvanceRank, ThreadMap_, Transpose_ >::Params | Parameters object is precomputed state and is host-constructible. |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::RowMajor, AdvanceRank, ThreadMap_, Transpose_ > | |
| class | cutlass::transform::threadblock::PredicatedTileIterator2dThreadTile< Shape_, Element_, layout::RowMajor, AdvanceRank, ThreadMap_, Transpose_ >::Params | Parameters object is precomputed state and is host-constructible. |


Namespaces

| Namespace |
| --- |
| cutlass |
| cutlass::transform |
| cutlass::transform::threadblock |

Detailed Description

This iterator uses masks to guard out-of-bounds accesses and visits the last "residue" tile first, with the objective of minimizing predicate mask updates during steady-state operation.
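The residue-first ordering can be illustrated with a small host-side sketch. This is not CUTLASS code: `TileVisit`, `residue_first_schedule`, and `masked_sum` are hypothetical names, and the sketch works on a one-dimensional extent for simplicity. It assumes the partial tile covers the first `extent % tile` elements, so that every later tile is full and its guard is always true.

```cpp
#include <cassert>
#include <vector>

// Hypothetical illustration only; none of these names exist in CUTLASS.
struct TileVisit {
  int offset;  // index of the first element covered by this tile
  int valid;   // number of in-bounds elements in this tile
};

// Builds the visiting order for `extent` elements split into tiles of `tile`
// elements. The partial "residue" tile, if any, is visited first.
inline std::vector<TileVisit> residue_first_schedule(int extent, int tile) {
  assert(extent >= 0 && tile > 0);
  std::vector<TileVisit> visits;
  int residue = extent % tile;
  int offset = 0;
  if (residue != 0) {
    visits.push_back({0, residue});  // only this tile needs a partial mask
    offset = residue;
  }
  for (; offset < extent; offset += tile) {
    visits.push_back({offset, tile});  // steady state: every element is valid
  }
  return visits;
}

// Sums the data tile by tile, guarding each access with a predicate.
inline float masked_sum(const std::vector<float>& data, int tile) {
  float total = 0.0f;
  const int extent = static_cast<int>(data.size());
  for (const TileVisit& v : residue_first_schedule(extent, tile)) {
    for (int i = 0; i < tile; ++i) {
      bool guard = i < v.valid;  // plays the role of one predicate mask bit
      if (guard) {
        total += data[v.offset + i];
      }
    }
  }
  return total;
}
```

With an extent of 10 and a tile of 4, the schedule visits a 2-element residue tile at offset 0, then full tiles at offsets 2 and 6. The mask changes once, after the first tile, rather than being recomputed at the end of the loop.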

A precomputed "Params" object minimizes the amount of state that must be stored in registers, and integer addition is used to advance the pointer through memory.
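As a rough sketch of the precomputed-parameters idea, the following host-side C++ separates the values that can be computed once from the per-iterator state. The names `SketchParams` and `TileCursor`, and their fields, are assumptions made for illustration; they do not reproduce the members of the actual `Params` classes listed above.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a precomputed parameters object. Everything here
// depends only on the tensor layout, so it can be built once on the host.
struct SketchParams {
  std::int64_t stride_bytes;       // bytes between consecutive rows
  std::int64_t inc_advance_bytes;  // bytes to add when moving to the next tile

  SketchParams(std::int64_t stride_elems, int tile_rows, int elem_size)
      : stride_bytes(stride_elems * elem_size),
        inc_advance_bytes(stride_elems * elem_size * tile_rows) {}
};

// Hypothetical cursor over row tiles. Its only mutable state is one pointer;
// advancing to the next tile is a single integer addition.
class TileCursor {
 public:
  TileCursor(const SketchParams& params, const void* base)
      : params_(params), ptr_(static_cast<const char*>(base)) {}

  TileCursor& operator++() {
    ptr_ += params_.inc_advance_bytes;  // no multiplication in the hot path
    return *this;
  }

  // Returns a pointer to row `r` of the current tile.
  const float* row(int r) const {
    return reinterpret_cast<const float*>(ptr_ + r * params_.stride_bytes);
  }

 private:
  const SketchParams& params_;
  const char* ptr_;
};
```

Keeping the byte increments in a shared, read-only object means each cursor carries only a pointer and a reference, which is the kind of register saving the description refers to.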