.. _cute_arch:

The ``cute.arch`` module provides lightweight wrappers around the NVVM operation builders that implement
CUDA built-in device functions such as ``thread_idx``. The wrappers work directly with CuTe DSL types.
These wrappers enable source location tracking through the ``@dsl_user_op``
decorator. The module includes the following functionality:

- Core CUDA built-in functions: ``thread_idx``, ``warp_idx``, ``block_dim``, ``grid_dim``, ``cluster_dim``, and related functions
- Memory barrier operations: ``mbarrier_init``, ``mbarrier_arrive``, ``mbarrier_wait``, and associated operations
- Shared memory (SMEM) management, with ``SmemAllocator`` as the recommended interface
- Tensor memory (TMEM) management, with ``TmemAllocator`` as the recommended interface

.. automodule:: cutlass.cute.arch
    :members:
    :undoc-members:
    :show-inheritance:
    :special-members: __init__
    :private-members:
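
The source location tracking mentioned above can be pictured with a small,
standard-library-only sketch. This is not the implementation of
``@dsl_user_op``: the names ``tracked_op`` and ``fake_thread_idx`` are
hypothetical and exist only for illustration. The sketch assumes that the
general idea is a decorator that records where the wrapped operation was
called from, so that a later diagnostic can point at the user's line rather
than at library internals.

.. code-block:: python

    import functools
    import inspect


    def tracked_op(func):
        """Hypothetical stand-in for a location-tracking decorator."""

        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Frame 0 is this wrapper; frame 1 is the code that called the op.
            caller = inspect.stack()[1]
            wrapper.last_loc = (caller.filename, caller.lineno)
            return func(*args, **kwargs)

        wrapper.last_loc = None
        return wrapper


    @tracked_op
    def fake_thread_idx():
        # Placeholder values; a real built-in would emit an NVVM operation.
        return (0, 0, 0)


    tidx, _, _ = fake_thread_idx()
    filename, lineno = fake_thread_idx.last_loc
    print(f"fake_thread_idx called from {filename}:{lineno}")

The three-way unpacking mirrors the way an index built-in such as
``thread_idx`` is typically consumed, one value per dimension. Storing the
location on the wrapper keeps the sketch short; a real DSL would more likely
attach it to the operation it generates.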