cutlass/include/cutlass/epilogue/threadblock
Yujia Zhai cc3c29a81a
CUTLASS 3.6.0 (#1850)
* v3.6

* update changelog

* update readme

* fix typo

* fixing typos

* hopper gemm with weight prefetch

---------

Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2024-10-09 15:33:27 -04:00
..
fusion CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
default_epilogue_complex_tensor_op_blas3.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_epilogue_complex_tensor_op.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_epilogue_direct_store.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_epilogue_planar_complex.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_epilogue_simt.h Updates for CUTLASS 3.5.0 (#1468) 2024-04-11 21:33:40 -04:00
default_epilogue_tensor_op_blas3.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
default_epilogue_tensor_op.h CUTLASS 3.6.0 (#1850) 2024-10-09 15:33:27 -04:00
default_epilogue_volta_tensor_op.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_epilogue_with_absmax.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
default_epilogue_with_broadcast.h CUTLASS 3.5.1 (#1623) 2024-07-29 08:46:24 -04:00
default_epilogue_with_reduction.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_epilogue_wmma_tensor_op.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_thread_map_simt.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_thread_map_tensor_op.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_thread_map_volta_tensor_op.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
default_thread_map_wmma_tensor_op.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
direct_store_epilogue_iterator.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_base_streamk.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_base.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_depthwise.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_direct_store.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_gemm_k_reduction.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_planar_complex.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_smem_accumulator.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_streamk_with_broadcast.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_visitor_with_softmax.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_with_absmax.h CUTLASS 3.5.0 (#1411) 2024-03-19 17:51:04 -04:00
epilogue_with_broadcast.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_with_reduction.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_with_visitor_callbacks.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_with_visitor.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue_workspace.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
epilogue.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
interleaved_epilogue.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
output_iterator_parameter.h CUTLASS 3.5.1 (#1623) 2024-07-29 08:46:24 -04:00
output_tile_thread_map.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator_affine_layout_params.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator_affine.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator_blas3.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator_conv.h CUTLASS 3.5.1 (#1623) 2024-07-29 08:46:24 -04:00
predicated_tile_iterator_direct_conv.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator_params.h Updates for CUTLASS 3.5.0 (#1468) 2024-04-11 21:33:40 -04:00
predicated_tile_iterator_predicates.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator_strided_dgrad.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
predicated_tile_iterator.h CUTLASS 3.5.1 (#1623) 2024-07-29 08:46:24 -04:00
shared_load_iterator_mixed.h Update license year (#1306) 2024-01-16 14:37:22 -05:00
shared_load_iterator_pitch_linear.h Updates for CUTLASS 3.5.0 (#1468) 2024-04-11 21:33:40 -04:00
shared_load_iterator.h Update license year (#1306) 2024-01-16 14:37:22 -05:00