cutlass/docs/gemm__shared__tile_8h_source.html
2018-10-26 14:54:58 -07:00

216 lines
146 KiB
HTML

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/xhtml;charset=UTF-8"/>
<meta http-equiv="X-UA-Compatible" content="IE=9"/>
<meta name="generator" content="Doxygen 1.8.14"/>
<meta name="viewport" content="width=device-width, initial-scale=1"/>
<title>Cutlass: gemm_shared_tile.h Source File</title>
<link href="tabs.css" rel="stylesheet" type="text/css"/>
<script type="text/javascript" src="jquery.js"></script>
<script type="text/javascript" src="dynsections.js"></script>
<link href="search/search.css" rel="stylesheet" type="text/css"/>
<script type="text/javascript" src="search/searchdata.js"></script>
<script type="text/javascript" src="search/search.js"></script>
<script type="text/x-mathjax-config">
MathJax.Hub.Config({
extensions: ["tex2jax.js"],
jax: ["input/TeX","output/HTML-CSS"],
});
</script><script type="text/javascript" async src="http://cdn.mathjax.org/mathjax/latest/MathJax.js"></script>
<link href="doxygen.css" rel="stylesheet" type="text/css" />
</head>
<body>
<div id="top"><!-- do not remove this div, it is closed by doxygen! -->
<div id="titlearea">
<table cellspacing="0" cellpadding="0">
<tbody>
<tr style="height: 56px;">
<td id="projectalign" style="padding-left: 0.5em;">
<div id="projectname">Cutlass
</div>
<div id="projectbrief">CUDA Templates for Linear Algebra Subroutines and Solvers</div>
</td>
</tr>
</tbody>
</table>
</div>
<!-- end header part -->
<!-- Generated by Doxygen 1.8.14 -->
<script type="text/javascript">
/* @license magnet:?xt=urn:btih:cf05388f2679ee054f2beb29a391d25f4e673ac3&amp;dn=gpl-2.0.txt GPL-v2 */
var searchBox = new SearchBox("searchBox", "search",false,'Search');
/* @license-end */
</script>
<script type="text/javascript" src="menudata.js"></script>
<script type="text/javascript" src="menu.js"></script>
<script type="text/javascript">
/* @license magnet:?xt=urn:btih:cf05388f2679ee054f2beb29a391d25f4e673ac3&amp;dn=gpl-2.0.txt GPL-v2 */
$(function() {
initMenu('',true,false,'search.php','Search');
$(document).ready(function() { init_search(); });
});
/* @license-end */</script>
<div id="main-nav"></div>
<!-- window showing the filter options -->
<div id="MSearchSelectWindow"
onmouseover="return searchBox.OnSearchSelectShow()"
onmouseout="return searchBox.OnSearchSelectHide()"
onkeydown="return searchBox.OnSearchSelectKey(event)">
</div>
<!-- iframe showing the search results (closed by default) -->
<div id="MSearchResultsWindow">
<iframe src="javascript:void(0)" frameborder="0"
name="MSearchResults" id="MSearchResults">
</iframe>
</div>
<div id="nav-path" class="navpath">
<ul>
<li class="navelem"><a class="el" href="dir_1417ee5ebebc309c36b7962f26a92c39.html">cutlass</a></li><li class="navelem"><a class="el" href="dir_18d6a367a3982a494d65599933fc67a3.html">gemm</a></li> </ul>
</div>
</div><!-- top -->
<div class="header">
<div class="headertitle">
<div class="title">gemm_shared_tile.h</div> </div>
</div><!--header-->
<div class="contents">
<a href="gemm__shared__tile_8h.html">Go to the documentation of this file.</a><div class="fragment"><div class="line"><a name="l00001"></a><span class="lineno"> 1</span>&#160;<span class="comment">/***************************************************************************************************</span></div><div class="line"><a name="l00002"></a><span class="lineno"> 2</span>&#160;<span class="comment"> * Copyright (c) 2017-2018, NVIDIA CORPORATION. All rights reserved.</span></div><div class="line"><a name="l00003"></a><span class="lineno"> 3</span>&#160;<span class="comment"> *</span></div><div class="line"><a name="l00004"></a><span class="lineno"> 4</span>&#160;<span class="comment"> * Redistribution and use in source and binary forms, with or without modification, are permitted</span></div><div class="line"><a name="l00005"></a><span class="lineno"> 5</span>&#160;<span class="comment"> * provided that the following conditions are met:</span></div><div class="line"><a name="l00006"></a><span class="lineno"> 6</span>&#160;<span class="comment"> * * Redistributions of source code must retain the above copyright notice, this list of</span></div><div class="line"><a name="l00007"></a><span class="lineno"> 7</span>&#160;<span class="comment"> * conditions and the following disclaimer.</span></div><div class="line"><a name="l00008"></a><span class="lineno"> 8</span>&#160;<span class="comment"> * * Redistributions in binary form must reproduce the above copyright notice, this list of</span></div><div class="line"><a name="l00009"></a><span class="lineno"> 9</span>&#160;<span class="comment"> * conditions and the following disclaimer in the documentation and/or other materials</span></div><div class="line"><a name="l00010"></a><span class="lineno"> 10</span>&#160;<span class="comment"> * provided with the distribution.</span></div><div class="line"><a name="l00011"></a><span class="lineno"> 11</span>&#160;<span class="comment"> * * Neither the name of the NVIDIA CORPORATION nor the names of its contributors may be used</span></div><div class="line"><a name="l00012"></a><span class="lineno"> 12</span>&#160;<span class="comment"> * to endorse or promote products derived from this software without specific prior written</span></div><div class="line"><a name="l00013"></a><span class="lineno"> 13</span>&#160;<span class="comment"> * permission.</span></div><div class="line"><a name="l00014"></a><span class="lineno"> 14</span>&#160;<span class="comment"> *</span></div><div class="line"><a name="l00015"></a><span class="lineno"> 15</span>&#160;<span class="comment"> * THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS &quot;AS IS&quot; AND ANY EXPRESS OR</span></div><div class="line"><a name="l00016"></a><span class="lineno"> 16</span>&#160;<span class="comment"> * IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND</span></div><div class="line"><a name="l00017"></a><span class="lineno"> 17</span>&#160;<span class="comment"> * FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL NVIDIA CORPORATION BE LIABLE</span></div><div class="line"><a name="l00018"></a><span class="lineno"> 18</span>&#160;<span class="comment"> * FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING,</span></div><div class="line"><a name="l00019"></a><span class="lineno"> 19</span>&#160;<span class="comment"> * BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS;</span></div><div class="line"><a name="l00020"></a><span class="lineno"> 20</span>&#160;<span class="comment"> * OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT,</span></div><div class="line"><a name="l00021"></a><span class="lineno"> 21</span>&#160;<span class="comment"> * STRICT LIABILITY, OR TOR (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE</span></div><div class="line"><a name="l00022"></a><span class="lineno"> 22</span>&#160;<span class="comment"> * OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.</span></div><div class="line"><a name="l00023"></a><span class="lineno"> 23</span>&#160;<span class="comment"> *</span></div><div class="line"><a name="l00024"></a><span class="lineno"> 24</span>&#160;<span class="comment"> **************************************************************************************************/</span></div><div class="line"><a name="l00028"></a><span class="lineno"> 28</span>&#160;<span class="preprocessor">#pragma once</span></div><div class="line"><a name="l00029"></a><span class="lineno"> 29</span>&#160;</div><div class="line"><a name="l00030"></a><span class="lineno"> 30</span>&#160;<span class="preprocessor">#include &quot;<a class="code" href="gemm__operand_8h.html">cutlass/gemm/gemm_operand.h</a>&quot;</span></div><div class="line"><a name="l00031"></a><span class="lineno"> 31</span>&#160;</div><div class="line"><a name="l00032"></a><span class="lineno"> 32</span>&#160;<span class="keyword">namespace </span><a class="code" href="namespacecutlass.html">cutlass</a> {</div><div class="line"><a name="l00033"></a><span class="lineno"> 33</span>&#160;<span class="keyword">namespace </span>gemm {</div><div class="line"><a name="l00034"></a><span class="lineno"> 34</span>&#160;</div><div class="line"><a name="l00036"></a><span class="lineno"> 36</span>&#160;</div><div class="line"><a name="l00037"></a><span class="lineno"> 37</span>&#160;<span class="keyword">template</span> &lt;<span class="keyword">typename</span> Scalar_, <span class="keyword">typename</span> Tile_, <span class="keyword">typename</span> Threads_, <span class="keywordtype">int</span> kScalarsPerSts_&gt;</div><div class="line"><a name="l00038"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html"> 38</a></span>&#160;<span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html">GemmSharedStoreTileAbTraits</a> {</div><div class="line"><a name="l00040"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a8b04fd003fc2db46d749360e8838438b"> 40</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">platform::remove_const&lt;Scalar_&gt;::type</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a8b04fd003fc2db46d749360e8838438b">Scalar</a>;</div><div class="line"><a name="l00042"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a5be0c995c57faafaad7ae55ae015fc00"> 42</a></span>&#160; <span class="keyword">typedef</span> Scalar_* <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a5be0c995c57faafaad7ae55ae015fc00">Pointer</a>;</div><div class="line"><a name="l00044"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ab96f324083e51ce4c2b73c18803c69a7"> 44</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">ReshapeTile&lt;Tile_, kScalarsPerSts_&gt;::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ab96f324083e51ce4c2b73c18803c69a7">Tile</a>;</div><div class="line"><a name="l00046"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a1acf2a1d8bf73fda142e7d82e05f00a2"> 46</a></span>&#160; <span class="keyword">typedef</span> Threads_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a1acf2a1d8bf73fda142e7d82e05f00a2">Threads</a>;</div><div class="line"><a name="l00048"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae540e7ea7106552682aa4c97b833b3b1"> 48</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, ShapeCount&lt;Tile&gt;::kWc</a>, Tile::kC, kScalarsPerSts_&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae540e7ea7106552682aa4c97b833b3b1">ThreadsStrides</a>;</div><div class="line"><a name="l00050"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ace14ca9ad11e2cdafcd4a4b63c0df591"> 50</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ace14ca9ad11e2cdafcd4a4b63c0df591">kSkew</a> = 0;</div><div class="line"><a name="l00052"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae852c89da0455025c0c41af258e47047"> 52</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae852c89da0455025c0c41af258e47047">kAccessSize</a> = kScalarsPerSts_;</div><div class="line"><a name="l00054"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a59c981aa720f983b846bed7c3e4a7cab"> 54</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">MemorySpace::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a59c981aa720f983b846bed7c3e4a7cab">kMemorySpace</a> = <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">MemorySpace::kShared</a>;</div><div class="line"><a name="l00055"></a><span class="lineno"> 55</span>&#160;</div><div class="line"><a name="l00057"></a><span class="lineno"> 57</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1,</div><div class="line"><a name="l00058"></a><span class="lineno"> 58</span>&#160; Tile::kH / Threads::kH,</div><div class="line"><a name="l00059"></a><span class="lineno"> 59</span>&#160; Tile::kW / Threads::kW,</div><div class="line"><a name="l00060"></a><span class="lineno"> 60</span>&#160; Tile::kC / Threads::kC / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae852c89da0455025c0c41af258e47047">kAccessSize</a>&gt;</div><div class="line"><a name="l00061"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a6125e052e47296c3ef53c8a149ffd31b"> 61</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a6125e052e47296c3ef53c8a149ffd31b">Iterations</a>;</div><div class="line"><a name="l00063"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a645f65f7d8f123936b286521df470224"> 63</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, Threads::kH * ShapeCount&lt;Tile&gt;::kWc</a>, Threads::kW * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae852c89da0455025c0c41af258e47047">kAccessSize</a>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a645f65f7d8f123936b286521df470224">Delta</a>;</div><div class="line"><a name="l00065"></a><span class="lineno"> 65</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, Threads::kH * ShapeCount&lt;Tile&gt;::kWc</a>, Threads::kW * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae852c89da0455025c0c41af258e47047">kAccessSize</a>&gt;</div><div class="line"><a name="l00066"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a027bebceeda2287b40915ffd95d494a7"> 66</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a027bebceeda2287b40915ffd95d494a7">ImmediateOffsetStrides</a>;</div><div class="line"><a name="l00067"></a><span class="lineno"> 67</span>&#160;</div><div class="line"><a name="l00068"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset.html"> 68</a></span>&#160; <span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset.html">ThreadOffset</a> {</div><div class="line"><a name="l00069"></a><span class="lineno"> 69</span>&#160; <a class="code" href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a></div><div class="line"><a name="l00070"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset.html#a1e357fe5bc1daef333e6be776a21a2ca"> 70</a></span>&#160; <a class="code" href="structcutlass_1_1Coord.html">Coord&lt;4&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset.html#a1e357fe5bc1daef333e6be776a21a2ca">operator()</a>()<span class="keyword"> const </span>{</div><div class="line"><a name="l00071"></a><span class="lineno"> 71</span>&#160; <span class="keywordtype">int</span> offset = <a class="code" href="structcutlass_1_1ComputeThreadOffsetFromStrides.html#a1744bfe277cbe0c642cce4a48c1dd9ad">ComputeThreadOffsetFromStrides&lt;Threads, ThreadsStrides&gt;::get</a>();</div><div class="line"><a name="l00072"></a><span class="lineno"> 72</span>&#160; <span class="keywordflow">return</span> <a class="code" href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">make_Coord</a>(0, 0, offset, 0);</div><div class="line"><a name="l00073"></a><span class="lineno"> 73</span>&#160; }</div><div class="line"><a name="l00074"></a><span class="lineno"> 74</span>&#160; };</div><div class="line"><a name="l00075"></a><span class="lineno"> 75</span>&#160;};</div><div class="line"><a name="l00076"></a><span class="lineno"> 76</span>&#160;</div><div class="line"><a name="l00078"></a><span class="lineno"> 78</span>&#160;</div><div class="line"><a name="l00079"></a><span class="lineno"> 79</span>&#160;<span class="keyword">template</span> &lt;<span class="keyword">typename</span> Scalar_, <span class="keyword">typename</span> Tile_, <span class="keyword">typename</span> Threads_, <span class="keywordtype">int</span> kScalarsPerSts_, <span class="keywordtype">int</span> kSkew_&gt;</div><div class="line"><a name="l00080"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html"> 80</a></span>&#160;<span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html">GemmSharedStoreWithSkewTileAbTraits</a> {</div><div class="line"><a name="l00082"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#aaa439a0bb6b9de5e2722ea7b011effea"> 82</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">platform::remove_const&lt;Scalar_&gt;::type</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#aaa439a0bb6b9de5e2722ea7b011effea">Scalar</a>;</div><div class="line"><a name="l00084"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#ab883c2a8b90262152faca9cabe515dc4"> 84</a></span>&#160; <span class="keyword">typedef</span> Scalar_* <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#ab883c2a8b90262152faca9cabe515dc4">Pointer</a>;</div><div class="line"><a name="l00086"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a050cf5964a2d3683491bc4313ead5450"> 86</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">ReshapeTile&lt;Tile_, kScalarsPerSts_&gt;::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a050cf5964a2d3683491bc4313ead5450">TileWithoutSkew</a>;</div><div class="line"><a name="l00088"></a><span class="lineno"> 88</span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html">ReshapeTile&lt;Shape&lt;Tile_::kD, Tile_::kH, Tile_::kW + kSkew_&gt;</a>,</div><div class="line"><a name="l00089"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a74196946c28e98ee60346b0eeede1471"> 89</a></span>&#160; kScalarsPerSts_&gt;<a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a74196946c28e98ee60346b0eeede1471">::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a74196946c28e98ee60346b0eeede1471">Tile</a>;</div><div class="line"><a name="l00091"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a9bef06b59f27c6e673066a7f0280aa06"> 91</a></span>&#160; <span class="keyword">typedef</span> Threads_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a9bef06b59f27c6e673066a7f0280aa06">Threads</a>;</div><div class="line"><a name="l00093"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#aba6decf87d770becaadd610d9fc27491"> 93</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#aba6decf87d770becaadd610d9fc27491">kSkew</a> = kSkew_;</div><div class="line"><a name="l00095"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a846e6d8d06be0ba6fa41b1431c8ec061"> 95</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a846e6d8d06be0ba6fa41b1431c8ec061">kAccessSize</a> = kScalarsPerSts_;</div><div class="line"><a name="l00097"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#ae5a07814b9cfe9a64f69bac0f0772f20"> 97</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">MemorySpace::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#ae5a07814b9cfe9a64f69bac0f0772f20">kMemorySpace</a> = <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">MemorySpace::kShared</a>;</div><div class="line"><a name="l00098"></a><span class="lineno"> 98</span>&#160;</div><div class="line"><a name="l00100"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a025445699c5c86237d8c3e48f01081ea"> 100</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1, TileWithoutSkew::kH / Threads::kW, TileWithoutSkew::kW / Threads::kH&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a025445699c5c86237d8c3e48f01081ea">Iterations</a>;</div><div class="line"><a name="l00102"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#afd691b764b7d105a1ed41dada6049e71"> 102</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, ShapeCount&lt;Tile&gt;::kWc</a>, Threads::kH * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a846e6d8d06be0ba6fa41b1431c8ec061">kAccessSize</a>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#afd691b764b7d105a1ed41dada6049e71">Delta</a>;</div><div class="line"><a name="l00104"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a39414f484da7f993bc96d61c97273614"> 104</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, ShapeCount&lt;Tile&gt;::kWc</a>, Threads::kH * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a846e6d8d06be0ba6fa41b1431c8ec061">kAccessSize</a>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a39414f484da7f993bc96d61c97273614">ImmediateOffsetStrides</a>;</div><div class="line"><a name="l00105"></a><span class="lineno"> 105</span>&#160;</div><div class="line"><a name="l00106"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset.html"> 106</a></span>&#160; <span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset.html">ThreadOffset</a> {</div><div class="line"><a name="l00107"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset.html#a4e35f0b2ca63a6b981230b73f843f726"> 107</a></span>&#160; <a class="code" href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a> <a class="code" href="structcutlass_1_1Coord.html">Coord&lt;4&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset.html#a4e35f0b2ca63a6b981230b73f843f726">operator()</a>()<span class="keyword"> const </span>{</div><div class="line"><a name="l00108"></a><span class="lineno"> 108</span>&#160; <span class="keywordtype">int</span> offset = <a class="code" href="structcutlass_1_1ComputeThreadOffsetFromStrides.html#a1744bfe277cbe0c642cce4a48c1dd9ad">ComputeThreadOffsetFromStrides&lt;Threads, ThreadsStrides&gt;::get</a>();</div><div class="line"><a name="l00109"></a><span class="lineno"> 109</span>&#160; <span class="keywordflow">return</span> <a class="code" href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">make_Coord</a>(0, 0, offset, 0);</div><div class="line"><a name="l00110"></a><span class="lineno"> 110</span>&#160; }</div><div class="line"><a name="l00111"></a><span class="lineno"> 111</span>&#160; };</div><div class="line"><a name="l00112"></a><span class="lineno"> 112</span>&#160;</div><div class="line"><a name="l00113"></a><span class="lineno"> 113</span>&#160; <span class="keyword">protected</span>:</div><div class="line"><a name="l00115"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a2053e4b9cb3ed2727c89960354ea0b29"> 115</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, kScalarsPerSts_, ShapeCount&lt;Tile&gt;::kHwc</a> / Threads::kW&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a2053e4b9cb3ed2727c89960354ea0b29">ThreadsStrides</a>;</div><div class="line"><a name="l00116"></a><span class="lineno"> 116</span>&#160;};</div><div class="line"><a name="l00117"></a><span class="lineno"> 117</span>&#160;</div><div class="line"><a name="l00119"></a><span class="lineno"> 119</span>&#160;</div><div class="line"><a name="l00120"></a><span class="lineno"> 120</span>&#160;<span class="keyword">template</span> &lt;<span class="keyword">typename</span> Scalar_,</div><div class="line"><a name="l00121"></a><span class="lineno"> 121</span>&#160; <span class="keyword">typename</span> OutputTile_,</div><div class="line"><a name="l00122"></a><span class="lineno"> 122</span>&#160; <span class="keyword">typename</span> Warps_,</div><div class="line"><a name="l00123"></a><span class="lineno"> 123</span>&#160; <span class="keyword">typename</span> ThreadsPerWarp_,</div><div class="line"><a name="l00124"></a><span class="lineno"> 124</span>&#160; <span class="keyword">typename</span> InstructionShape_,</div><div class="line"><a name="l00125"></a><span class="lineno"> 125</span>&#160; <span class="keywordtype">int</span> kStages_,</div><div class="line"><a name="l00126"></a><span class="lineno"> 126</span>&#160; <span class="keywordtype">int</span> kScalarsPerLds_,</div><div class="line"><a name="l00127"></a><span class="lineno"> 127</span>&#160; <span class="keywordtype">int</span> kSkew_ = 0&gt;</div><div class="line"><a name="l00128"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html"> 128</a></span>&#160;<span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html">GemmSharedLoadTileATraits</a> {</div><div class="line"><a name="l00129"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af511bba9fc2125516eb1442b1c88d851"> 129</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0c">GemmOperand::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af511bba9fc2125516eb1442b1c88d851">kOperand</a> = <a class="code" href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0cac2b9fe9e3679a059d1a6c946b2a2c31a">GemmOperand::kA</a>;</div><div class="line"><a name="l00131"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a1b6956adc65254202864520b668edd14"> 131</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">platform::remove_const&lt;Scalar_&gt;::type</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a1b6956adc65254202864520b668edd14">Scalar</a>;</div><div class="line"><a name="l00133"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#adc4946dfbe914140c6852d0c05b30864"> 133</a></span>&#160; <span class="keyword">typedef</span> Scalar_* <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#adc4946dfbe914140c6852d0c05b30864">Pointer</a>;</div><div class="line"><a name="l00135"></a><span class="lineno"> 135</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;kStages_,</div><div class="line"><a name="l00136"></a><span class="lineno"> 136</span>&#160; OutputTile_::kD / InstructionShape_::kD,</div><div class="line"><a name="l00137"></a><span class="lineno"> 137</span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GetExtent.html">GetExtent&lt;kOperand, OutputTile_&gt;::kExtent</a> * InstructionShape_::kD&gt;</div><div class="line"><a name="l00138"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a93ae99460695718babaef6d1ef597e38"> 138</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a93ae99460695718babaef6d1ef597e38">TileWithoutSkew_</a>;</div><div class="line"><a name="l00140"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a72e0214f86cf8b3711d006dcd69d7a17"> 140</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;kStages_, TileWithoutSkew_::kH, TileWithoutSkew_::kW + kSkew_&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a72e0214f86cf8b3711d006dcd69d7a17">TileWithSkew</a>;</div><div class="line"><a name="l00142"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a5a5a36fc570e1225b20ce0a48c89d213"> 142</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">ReshapeTile&lt;TileWithoutSkew_, kScalarsPerLds_&gt;::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a5a5a36fc570e1225b20ce0a48c89d213">TileWithoutSkew</a>;</div><div class="line"><a name="l00144"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a9a00be672617162c4c7ac94c7d8980cc"> 144</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">ReshapeTile&lt;TileWithSkew, kScalarsPerLds_&gt;::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a9a00be672617162c4c7ac94c7d8980cc">Tile</a>;</div><div class="line"><a name="l00146"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#aaff4a5e0f9e4256f184a22cad0ce8cf4"> 146</a></span>&#160; <span class="keyword">typedef</span> Warps_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#aaff4a5e0f9e4256f184a22cad0ce8cf4">Warps</a>;</div><div class="line"><a name="l00148"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0761c497c41a45652368fc0d54def98f"> 148</a></span>&#160; <span class="keyword">typedef</span> ThreadsPerWarp_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0761c497c41a45652368fc0d54def98f">ThreadsPerWarp</a>;</div><div class="line"><a name="l00150"></a><span class="lineno"> 150</span>&#160; <span class="comment">// static int const kScalarsPerLds = kScalarsPerLds_;</span></div><div class="line"><a name="l00151"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0a33d4289ed45e988d560b5f73ac997e"> 151</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0a33d4289ed45e988d560b5f73ac997e">kAccessSize</a> = kScalarsPerLds_;</div><div class="line"><a name="l00153"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#aaffe67e519e919bf561142e05da6e6c8"> 153</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#aaffe67e519e919bf561142e05da6e6c8">kSkew</a> = kSkew_;</div><div class="line"><a name="l00155"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4456e4c8048bfb378e5b80833a0d19e5"> 155</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">MemorySpace::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4456e4c8048bfb378e5b80833a0d19e5">kMemorySpace</a> = <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">MemorySpace::kShared</a>;</div><div class="line"><a name="l00156"></a><span class="lineno"> 156</span>&#160;</div><div class="line"><a name="l00158"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af78a275086a297bd93aed920f57a17be"> 158</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af78a275086a297bd93aed920f57a17be">kWarps</a> = <a class="code" href="structcutlass_1_1gemm_1_1GetExtent.html">GetExtent&lt;kOperand, Warps&gt;::kExtent</a>;</div><div class="line"><a name="l00160"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4246185b8279f245ef5d0650c1eec14f"> 160</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4246185b8279f245ef5d0650c1eec14f">kThreadsPerWarp</a> = <a class="code" href="structcutlass_1_1gemm_1_1GetExtent.html">GetExtent&lt;kOperand, ThreadsPerWarp&gt;::kExtent</a>;</div><div class="line"><a name="l00161"></a><span class="lineno"> 161</span>&#160;</div><div class="line"><a name="l00163"></a><span class="lineno"> 163</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1, 1, TileWithoutSkew::kW / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af78a275086a297bd93aed920f57a17be">kWarps</a> / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4246185b8279f245ef5d0650c1eec14f">kThreadsPerWarp</a> <span class="comment">/* / kScalarsPerLds*/</span>&gt;</div><div class="line"><a name="l00164"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#ae96e490d38ade6db4d853fb6c8f3378b"> 164</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#ae96e490d38ade6db4d853fb6c8f3378b">Iterations</a>;</div><div class="line"><a name="l00166"></a><span class="lineno"> 166</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;TileWithSkew::kW * Warps::kD, 0, kWarps * kThreadsPerWarp * kAccessSize, 0&gt;</a></div><div class="line"><a name="l00167"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#ad012add21d9393d136720f609467e121"> 167</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#ad012add21d9393d136720f609467e121">ImmediateOffsetStrides</a>;</div><div class="line"><a name="l00168"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a134a02091bf4360d2cbca56624e52024"> 168</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;TileWithSkew::kW * Warps::kD, 0, kWarps * kThreadsPerWarp * kAccessSize, 0&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a134a02091bf4360d2cbca56624e52024">Delta</a>;</div><div class="line"><a name="l00169"></a><span class="lineno"> 169</span>&#160;</div><div class="line"><a name="l00171"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset.html"> 171</a></span>&#160; <span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset.html">ThreadOffset</a> {</div><div class="line"><a name="l00172"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset.html#a51a325b435b9a53effaa003b3670e410"> 172</a></span>&#160; <a class="code" href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a> <a class="code" href="structcutlass_1_1Coord.html">Coord&lt;4&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset.html#a51a325b435b9a53effaa003b3670e410">operator()</a>()<span class="keyword"> const </span>{</div><div class="line"><a name="l00173"></a><span class="lineno"> 173</span>&#160; <span class="comment">// Extract the warp.</span></div><div class="line"><a name="l00174"></a><span class="lineno"> 174</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp = threadIdx.x / kWarpSize;</div><div class="line"><a name="l00175"></a><span class="lineno"> 175</span>&#160; <span class="comment">// Extract the slice.</span></div><div class="line"><a name="l00176"></a><span class="lineno"> 176</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> slice = warp / (Warps::kH * Warps::kW);</div><div class="line"><a name="l00177"></a><span class="lineno"> 177</span>&#160; <span class="comment">// Compute the row offset for each warp.</span></div><div class="line"><a name="l00178"></a><span class="lineno"> 178</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp_row = warp % Warps::kW;</div><div class="line"><a name="l00179"></a><span class="lineno"> 179</span>&#160; <span class="comment">// Compute the row offset for each thread.</span></div><div class="line"><a name="l00180"></a><span class="lineno"> 180</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> lane_row = (threadIdx.x &amp; 0x0e) / 2;</div><div class="line"><a name="l00181"></a><span class="lineno"> 181</span>&#160; <span class="comment">// The offset.</span></div><div class="line"><a name="l00182"></a><span class="lineno"> 182</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> offset =</div><div class="line"><a name="l00183"></a><span class="lineno"> 183</span>&#160; slice * Tile::kW * Tile::kC + (warp_row * ThreadsPerWarp::kW + lane_row) * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0a33d4289ed45e988d560b5f73ac997e">kAccessSize</a>;</div><div class="line"><a name="l00184"></a><span class="lineno"> 184</span>&#160; <span class="comment">// Embed the offset in a 4D coordinate vector.</span></div><div class="line"><a name="l00185"></a><span class="lineno"> 185</span>&#160; <span class="keywordflow">return</span> <a class="code" href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">make_Coord</a>(0, 0, offset, 0);</div><div class="line"><a name="l00186"></a><span class="lineno"> 186</span>&#160; }</div><div class="line"><a name="l00187"></a><span class="lineno"> 187</span>&#160; };</div><div class="line"><a name="l00188"></a><span class="lineno"> 188</span>&#160;};</div><div class="line"><a name="l00189"></a><span class="lineno"> 189</span>&#160;</div><div class="line"><a name="l00191"></a><span class="lineno"> 191</span>&#160;</div><div class="line"><a name="l00192"></a><span class="lineno"> 192</span>&#160;<span class="keyword">template</span> &lt;<span class="keyword">typename</span> Scalar_,</div><div class="line"><a name="l00193"></a><span class="lineno"> 193</span>&#160; <span class="keyword">typename</span> OutputTile_,</div><div class="line"><a name="l00194"></a><span class="lineno"> 194</span>&#160; <span class="keyword">typename</span> Warps_,</div><div class="line"><a name="l00195"></a><span class="lineno"> 195</span>&#160; <span class="keyword">typename</span> ThreadsPerWarp_,</div><div class="line"><a name="l00196"></a><span class="lineno"> 196</span>&#160; <span class="keyword">typename</span> InstructionShape_,</div><div class="line"><a name="l00197"></a><span class="lineno"> 197</span>&#160; <span class="keywordtype">int</span> kStages_,</div><div class="line"><a name="l00198"></a><span class="lineno"> 198</span>&#160; <span class="keywordtype">int</span> kScalarsPerLds_,</div><div class="line"><a name="l00199"></a><span class="lineno"> 199</span>&#160; <span class="keywordtype">int</span> kSkew_ = 0&gt;</div><div class="line"><a name="l00200"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html"> 200</a></span>&#160;<span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html">GemmSharedLoadTileBTraits</a> {</div><div class="line"><a name="l00201"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#afd4881aae69c8041d3931982d85f44e4"> 201</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0c">GemmOperand::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#afd4881aae69c8041d3931982d85f44e4">kOperand</a> = <a class="code" href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0caad0876342d150cef7da6ae149d5e99f9">GemmOperand::kB</a>;</div><div class="line"><a name="l00203"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a2a6065e583155b3e389253d3bfb64d73"> 203</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">platform::remove_const&lt;Scalar_&gt;::type</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a2a6065e583155b3e389253d3bfb64d73">Scalar</a>;</div><div class="line"><a name="l00205"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#afafb3d9ae470c8ef56ec4ca5e66e2182"> 205</a></span>&#160; <span class="keyword">typedef</span> Scalar_* <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#afafb3d9ae470c8ef56ec4ca5e66e2182">Pointer</a>;</div><div class="line"><a name="l00207"></a><span class="lineno"> 207</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;kStages_,</div><div class="line"><a name="l00208"></a><span class="lineno"> 208</span>&#160; OutputTile_::kD / InstructionShape_::kD,</div><div class="line"><a name="l00209"></a><span class="lineno"> 209</span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GetExtent.html">GetExtent&lt;kOperand, OutputTile_&gt;::kExtent</a> * InstructionShape_::kD&gt;</div><div class="line"><a name="l00210"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a3d8be9ddea1cab53d1b4b3d508f9eab8"> 210</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a3d8be9ddea1cab53d1b4b3d508f9eab8">TileWithoutSkew_</a>;</div><div class="line"><a name="l00212"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a69c7ec2a779718556e6d9119588e791c"> 212</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;kStages_, TileWithoutSkew_::kH, TileWithoutSkew_::kW + kSkew_&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a69c7ec2a779718556e6d9119588e791c">TileWithSkew</a>;</div><div class="line"><a name="l00214"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a1f35981a6d661635dfbcf7c7a76056a2"> 214</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">ReshapeTile&lt;TileWithoutSkew_, kScalarsPerLds_&gt;::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a1f35981a6d661635dfbcf7c7a76056a2">TileWithoutSkew</a>;</div><div class="line"><a name="l00216"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#ac242508ec46db0493a69a589dbfc19e4"> 216</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">ReshapeTile&lt;TileWithSkew, kScalarsPerLds_&gt;::Tile</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#ac242508ec46db0493a69a589dbfc19e4">Tile</a>;</div><div class="line"><a name="l00218"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a7ad7a4e33ed43926e165e66162eb620b"> 218</a></span>&#160; <span class="keyword">typedef</span> Warps_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a7ad7a4e33ed43926e165e66162eb620b">Warps</a>;</div><div class="line"><a name="l00220"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aed92656a074e915d97a1b6a990aeba66"> 220</a></span>&#160; <span class="keyword">typedef</span> ThreadsPerWarp_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aed92656a074e915d97a1b6a990aeba66">ThreadsPerWarp</a>;</div><div class="line"><a name="l00222"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aa41cc5dc82fe08457d103545f8f63081"> 222</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aa41cc5dc82fe08457d103545f8f63081">kAccessSize</a> = kScalarsPerLds_;</div><div class="line"><a name="l00224"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#ac9cd90ecd02809060a2fe6e2da4210f9"> 224</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#ac9cd90ecd02809060a2fe6e2da4210f9">kSkew</a> = kSkew_;</div><div class="line"><a name="l00226"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a7007093a4abf79a0b4bfb3fc85a02620"> 226</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">MemorySpace::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a7007093a4abf79a0b4bfb3fc85a02620">kMemorySpace</a> = <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">MemorySpace::kShared</a>;</div><div class="line"><a name="l00227"></a><span class="lineno"> 227</span>&#160;</div><div class="line"><a name="l00229"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a8b8d6a26a29d5477f526d9ce8c27e3e2"> 229</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a8b8d6a26a29d5477f526d9ce8c27e3e2">kWarps</a> = <a class="code" href="structcutlass_1_1gemm_1_1GetExtent.html">GetExtent&lt;kOperand, Warps&gt;::kExtent</a>;</div><div class="line"><a name="l00231"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a049b0bcdf8c5318ee84edeb1e42eaf78"> 231</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a049b0bcdf8c5318ee84edeb1e42eaf78">kThreadsPerWarp</a> = <a class="code" href="structcutlass_1_1gemm_1_1GetExtent.html">GetExtent&lt;kOperand, ThreadsPerWarp&gt;::kExtent</a>;</div><div class="line"><a name="l00232"></a><span class="lineno"> 232</span>&#160;</div><div class="line"><a name="l00234"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a27bc06b72a94e34d5da6fbfb950459b5"> 234</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1, 1, TileWithoutSkew::kW / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a8b8d6a26a29d5477f526d9ce8c27e3e2">kWarps</a> / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a049b0bcdf8c5318ee84edeb1e42eaf78">kThreadsPerWarp</a> <span class="comment">/* / kAccessSize*/</span>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a27bc06b72a94e34d5da6fbfb950459b5">Iterations</a>;</div><div class="line"><a name="l00236"></a><span class="lineno"> 236</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;TileWithSkew::kW * Warps::kD, 0, kWarps * kThreadsPerWarp * kAccessSize, 0&gt;</a></div><div class="line"><a name="l00237"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a99017ecc737060f53fd9804ea6f9583f"> 237</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a99017ecc737060f53fd9804ea6f9583f">ImmediateOffsetStrides</a>;</div><div class="line"><a name="l00238"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#adcede218eec980903221feb664cad3a1"> 238</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;TileWithSkew::kW * Warps::kD, 0, kWarps * kThreadsPerWarp * kAccessSize, 0&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#adcede218eec980903221feb664cad3a1">Delta</a>;</div><div class="line"><a name="l00239"></a><span class="lineno"> 239</span>&#160;</div><div class="line"><a name="l00241"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset.html"> 241</a></span>&#160; <span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset.html">ThreadOffset</a> {</div><div class="line"><a name="l00242"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset.html#a5b4a635a521364357386259b0f84c0ba"> 242</a></span>&#160; <a class="code" href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a> <a class="code" href="structcutlass_1_1Coord.html">Coord&lt;4&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset.html#a5b4a635a521364357386259b0f84c0ba">operator()</a>()<span class="keyword"> const </span>{</div><div class="line"><a name="l00243"></a><span class="lineno"> 243</span>&#160; <span class="comment">// Extract the warp.</span></div><div class="line"><a name="l00244"></a><span class="lineno"> 244</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp = threadIdx.x / kWarpSize;</div><div class="line"><a name="l00245"></a><span class="lineno"> 245</span>&#160; <span class="comment">// Extract the slice.</span></div><div class="line"><a name="l00246"></a><span class="lineno"> 246</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> slice = warp / (Warps::kH * Warps::kW);</div><div class="line"><a name="l00247"></a><span class="lineno"> 247</span>&#160; <span class="comment">// The warp in the slice.</span></div><div class="line"><a name="l00248"></a><span class="lineno"> 248</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp_in_slice = warp % (Warps::kH * Warps::kW);</div><div class="line"><a name="l00249"></a><span class="lineno"> 249</span>&#160; <span class="comment">// Compute the row offset for each warp.</span></div><div class="line"><a name="l00250"></a><span class="lineno"> 250</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp_col = warp_in_slice / Warps::kW;</div><div class="line"><a name="l00251"></a><span class="lineno"> 251</span>&#160; <span class="comment">// Compute the row offset for each thread.</span></div><div class="line"><a name="l00252"></a><span class="lineno"> 252</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> lane_col = (threadIdx.x &amp; 0x10) / 8 + (threadIdx.x &amp; 0x01);</div><div class="line"><a name="l00253"></a><span class="lineno"> 253</span>&#160; <span class="comment">// The offset.</span></div><div class="line"><a name="l00254"></a><span class="lineno"> 254</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> offset =</div><div class="line"><a name="l00255"></a><span class="lineno"> 255</span>&#160; slice * Tile::kW * Tile::kC + (warp_col * ThreadsPerWarp::kH + lane_col) * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aa41cc5dc82fe08457d103545f8f63081">kAccessSize</a>;</div><div class="line"><a name="l00256"></a><span class="lineno"> 256</span>&#160; <span class="comment">// Embed the offset in a 4D coordinate.</span></div><div class="line"><a name="l00257"></a><span class="lineno"> 257</span>&#160; <span class="keywordflow">return</span> <a class="code" href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">make_Coord</a>(0, 0, offset, 0);</div><div class="line"><a name="l00258"></a><span class="lineno"> 258</span>&#160; }</div><div class="line"><a name="l00259"></a><span class="lineno"> 259</span>&#160; };</div><div class="line"><a name="l00260"></a><span class="lineno"> 260</span>&#160;};</div><div class="line"><a name="l00261"></a><span class="lineno"> 261</span>&#160;</div><div class="line"><a name="l00263"></a><span class="lineno"> 263</span>&#160;</div><div class="line"><a name="l00264"></a><span class="lineno"> 264</span>&#160;<span class="keyword">template</span> &lt;<span class="keyword">typename</span> Scalar_,</div><div class="line"><a name="l00265"></a><span class="lineno"> 265</span>&#160; <span class="keyword">typename</span> OutputTile_,</div><div class="line"><a name="l00266"></a><span class="lineno"> 266</span>&#160; <span class="keyword">typename</span> Warps_,</div><div class="line"><a name="l00267"></a><span class="lineno"> 267</span>&#160; <span class="keyword">typename</span> ThreadsPerWarp_,</div><div class="line"><a name="l00268"></a><span class="lineno"> 268</span>&#160; <span class="keywordtype">int</span> kScalarsPerSts_,</div><div class="line"><a name="l00269"></a><span class="lineno"> 269</span>&#160; <span class="keywordtype">int</span> kSkew_ = 0&gt;</div><div class="line"><a name="l00270"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html"> 270</a></span>&#160;<span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html">GemmSharedStoreTileDTraits</a> {</div><div class="line"><a name="l00272"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9a2218b570dada2f1e3ccd8004c47856"> 272</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">platform::remove_const&lt;Scalar_&gt;::type</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9a2218b570dada2f1e3ccd8004c47856">Scalar</a>;</div><div class="line"><a name="l00274"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a20471c2f569c28538dad8a220ab25624"> 274</a></span>&#160; <span class="keyword">typedef</span> Scalar_* <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a20471c2f569c28538dad8a220ab25624">Pointer</a>;</div><div class="line"><a name="l00276"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ad52b81080731ee1f0d3c2c7eaba6f60d"> 276</a></span>&#160; <span class="keyword">typedef</span> OutputTile_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ad52b81080731ee1f0d3c2c7eaba6f60d">OutputTile</a>;</div><div class="line"><a name="l00278"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af4597927405d8bb1ad2c464fad064703"> 278</a></span>&#160; <span class="keyword">typedef</span> Warps_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af4597927405d8bb1ad2c464fad064703">Warps</a>;</div><div class="line"><a name="l00280"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#adf72ea773b8d4d3eb184f59c8cdf9543"> 280</a></span>&#160; <span class="keyword">typedef</span> ThreadsPerWarp_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#adf72ea773b8d4d3eb184f59c8cdf9543">ThreadsPerWarp</a>;</div><div class="line"><a name="l00282"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1"> 282</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1">kAccessSize</a> = kScalarsPerSts_;</div><div class="line"><a name="l00284"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a48baee6541e6359753f1bae5bd864029"> 284</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a48baee6541e6359753f1bae5bd864029">kSkew</a> = kSkew_;</div><div class="line"><a name="l00286"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a8914bc5154f21fa5fd182b0009c44c39"> 286</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">MemorySpace::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a8914bc5154f21fa5fd182b0009c44c39">kMemorySpace</a> = <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">MemorySpace::kShared</a>;</div><div class="line"><a name="l00287"></a><span class="lineno"> 287</span>&#160;</div><div class="line"><a name="l00289"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ae0b53d76096f9d34df6e16280565c7b1"> 289</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ae0b53d76096f9d34df6e16280565c7b1">kScalarsPerThread</a> = OutputTile_::kW / Warps::kW / ThreadsPerWarp::kW;</div><div class="line"><a name="l00291"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a05039ba8b7d9890903064b1a834dcd3e"> 291</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a05039ba8b7d9890903064b1a834dcd3e">kThreads</a> = <a class="code" href="structcutlass_1_1ShapeCount.html">ShapeCount&lt;Warps&gt;::kCount</a> * kWarpSize;</div><div class="line"><a name="l00293"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af1c981ec89a9cabaf5d34231d51a029c"> 293</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af1c981ec89a9cabaf5d34231d51a029c">kScalarsPerRow</a> = <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a05039ba8b7d9890903064b1a834dcd3e">kThreads</a> / 2 * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ae0b53d76096f9d34df6e16280565c7b1">kScalarsPerThread</a> + <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a48baee6541e6359753f1bae5bd864029">kSkew</a>;</div><div class="line"><a name="l00294"></a><span class="lineno"> 294</span>&#160;</div><div class="line"><a name="l00296"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a2bc41b907417b47f3dca9c3dd358f8bc"> 296</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1, 2, <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af1c981ec89a9cabaf5d34231d51a029c">kScalarsPerRow</a> / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1">kAccessSize</a>, <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1">kAccessSize</a>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a2bc41b907417b47f3dca9c3dd358f8bc">Tile</a>;</div><div class="line"><a name="l00298"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a6bacc866485330f80596f634e6d14336"> 298</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1, 1, <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ae0b53d76096f9d34df6e16280565c7b1">kScalarsPerThread</a> / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1">kAccessSize</a>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a6bacc866485330f80596f634e6d14336">Iterations</a>;</div><div class="line"><a name="l00300"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a5587ef22f419ab9a7c6117917cc99c57"> 300</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, 0, Warps::kW * ThreadsPerWarp::kW * kAccessSize&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a5587ef22f419ab9a7c6117917cc99c57">Delta</a>;</div><div class="line"><a name="l00302"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ac585815d08290d9a5a9cdbd611ffdac4"> 302</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;0, 0, Warps::kW * ThreadsPerWarp::kW * kAccessSize&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ac585815d08290d9a5a9cdbd611ffdac4">ImmediateOffsetStrides</a>;</div><div class="line"><a name="l00303"></a><span class="lineno"> 303</span>&#160;</div><div class="line"><a name="l00305"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset.html"> 305</a></span>&#160; <span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset.html">ThreadOffset</a> {</div><div class="line"><a name="l00306"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset.html#a4f9cca16303ac9ae29a0eaa11dcc23b6"> 306</a></span>&#160; <a class="code" href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a> <a class="code" href="structcutlass_1_1Coord.html">Coord&lt;4&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset.html#a4f9cca16303ac9ae29a0eaa11dcc23b6">operator()</a>()<span class="keyword"> const </span>{</div><div class="line"><a name="l00307"></a><span class="lineno"> 307</span>&#160; <span class="comment">// The warp.</span></div><div class="line"><a name="l00308"></a><span class="lineno"> 308</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp = threadIdx.x / kWarpSize;</div><div class="line"><a name="l00309"></a><span class="lineno"> 309</span>&#160;</div><div class="line"><a name="l00310"></a><span class="lineno"> 310</span>&#160; <span class="comment">// The position of the warp in the 2D tile.</span></div><div class="line"><a name="l00311"></a><span class="lineno"> 311</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp_row = warp % Warps::kW;</div><div class="line"><a name="l00312"></a><span class="lineno"> 312</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> warp_col = warp / Warps::kW;</div><div class="line"><a name="l00313"></a><span class="lineno"> 313</span>&#160;</div><div class="line"><a name="l00314"></a><span class="lineno"> 314</span>&#160; <span class="comment">// We assume that the elements are distributed in a warps as 4 columns of 8 elements. The</span></div><div class="line"><a name="l00315"></a><span class="lineno"> 315</span>&#160; <span class="comment">// columns are stored in threads col0=[0, 2, 4, 6, 8, 10, 12, 14], col1=[1, 3, 5, 7, .., 15],</span></div><div class="line"><a name="l00316"></a><span class="lineno"> 316</span>&#160; <span class="comment">// col2=[16, 18, 20, ..., 30] and col3=[17, 19, ..., 31].</span></div><div class="line"><a name="l00317"></a><span class="lineno"> 317</span>&#160; <span class="keywordtype">int</span> hi_halfwarp_offset = ((threadIdx.x &gt;&gt; 4) &amp; 0x1) * OutputTile::kW;</div><div class="line"><a name="l00318"></a><span class="lineno"> 318</span>&#160; <span class="keywordtype">int</span> lo_halfwarp_offset = ((threadIdx.x &gt;&gt; 1) &amp; 0x7) + ThreadsPerWarp::kW * warp_row;</div><div class="line"><a name="l00319"></a><span class="lineno"> 319</span>&#160;</div><div class="line"><a name="l00320"></a><span class="lineno"> 320</span>&#160; <span class="comment">// Odd threads go to the second half of shared memory.</span></div><div class="line"><a name="l00321"></a><span class="lineno"> 321</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> row = threadIdx.x &amp; 0x01;</div><div class="line"><a name="l00322"></a><span class="lineno"> 322</span>&#160; <span class="keywordtype">int</span> col = warp_col * (ThreadsPerWarp::kH / 2) * OutputTile::kW +</div><div class="line"><a name="l00323"></a><span class="lineno"> 323</span>&#160; lo_halfwarp_offset * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1">kAccessSize</a> + hi_halfwarp_offset;</div><div class="line"><a name="l00324"></a><span class="lineno"> 324</span>&#160; <span class="comment">// Embed the offset in a 4D coords.</span></div><div class="line"><a name="l00325"></a><span class="lineno"> 325</span>&#160; <span class="keywordflow">return</span> <a class="code" href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">make_Coord</a>(0, 0, row * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af1c981ec89a9cabaf5d34231d51a029c">kScalarsPerRow</a> + col, 0);</div><div class="line"><a name="l00326"></a><span class="lineno"> 326</span>&#160; }</div><div class="line"><a name="l00327"></a><span class="lineno"> 327</span>&#160; };</div><div class="line"><a name="l00328"></a><span class="lineno"> 328</span>&#160;};</div><div class="line"><a name="l00329"></a><span class="lineno"> 329</span>&#160;</div><div class="line"><a name="l00331"></a><span class="lineno"> 331</span>&#160;</div><div class="line"><a name="l00332"></a><span class="lineno"> 332</span>&#160;<span class="keyword">template</span> &lt;<span class="keyword">typename</span> Scalar_,</div><div class="line"><a name="l00333"></a><span class="lineno"> 333</span>&#160; <span class="keyword">typename</span> OutputTile_,</div><div class="line"><a name="l00334"></a><span class="lineno"> 334</span>&#160; <span class="keyword">typename</span> Warps_,</div><div class="line"><a name="l00335"></a><span class="lineno"> 335</span>&#160; <span class="keyword">typename</span> ThreadsPerWarp_,</div><div class="line"><a name="l00336"></a><span class="lineno"> 336</span>&#160; <span class="keywordtype">int</span> kTileH_,</div><div class="line"><a name="l00337"></a><span class="lineno"> 337</span>&#160; <span class="keywordtype">int</span> kScalarsPerLds_,</div><div class="line"><a name="l00338"></a><span class="lineno"> 338</span>&#160; <span class="keywordtype">int</span> kSkew_ = 0&gt;</div><div class="line"><a name="l00339"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html"> 339</a></span>&#160;<span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html">GemmSharedLoadTileDTraits</a> {</div><div class="line"><a name="l00341"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1b025cb056729706f36469e74a9799dc"> 341</a></span>&#160; <span class="keyword">typedef</span> <span class="keyword">typename</span> <a class="code" href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">platform::remove_const&lt;Scalar_&gt;::type</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1b025cb056729706f36469e74a9799dc">Scalar</a>;</div><div class="line"><a name="l00343"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1e72b69cf2147e4d194893a64417b920"> 343</a></span>&#160; <span class="keyword">typedef</span> Scalar_* <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1e72b69cf2147e4d194893a64417b920">Pointer</a>;</div><div class="line"><a name="l00345"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#acb16feebdcad5bbebe9d4d3383c37899"> 345</a></span>&#160; <span class="keyword">typedef</span> OutputTile_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#acb16feebdcad5bbebe9d4d3383c37899">OutputTile</a>;</div><div class="line"><a name="l00347"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4764f70691cb3fee91ce47653363aa4f"> 347</a></span>&#160; <span class="keyword">typedef</span> Warps_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4764f70691cb3fee91ce47653363aa4f">Warps</a>;</div><div class="line"><a name="l00349"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a9022ffc49b32503fd3639341e7e291a3"> 349</a></span>&#160; <span class="keyword">typedef</span> ThreadsPerWarp_ <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a9022ffc49b32503fd3639341e7e291a3">ThreadsPerWarp</a>;</div><div class="line"><a name="l00351"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d"> 351</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d">kAccessSize</a> = kScalarsPerLds_;</div><div class="line"><a name="l00353"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a7e9ce187e12575f0ecd39b2bfe13dddf"> 353</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a7e9ce187e12575f0ecd39b2bfe13dddf">kSkew</a> = kSkew_;</div><div class="line"><a name="l00355"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#afb4687520eff9c6a21c35a5e04f69de8"> 355</a></span>&#160; <span class="keyword">static</span> <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">MemorySpace::Kind</a> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#afb4687520eff9c6a21c35a5e04f69de8">kMemorySpace</a> = <a class="code" href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">MemorySpace::kShared</a>;</div><div class="line"><a name="l00356"></a><span class="lineno"> 356</span>&#160;</div><div class="line"><a name="l00358"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#abb5fdb164b09c8f74f92278f3d68b95f"> 358</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#abb5fdb164b09c8f74f92278f3d68b95f">kScalarsPerThread</a> = OutputTile_::kW / Warps::kW / ThreadsPerWarp::kW;</div><div class="line"><a name="l00360"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8325bc9d56155ecb6f2ddbd56f4ed23d"> 360</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8325bc9d56155ecb6f2ddbd56f4ed23d">kThreads</a> = <a class="code" href="structcutlass_1_1ShapeCount.html">ShapeCount&lt;Warps&gt;::kCount</a> * kWarpSize;</div><div class="line"><a name="l00362"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#aa3e378cabce9ed7f199c179c15a12ca4"> 362</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#aa3e378cabce9ed7f199c179c15a12ca4">kScalarsPerRow</a> = <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8325bc9d56155ecb6f2ddbd56f4ed23d">kThreads</a> / 2 * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#abb5fdb164b09c8f74f92278f3d68b95f">kScalarsPerThread</a> + <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a7e9ce187e12575f0ecd39b2bfe13dddf">kSkew</a>;</div><div class="line"><a name="l00363"></a><span class="lineno"> 363</span>&#160;</div><div class="line"><a name="l00366"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a63f980fea1ff3dd83ac276cfd83a4ce5"> 366</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;1, 2, <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#aa3e378cabce9ed7f199c179c15a12ca4">kScalarsPerRow</a> / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d">kAccessSize</a>, <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d">kAccessSize</a>&gt; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a63f980fea1ff3dd83ac276cfd83a4ce5">Tile</a>;</div><div class="line"><a name="l00367"></a><span class="lineno"> 367</span>&#160;</div><div class="line"><a name="l00368"></a><span class="lineno"> 368</span>&#160; <span class="comment">// Compute the number of iterations per warp in the Tile::kH dimension.</span></div><div class="line"><a name="l00369"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4b8d66df02ba1653aa6d1f23b967f237"> 369</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4b8d66df02ba1653aa6d1f23b967f237">kIterationsInHPerWarp</a> = kTileH_ / <a class="code" href="structcutlass_1_1ShapeCount.html">ShapeCount&lt;Warps&gt;::kCount</a>;</div><div class="line"><a name="l00370"></a><span class="lineno"> 370</span>&#160;</div><div class="line"><a name="l00371"></a><span class="lineno"> 371</span>&#160; <span class="comment">// As explained above, the shared memory tile is composed of 2 rows and each rows is made of</span></div><div class="line"><a name="l00372"></a><span class="lineno"> 372</span>&#160; <span class="comment">// kScalarsPerRow. A warp is expected to read from the 1st row, then move to the 2nd row and go</span></div><div class="line"><a name="l00373"></a><span class="lineno"> 373</span>&#160; <span class="comment">// back to the 1st row. To model that scheme we define the Iterations shape as Shape&lt;X, 2, ...&gt;.</span></div><div class="line"><a name="l00374"></a><span class="lineno"> 374</span>&#160; <span class="comment">// However, in some cases, we have only 1 iteration per warp. In that case, we must define the</span></div><div class="line"><a name="l00375"></a><span class="lineno"> 375</span>&#160; <span class="comment">// shape as Shape&lt;1, 1, ...&gt;. The following code does that except that we hijack the kH dimension</span></div><div class="line"><a name="l00376"></a><span class="lineno"> 376</span>&#160; <span class="comment">// to keep the number of elements to reduce for split-K.</span></div><div class="line"><a name="l00377"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a3b1a461c1dfbcd3817ab2d57bd0da9f1"> 377</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a3b1a461c1dfbcd3817ab2d57bd0da9f1">kIterationsH</a> = <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4b8d66df02ba1653aa6d1f23b967f237">kIterationsInHPerWarp</a> == 1 ? 1 : 2;</div><div class="line"><a name="l00378"></a><span class="lineno"> 378</span>&#160; <span class="comment">// As soon as we know kIterationsH, it is trivial to compute kIterationsD:</span></div><div class="line"><a name="l00379"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8663311646210b690bb0c2a1012e82f0"> 379</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8663311646210b690bb0c2a1012e82f0">kIterationsD</a> = <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4b8d66df02ba1653aa6d1f23b967f237">kIterationsInHPerWarp</a> / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a3b1a461c1dfbcd3817ab2d57bd0da9f1">kIterationsH</a>;</div><div class="line"><a name="l00380"></a><span class="lineno"> 380</span>&#160;</div><div class="line"><a name="l00381"></a><span class="lineno"> 381</span>&#160; <span class="comment">// If we have split-K enabled, we have to jump over the elements from the &quot;odd/even&quot; column of</span></div><div class="line"><a name="l00382"></a><span class="lineno"> 382</span>&#160; <span class="comment">// threads to grab the other elements.</span></div><div class="line"><a name="l00383"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a15438a44b588dc4cfd4b47c18af79cd2"> 383</a></span>&#160; <span class="keyword">static</span> <span class="keywordtype">int</span> <span class="keyword">const</span> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a15438a44b588dc4cfd4b47c18af79cd2">kSplitK</a> = OutputTile::kW * ThreadsPerWarp::kH / 2 * Warps::kH;</div><div class="line"><a name="l00384"></a><span class="lineno"> 384</span>&#160;</div><div class="line"><a name="l00386"></a><span class="lineno"> 386</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape</a>&lt;<a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8663311646210b690bb0c2a1012e82f0">kIterationsD</a>, <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a3b1a461c1dfbcd3817ab2d57bd0da9f1">kIterationsH</a>, OutputTile::kW / kWarpSize / <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d">kAccessSize</a>, Warps::kD&gt;</div><div class="line"><a name="l00387"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1b33700f904dd15e3533fec15d9d71bd"> 387</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1b33700f904dd15e3533fec15d9d71bd">Iterations</a>;</div><div class="line"><a name="l00389"></a><span class="lineno"> 389</span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;OutputTile::kW, kScalarsPerRow, kWarpSize * kAccessSize, kSplitK&gt;</a></div><div class="line"><a name="l00390"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a57b065abb737bee1c17398c90b5bc39b"> 390</a></span>&#160; <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a57b065abb737bee1c17398c90b5bc39b">ImmediateOffsetStrides</a>;</div><div class="line"><a name="l00392"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a2cd23d3b5e2cb64c6d5e9b1d6a78fbce"> 392</a></span>&#160; <span class="keyword">typedef</span> <a class="code" href="structcutlass_1_1Shape.html">Shape&lt;OutputTile::kW, kScalarsPerRow, kWarpSize * kAccessSize, kSplitK&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a2cd23d3b5e2cb64c6d5e9b1d6a78fbce">Delta</a>;</div><div class="line"><a name="l00393"></a><span class="lineno"> 393</span>&#160;</div><div class="line"><a name="l00395"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset.html"> 395</a></span>&#160; <span class="keyword">struct </span><a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset.html">ThreadOffset</a> {</div><div class="line"><a name="l00396"></a><span class="lineno"><a class="line" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset.html#ace1b936cab289c6884e673312283d422"> 396</a></span>&#160; <a class="code" href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a> <a class="code" href="structcutlass_1_1Coord.html">Coord&lt;4&gt;</a> <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset.html#ace1b936cab289c6884e673312283d422">operator()</a>()<span class="keyword"> const </span>{</div><div class="line"><a name="l00397"></a><span class="lineno"> 397</span>&#160; <span class="comment">// Each warp works on a different column.</span></div><div class="line"><a name="l00398"></a><span class="lineno"> 398</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> h = threadIdx.x / kWarpSize;</div><div class="line"><a name="l00399"></a><span class="lineno"> 399</span>&#160; <span class="comment">// Compute the row.</span></div><div class="line"><a name="l00400"></a><span class="lineno"> 400</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> w = (threadIdx.x &amp; (kWarpSize - 1)) * <a class="code" href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d">kAccessSize</a>;</div><div class="line"><a name="l00401"></a><span class="lineno"> 401</span>&#160; <span class="keywordtype">int</span> offset = 0;</div><div class="line"><a name="l00402"></a><span class="lineno"> 402</span>&#160; <span class="keywordflow">if</span> (<a class="code" href="structcutlass_1_1Shape.html#a3a20d9062bba613c160bb2cd14f80a5e">Iterations::kH</a> == 1) {</div><div class="line"><a name="l00403"></a><span class="lineno"> 403</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> row = h &amp; 0x1;</div><div class="line"><a name="l00404"></a><span class="lineno"> 404</span>&#160; <span class="keywordtype">int</span> <span class="keyword">const</span> col = h / 2;</div><div class="line"><a name="l00405"></a><span class="lineno"> 405</span>&#160; offset = row * <a class="code" href="structcutlass_1_1ShapeCount.html">ShapeCount&lt;Tile&gt;::kWc</a> + col * OutputTile::kW * <a class="code" href="structcutlass_1_1Shape.html#a19086a5567d6c710ec853e35a7f29c25">Iterations::kD</a> + w;</div><div class="line"><a name="l00406"></a><span class="lineno"> 406</span>&#160; } <span class="keywordflow">else</span> {</div><div class="line"><a name="l00407"></a><span class="lineno"> 407</span>&#160; offset = h * OutputTile::kW * <a class="code" href="structcutlass_1_1Shape.html#a19086a5567d6c710ec853e35a7f29c25">Iterations::kD</a> + w;</div><div class="line"><a name="l00408"></a><span class="lineno"> 408</span>&#160; }</div><div class="line"><a name="l00409"></a><span class="lineno"> 409</span>&#160; <span class="keywordflow">return</span> <a class="code" href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">make_Coord</a>(0, 0, offset, 0);</div><div class="line"><a name="l00410"></a><span class="lineno"> 410</span>&#160; }</div><div class="line"><a name="l00411"></a><span class="lineno"> 411</span>&#160; };</div><div class="line"><a name="l00412"></a><span class="lineno"> 412</span>&#160;};</div><div class="line"><a name="l00413"></a><span class="lineno"> 413</span>&#160;</div><div class="line"><a name="l00415"></a><span class="lineno"> 415</span>&#160;</div><div class="line"><a name="l00416"></a><span class="lineno"> 416</span>&#160;} <span class="comment">// namespace gemm</span></div><div class="line"><a name="l00417"></a><span class="lineno"> 417</span>&#160;} <span class="comment">// namespace cutlass</span></div><div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a846e6d8d06be0ba6fa41b1431c8ec061"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a846e6d8d06be0ba6fa41b1431c8ec061">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::kAccessSize</a></div><div class="ttdeci">static int const kAccessSize</div><div class="ttdoc">The number of scalars per STS. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:95</div></div>
<div class="ttc" id="structcutlass_1_1ComputeThreadOffsetFromStrides_html_a1744bfe277cbe0c642cce4a48c1dd9ad"><div class="ttname"><a href="structcutlass_1_1ComputeThreadOffsetFromStrides.html#a1744bfe277cbe0c642cce4a48c1dd9ad">cutlass::ComputeThreadOffsetFromStrides::get</a></div><div class="ttdeci">static CUTLASS_DEVICE int get()</div><div class="ttdef"><b>Definition:</b> shape.h:214</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_ac242508ec46db0493a69a589dbfc19e4"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#ac242508ec46db0493a69a589dbfc19e4">cutlass::gemm::GemmSharedLoadTileBTraits::Tile</a></div><div class="ttdeci">ReshapeTile&lt; TileWithSkew, kScalarsPerLds_ &gt;::Tile Tile</div><div class="ttdoc">The tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:216</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a9a00be672617162c4c7ac94c7d8980cc"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a9a00be672617162c4c7ac94c7d8980cc">cutlass::gemm::GemmSharedLoadTileATraits::Tile</a></div><div class="ttdeci">ReshapeTile&lt; TileWithSkew, kScalarsPerLds_ &gt;::Tile Tile</div><div class="ttdoc">The tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:144</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a1f35981a6d661635dfbcf7c7a76056a2"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a1f35981a6d661635dfbcf7c7a76056a2">cutlass::gemm::GemmSharedLoadTileBTraits::TileWithoutSkew</a></div><div class="ttdeci">ReshapeTile&lt; TileWithoutSkew_, kScalarsPerLds_ &gt;::Tile TileWithoutSkew</div><div class="ttdoc">The tile without skew after reshaping. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:214</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_afb4687520eff9c6a21c35a5e04f69de8"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#afb4687520eff9c6a21c35a5e04f69de8">cutlass::gemm::GemmSharedLoadTileDTraits::kMemorySpace</a></div><div class="ttdeci">static MemorySpace::Kind const kMemorySpace</div><div class="ttdoc">The memory space. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:355</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_abb5fdb164b09c8f74f92278f3d68b95f"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#abb5fdb164b09c8f74f92278f3d68b95f">cutlass::gemm::GemmSharedLoadTileDTraits::kScalarsPerThread</a></div><div class="ttdeci">static int const kScalarsPerThread</div><div class="ttdoc">The number of scalars per thread. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:358</div></div>
<div class="ttc" id="structcutlass_1_1MemorySpace_html_a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc"><div class="ttname"><a href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03ca2804339b2be64ff68ae3042073aaa7cc">cutlass::MemorySpace::kShared</a></div><div class="ttdef"><b>Definition:</b> load_store.h:41</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset_html_a5b4a635a521364357386259b0f84c0ba"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset.html#a5b4a635a521364357386259b0f84c0ba">cutlass::gemm::GemmSharedLoadTileBTraits::ThreadOffset::operator()</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 4 &gt; operator()() const</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:242</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a27bc06b72a94e34d5da6fbfb950459b5"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a27bc06b72a94e34d5da6fbfb950459b5">cutlass::gemm::GemmSharedLoadTileBTraits::Iterations</a></div><div class="ttdeci">Shape&lt; 1, 1, TileWithoutSkew::kW/kWarps/kThreadsPerWarp &gt; Iterations</div><div class="ttdoc">The number of iterations needed to load/store the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:234</div></div>
<div class="ttc" id="namespacecutlass_html"><div class="ttname"><a href="namespacecutlass.html">cutlass</a></div><div class="ttdef"><b>Definition:</b> convert.h:33</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a8b8d6a26a29d5477f526d9ce8c27e3e2"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a8b8d6a26a29d5477f526d9ce8c27e3e2">cutlass::gemm::GemmSharedLoadTileBTraits::kWarps</a></div><div class="ttdeci">static int const kWarps</div><div class="ttdoc">The number of warps. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:229</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html">cutlass::gemm::GemmSharedLoadTileATraits</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:128</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a5be0c995c57faafaad7ae55ae015fc00"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a5be0c995c57faafaad7ae55ae015fc00">cutlass::gemm::GemmSharedStoreTileAbTraits::Pointer</a></div><div class="ttdeci">Scalar_ * Pointer</div><div class="ttdoc">The pointer. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:42</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:80</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_af1c981ec89a9cabaf5d34231d51a029c"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af1c981ec89a9cabaf5d34231d51a029c">cutlass::gemm::GemmSharedStoreTileDTraits::kScalarsPerRow</a></div><div class="ttdeci">static int const kScalarsPerRow</div><div class="ttdoc">The number of scalars per row. We build a tile with 2 rows (to avoid bank conflicts). </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:293</div></div>
<div class="ttc" id="structcutlass_1_1platform_1_1remove__const_html_ac3662947fa50251daf58240a9c798085"><div class="ttname"><a href="structcutlass_1_1platform_1_1remove__const.html#ac3662947fa50251daf58240a9c798085">cutlass::platform::remove_const::type</a></div><div class="ttdeci">T type</div><div class="ttdef"><b>Definition:</b> platform.h:377</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a1b6956adc65254202864520b668edd14"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a1b6956adc65254202864520b668edd14">cutlass::gemm::GemmSharedLoadTileATraits::Scalar</a></div><div class="ttdeci">platform::remove_const&lt; Scalar_ &gt;::type Scalar</div><div class="ttdoc">The scalar. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:131</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset.html">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::ThreadOffset</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:106</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a6bacc866485330f80596f634e6d14336"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a6bacc866485330f80596f634e6d14336">cutlass::gemm::GemmSharedStoreTileDTraits::Iterations</a></div><div class="ttdeci">Shape&lt; 1, 1, kScalarsPerThread/kAccessSize &gt; Iterations</div><div class="ttdoc">The number of iterations needed to store the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:298</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_adcede218eec980903221feb664cad3a1"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#adcede218eec980903221feb664cad3a1">cutlass::gemm::GemmSharedLoadTileBTraits::Delta</a></div><div class="ttdeci">Shape&lt; TileWithSkew::kW *Warps::kD, 0, kWarps *kThreadsPerWarp *kAccessSize, 0 &gt; Delta</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:238</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a8d308d593b59624abe3e228d588be61d"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8d308d593b59624abe3e228d588be61d">cutlass::gemm::GemmSharedLoadTileDTraits::kAccessSize</a></div><div class="ttdeci">static int const kAccessSize</div><div class="ttdoc">The number of scalars per LDG/STG. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:351</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a0761c497c41a45652368fc0d54def98f"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0761c497c41a45652368fc0d54def98f">cutlass::gemm::GemmSharedLoadTileATraits::ThreadsPerWarp</a></div><div class="ttdeci">ThreadsPerWarp_ ThreadsPerWarp</div><div class="ttdoc">The threads in a warp. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:148</div></div>
<div class="ttc" id="structcutlass_1_1ReshapeTile_html"><div class="ttname"><a href="structcutlass_1_1ReshapeTile.html">cutlass::ReshapeTile</a></div><div class="ttdef"><b>Definition:</b> reshape_tile.h:42</div></div>
<div class="ttc" id="namespacecutlass_html_a7419519fa453a121dfa5f26bf87318d9"><div class="ttname"><a href="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">cutlass::make_Coord</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 1 &gt; make_Coord(int _0)</div><div class="ttdoc">Helper to make a 2-element coordinate. </div><div class="ttdef"><b>Definition:</b> coord.h:368</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_ae540e7ea7106552682aa4c97b833b3b1"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae540e7ea7106552682aa4c97b833b3b1">cutlass::gemm::GemmSharedStoreTileAbTraits::ThreadsStrides</a></div><div class="ttdeci">Shape&lt; 0, ShapeCount&lt; Tile &gt;::kWc, Tile::kC, kScalarsPerSts_ &gt; ThreadsStrides</div><div class="ttdoc">The strides to compute the base position of the thread. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:48</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a9521c4017e227b2511891a7fb18513e1"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9521c4017e227b2511891a7fb18513e1">cutlass::gemm::GemmSharedStoreTileDTraits::kAccessSize</a></div><div class="ttdeci">static int const kAccessSize</div><div class="ttdoc">The number of scalars per LDG/STG. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:282</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a7e9ce187e12575f0ecd39b2bfe13dddf"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a7e9ce187e12575f0ecd39b2bfe13dddf">cutlass::gemm::GemmSharedLoadTileDTraits::kSkew</a></div><div class="ttdeci">static int const kSkew</div><div class="ttdoc">The skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:353</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a7ad7a4e33ed43926e165e66162eb620b"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a7ad7a4e33ed43926e165e66162eb620b">cutlass::gemm::GemmSharedLoadTileBTraits::Warps</a></div><div class="ttdeci">Warps_ Warps</div><div class="ttdoc">The number of warps. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:218</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_ac9cd90ecd02809060a2fe6e2da4210f9"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#ac9cd90ecd02809060a2fe6e2da4210f9">cutlass::gemm::GemmSharedLoadTileBTraits::kSkew</a></div><div class="ttdeci">static int const kSkew</div><div class="ttdoc">The skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:224</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html">cutlass::gemm::GemmSharedStoreTileAbTraits</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:38</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a2a6065e583155b3e389253d3bfb64d73"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a2a6065e583155b3e389253d3bfb64d73">cutlass::gemm::GemmSharedLoadTileBTraits::Scalar</a></div><div class="ttdeci">platform::remove_const&lt; Scalar_ &gt;::type Scalar</div><div class="ttdoc">The scalar. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:203</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset.html">cutlass::gemm::GemmSharedLoadTileDTraits::ThreadOffset</a></div><div class="ttdoc">Computes the thread offset in (H, W) based on thread ID. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:395</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html">cutlass::gemm::GemmSharedLoadTileBTraits</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:200</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a4456e4c8048bfb378e5b80833a0d19e5"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4456e4c8048bfb378e5b80833a0d19e5">cutlass::gemm::GemmSharedLoadTileATraits::kMemorySpace</a></div><div class="ttdeci">static MemorySpace::Kind const kMemorySpace</div><div class="ttdoc">The memory space. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:155</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a8b04fd003fc2db46d749360e8838438b"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a8b04fd003fc2db46d749360e8838438b">cutlass::gemm::GemmSharedStoreTileAbTraits::Scalar</a></div><div class="ttdeci">platform::remove_const&lt; Scalar_ &gt;::type Scalar</div><div class="ttdoc">The scalar. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:40</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_af511bba9fc2125516eb1442b1c88d851"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af511bba9fc2125516eb1442b1c88d851">cutlass::gemm::GemmSharedLoadTileATraits::kOperand</a></div><div class="ttdeci">static GemmOperand::Kind const kOperand</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:129</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a8914bc5154f21fa5fd182b0009c44c39"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a8914bc5154f21fa5fd182b0009c44c39">cutlass::gemm::GemmSharedStoreTileDTraits::kMemorySpace</a></div><div class="ttdeci">static MemorySpace::Kind const kMemorySpace</div><div class="ttdoc">The memory space. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:286</div></div>
<div class="ttc" id="structcutlass_1_1MemorySpace_html_a1e031ec41668015a8fe4ba2c1145d03c"><div class="ttname"><a href="structcutlass_1_1MemorySpace.html#a1e031ec41668015a8fe4ba2c1145d03c">cutlass::MemorySpace::Kind</a></div><div class="ttdeci">Kind</div><div class="ttdef"><b>Definition:</b> load_store.h:39</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a69c7ec2a779718556e6d9119588e791c"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a69c7ec2a779718556e6d9119588e791c">cutlass::gemm::GemmSharedLoadTileBTraits::TileWithSkew</a></div><div class="ttdeci">Shape&lt; kStages_, TileWithoutSkew_::kH, TileWithoutSkew_::kW+kSkew_ &gt; TileWithSkew</div><div class="ttdoc">The tile with skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:212</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a0a33d4289ed45e988d560b5f73ac997e"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a0a33d4289ed45e988d560b5f73ac997e">cutlass::gemm::GemmSharedLoadTileATraits::kAccessSize</a></div><div class="ttdeci">static int const kAccessSize</div><div class="ttdoc">The number of scalars per LDG/STG. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:151</div></div>
<div class="ttc" id="structcutlass_1_1Shape_html_a3a20d9062bba613c160bb2cd14f80a5e"><div class="ttname"><a href="structcutlass_1_1Shape.html#a3a20d9062bba613c160bb2cd14f80a5e">cutlass::Shape::kH</a></div><div class="ttdeci">static int const kH</div><div class="ttdoc">The height of the cube. </div><div class="ttdef"><b>Definition:</b> shape.h:68</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a6125e052e47296c3ef53c8a149ffd31b"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a6125e052e47296c3ef53c8a149ffd31b">cutlass::gemm::GemmSharedStoreTileAbTraits::Iterations</a></div><div class="ttdeci">Shape&lt; 1, Tile::kH/Threads::kH, Tile::kW/Threads::kW, Tile::kC/Threads::kC/kAccessSize &gt; Iterations</div><div class="ttdoc">The number of iterations needed to load/store the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:61</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_aba6decf87d770becaadd610d9fc27491"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#aba6decf87d770becaadd610d9fc27491">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::kSkew</a></div><div class="ttdeci">static int const kSkew</div><div class="ttdoc">The skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:93</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_ae96e490d38ade6db4d853fb6c8f3378b"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#ae96e490d38ade6db4d853fb6c8f3378b">cutlass::gemm::GemmSharedLoadTileATraits::Iterations</a></div><div class="ttdeci">Shape&lt; 1, 1, TileWithoutSkew::kW/kWarps/kThreadsPerWarp &gt; Iterations</div><div class="ttdoc">The number of iterations needed to load/store the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:164</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_ad52b81080731ee1f0d3c2c7eaba6f60d"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ad52b81080731ee1f0d3c2c7eaba6f60d">cutlass::gemm::GemmSharedStoreTileDTraits::OutputTile</a></div><div class="ttdeci">OutputTile_ OutputTile</div><div class="ttdoc">The dimension of the output tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:276</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_aa3e378cabce9ed7f199c179c15a12ca4"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#aa3e378cabce9ed7f199c179c15a12ca4">cutlass::gemm::GemmSharedLoadTileDTraits::kScalarsPerRow</a></div><div class="ttdeci">static int const kScalarsPerRow</div><div class="ttdoc">The number of scalars per row. We build a tile with 2 rows (to avoid bank conflicts). </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:362</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_afafb3d9ae470c8ef56ec4ca5e66e2182"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#afafb3d9ae470c8ef56ec4ca5e66e2182">cutlass::gemm::GemmSharedLoadTileBTraits::Pointer</a></div><div class="ttdeci">Scalar_ * Pointer</div><div class="ttdoc">The pointer. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:205</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_adc4946dfbe914140c6852d0c05b30864"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#adc4946dfbe914140c6852d0c05b30864">cutlass::gemm::GemmSharedLoadTileATraits::Pointer</a></div><div class="ttdeci">Scalar_ * Pointer</div><div class="ttdoc">The pointer. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:133</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a20471c2f569c28538dad8a220ab25624"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a20471c2f569c28538dad8a220ab25624">cutlass::gemm::GemmSharedStoreTileDTraits::Pointer</a></div><div class="ttdeci">Scalar_ * Pointer</div><div class="ttdoc">The pointer. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:274</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_ae0b53d76096f9d34df6e16280565c7b1"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ae0b53d76096f9d34df6e16280565c7b1">cutlass::gemm::GemmSharedStoreTileDTraits::kScalarsPerThread</a></div><div class="ttdeci">static int const kScalarsPerThread</div><div class="ttdoc">The number of scalars per thread. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:289</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset_html_a4f9cca16303ac9ae29a0eaa11dcc23b6"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset.html#a4f9cca16303ac9ae29a0eaa11dcc23b6">cutlass::gemm::GemmSharedStoreTileDTraits::ThreadOffset::operator()</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 4 &gt; operator()() const</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:306</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a1b33700f904dd15e3533fec15d9d71bd"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1b33700f904dd15e3533fec15d9d71bd">cutlass::gemm::GemmSharedLoadTileDTraits::Iterations</a></div><div class="ttdeci">Shape&lt; kIterationsD, kIterationsH, OutputTile::kW/kWarpSize/kAccessSize, Warps::kD &gt; Iterations</div><div class="ttdoc">The number of iterations needed to store the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:387</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a59c981aa720f983b846bed7c3e4a7cab"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a59c981aa720f983b846bed7c3e4a7cab">cutlass::gemm::GemmSharedStoreTileAbTraits::kMemorySpace</a></div><div class="ttdeci">static MemorySpace::Kind const kMemorySpace</div><div class="ttdoc">The memory space. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:54</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_ace14ca9ad11e2cdafcd4a4b63c0df591"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ace14ca9ad11e2cdafcd4a4b63c0df591">cutlass::gemm::GemmSharedStoreTileAbTraits::kSkew</a></div><div class="ttdeci">static int const kSkew</div><div class="ttdoc">The skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:50</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a049b0bcdf8c5318ee84edeb1e42eaf78"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a049b0bcdf8c5318ee84edeb1e42eaf78">cutlass::gemm::GemmSharedLoadTileBTraits::kThreadsPerWarp</a></div><div class="ttdeci">static int const kThreadsPerWarp</div><div class="ttdoc">The number of threads in one dimension of the warp. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:231</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_1_1ThreadOffset.html">cutlass::gemm::GemmSharedLoadTileBTraits::ThreadOffset</a></div><div class="ttdoc">Computes the thread offset in (H, W) based on thread ID. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:241</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a39414f484da7f993bc96d61c97273614"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a39414f484da7f993bc96d61c97273614">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::ImmediateOffsetStrides</a></div><div class="ttdeci">Shape&lt; 0, ShapeCount&lt; Tile &gt;::kWc, Threads::kH *kAccessSize &gt; ImmediateOffsetStrides</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:104</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a2bc41b907417b47f3dca9c3dd358f8bc"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a2bc41b907417b47f3dca9c3dd358f8bc">cutlass::gemm::GemmSharedStoreTileDTraits::Tile</a></div><div class="ttdeci">Shape&lt; 1, 2, kScalarsPerRow/kAccessSize, kAccessSize &gt; Tile</div><div class="ttdoc">The tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:296</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_ae852c89da0455025c0c41af258e47047"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ae852c89da0455025c0c41af258e47047">cutlass::gemm::GemmSharedStoreTileAbTraits::kAccessSize</a></div><div class="ttdeci">static int const kAccessSize</div><div class="ttdoc">The number of scalars per LDG/STG. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:52</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_ab96f324083e51ce4c2b73c18803c69a7"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#ab96f324083e51ce4c2b73c18803c69a7">cutlass::gemm::GemmSharedStoreTileAbTraits::Tile</a></div><div class="ttdeci">ReshapeTile&lt; Tile_, kScalarsPerSts_ &gt;::Tile Tile</div><div class="ttdoc">The tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:44</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset.html">cutlass::gemm::GemmSharedStoreTileAbTraits::ThreadOffset</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:68</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a4b8d66df02ba1653aa6d1f23b967f237"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4b8d66df02ba1653aa6d1f23b967f237">cutlass::gemm::GemmSharedLoadTileDTraits::kIterationsInHPerWarp</a></div><div class="ttdeci">static int const kIterationsInHPerWarp</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:369</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a57b065abb737bee1c17398c90b5bc39b"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a57b065abb737bee1c17398c90b5bc39b">cutlass::gemm::GemmSharedLoadTileDTraits::ImmediateOffsetStrides</a></div><div class="ttdeci">Shape&lt; OutputTile::kW, kScalarsPerRow, kWarpSize *kAccessSize, kSplitK &gt; ImmediateOffsetStrides</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:390</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a48baee6541e6359753f1bae5bd864029"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a48baee6541e6359753f1bae5bd864029">cutlass::gemm::GemmSharedStoreTileDTraits::kSkew</a></div><div class="ttdeci">static int const kSkew</div><div class="ttdoc">The skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:284</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a5a5a36fc570e1225b20ce0a48c89d213"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a5a5a36fc570e1225b20ce0a48c89d213">cutlass::gemm::GemmSharedLoadTileATraits::TileWithoutSkew</a></div><div class="ttdeci">ReshapeTile&lt; TileWithoutSkew_, kScalarsPerLds_ &gt;::Tile TileWithoutSkew</div><div class="ttdoc">The tile without skew after reshaping. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:142</div></div>
<div class="ttc" id="gemm__operand_8h_html"><div class="ttname"><a href="gemm__operand_8h.html">gemm_operand.h</a></div><div class="ttdoc">Defines constant expressions for mapping GEMM problem size and strides onto pitch-linear memory...</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a027bebceeda2287b40915ffd95d494a7"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a027bebceeda2287b40915ffd95d494a7">cutlass::gemm::GemmSharedStoreTileAbTraits::ImmediateOffsetStrides</a></div><div class="ttdeci">Shape&lt; 0, Threads::kH *ShapeCount&lt; Tile &gt;::kWc, Threads::kW *kAccessSize &gt; ImmediateOffsetStrides</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:66</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a74196946c28e98ee60346b0eeede1471"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a74196946c28e98ee60346b0eeede1471">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::Tile</a></div><div class="ttdeci">ReshapeTile&lt; Shape&lt; Tile_::kD, Tile_::kH, Tile_::kW+kSkew_ &gt;, kScalarsPerSts_ &gt;::Tile Tile</div><div class="ttdoc">The tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:89</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a2053e4b9cb3ed2727c89960354ea0b29"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a2053e4b9cb3ed2727c89960354ea0b29">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::ThreadsStrides</a></div><div class="ttdeci">Shape&lt; 0, kScalarsPerSts_, ShapeCount&lt; Tile &gt;::kHwc/Threads::kW &gt; ThreadsStrides</div><div class="ttdoc">The strides to compute the base position of the thread. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:115</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a050cf5964a2d3683491bc4313ead5450"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a050cf5964a2d3683491bc4313ead5450">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::TileWithoutSkew</a></div><div class="ttdeci">ReshapeTile&lt; Tile_, kScalarsPerSts_ &gt;::Tile TileWithoutSkew</div><div class="ttdoc">The tile without skews. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:86</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a8663311646210b690bb0c2a1012e82f0"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8663311646210b690bb0c2a1012e82f0">cutlass::gemm::GemmSharedLoadTileDTraits::kIterationsD</a></div><div class="ttdeci">static int const kIterationsD</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:379</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a134a02091bf4360d2cbca56624e52024"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a134a02091bf4360d2cbca56624e52024">cutlass::gemm::GemmSharedLoadTileATraits::Delta</a></div><div class="ttdeci">Shape&lt; TileWithSkew::kW *Warps::kD, 0, kWarps *kThreadsPerWarp *kAccessSize, 0 &gt; Delta</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:168</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_af78a275086a297bd93aed920f57a17be"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#af78a275086a297bd93aed920f57a17be">cutlass::gemm::GemmSharedLoadTileATraits::kWarps</a></div><div class="ttdeci">static int const kWarps</div><div class="ttdoc">The number of warps. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:158</div></div>
<div class="ttc" id="structcutlass_1_1GemmOperand_html_ab209ea3de198efabe8e8707dfe8e0a0caad0876342d150cef7da6ae149d5e99f9"><div class="ttname"><a href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0caad0876342d150cef7da6ae149d5e99f9">cutlass::GemmOperand::kB</a></div><div class="ttdef"><b>Definition:</b> matrix_traits.h:357</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_adf72ea773b8d4d3eb184f59c8cdf9543"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#adf72ea773b8d4d3eb184f59c8cdf9543">cutlass::gemm::GemmSharedStoreTileDTraits::ThreadsPerWarp</a></div><div class="ttdeci">ThreadsPerWarp_ ThreadsPerWarp</div><div class="ttdoc">The threads in the warps. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:280</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset.html">cutlass::gemm::GemmSharedLoadTileATraits::ThreadOffset</a></div><div class="ttdoc">Computes the thread offset in (H, W) based on thread ID. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:171</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a93ae99460695718babaef6d1ef597e38"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a93ae99460695718babaef6d1ef597e38">cutlass::gemm::GemmSharedLoadTileATraits::TileWithoutSkew_</a></div><div class="ttdeci">Shape&lt; kStages_, OutputTile_::kD/InstructionShape_::kD, GetExtent&lt; kOperand, OutputTile_ &gt;::kExtent *InstructionShape_::kD &gt; TileWithoutSkew_</div><div class="ttdoc">The tile without skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:138</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html">cutlass::gemm::GemmSharedLoadTileDTraits</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:339</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a9bef06b59f27c6e673066a7f0280aa06"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a9bef06b59f27c6e673066a7f0280aa06">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::Threads</a></div><div class="ttdeci">Threads_ Threads</div><div class="ttdoc">The threads. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:91</div></div>
<div class="ttc" id="cutlass_8h_html_a28c2443a142676d3d71effdae1a986b1"><div class="ttname"><a href="cutlass_8h.html#a28c2443a142676d3d71effdae1a986b1">CUTLASS_HOST_DEVICE</a></div><div class="ttdeci">#define CUTLASS_HOST_DEVICE</div><div class="ttdef"><b>Definition:</b> cutlass.h:46</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_acb16feebdcad5bbebe9d4d3383c37899"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#acb16feebdcad5bbebe9d4d3383c37899">cutlass::gemm::GemmSharedLoadTileDTraits::OutputTile</a></div><div class="ttdeci">OutputTile_ OutputTile</div><div class="ttdoc">The dimension of the output tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:345</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_aaa439a0bb6b9de5e2722ea7b011effea"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#aaa439a0bb6b9de5e2722ea7b011effea">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::Scalar</a></div><div class="ttdeci">platform::remove_const&lt; Scalar_ &gt;::type Scalar</div><div class="ttdoc">The scalar. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:82</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_ac585815d08290d9a5a9cdbd611ffdac4"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#ac585815d08290d9a5a9cdbd611ffdac4">cutlass::gemm::GemmSharedStoreTileDTraits::ImmediateOffsetStrides</a></div><div class="ttdeci">Shape&lt; 0, 0, Warps::kW *ThreadsPerWarp::kW *kAccessSize &gt; ImmediateOffsetStrides</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:302</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_afd4881aae69c8041d3931982d85f44e4"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#afd4881aae69c8041d3931982d85f44e4">cutlass::gemm::GemmSharedLoadTileBTraits::kOperand</a></div><div class="ttdeci">static GemmOperand::Kind const kOperand</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:201</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a63f980fea1ff3dd83ac276cfd83a4ce5"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a63f980fea1ff3dd83ac276cfd83a4ce5">cutlass::gemm::GemmSharedLoadTileDTraits::Tile</a></div><div class="ttdeci">Shape&lt; 1, 2, kScalarsPerRow/kAccessSize, kAccessSize &gt; Tile</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:366</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a2cd23d3b5e2cb64c6d5e9b1d6a78fbce"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a2cd23d3b5e2cb64c6d5e9b1d6a78fbce">cutlass::gemm::GemmSharedLoadTileDTraits::Delta</a></div><div class="ttdeci">Shape&lt; OutputTile::kW, kScalarsPerRow, kWarpSize *kAccessSize, kSplitK &gt; Delta</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:392</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a4246185b8279f245ef5d0650c1eec14f"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a4246185b8279f245ef5d0650c1eec14f">cutlass::gemm::GemmSharedLoadTileATraits::kThreadsPerWarp</a></div><div class="ttdeci">static int const kThreadsPerWarp</div><div class="ttdoc">The number of threads in one dimension of the warp. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:160</div></div>
<div class="ttc" id="structcutlass_1_1Shape_html"><div class="ttname"><a href="structcutlass_1_1Shape.html">cutlass::Shape</a></div><div class="ttdoc">A Shape implementing Layout Concept describing the dimensions of a cube. </div><div class="ttdef"><b>Definition:</b> shape.h:64</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_ab883c2a8b90262152faca9cabe515dc4"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#ab883c2a8b90262152faca9cabe515dc4">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::Pointer</a></div><div class="ttdeci">Scalar_ * Pointer</div><div class="ttdoc">The pointer. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:84</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a15438a44b588dc4cfd4b47c18af79cd2"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a15438a44b588dc4cfd4b47c18af79cd2">cutlass::gemm::GemmSharedLoadTileDTraits::kSplitK</a></div><div class="ttdeci">static int const kSplitK</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:383</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_a025445699c5c86237d8c3e48f01081ea"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#a025445699c5c86237d8c3e48f01081ea">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::Iterations</a></div><div class="ttdeci">Shape&lt; 1, TileWithoutSkew::kH/Threads::kW, TileWithoutSkew::kW/Threads::kH &gt; Iterations</div><div class="ttdoc">The number of iterations needed to load/store the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:100</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a3d8be9ddea1cab53d1b4b3d508f9eab8"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a3d8be9ddea1cab53d1b4b3d508f9eab8">cutlass::gemm::GemmSharedLoadTileBTraits::TileWithoutSkew_</a></div><div class="ttdeci">Shape&lt; kStages_, OutputTile_::kD/InstructionShape_::kD, GetExtent&lt; kOperand, OutputTile_ &gt;::kExtent *InstructionShape_::kD &gt; TileWithoutSkew_</div><div class="ttdoc">The tile without skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:210</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a1acf2a1d8bf73fda142e7d82e05f00a2"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a1acf2a1d8bf73fda142e7d82e05f00a2">cutlass::gemm::GemmSharedStoreTileAbTraits::Threads</a></div><div class="ttdeci">Threads_ Threads</div><div class="ttdoc">The threads. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:46</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GetExtent_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GetExtent.html">cutlass::gemm::GetExtent</a></div><div class="ttdef"><b>Definition:</b> gemm_operand.h:50</div></div>
<div class="ttc" id="structcutlass_1_1Coord_html"><div class="ttname"><a href="structcutlass_1_1Coord.html">cutlass::Coord&lt; 4 &gt;</a></div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a99017ecc737060f53fd9804ea6f9583f"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a99017ecc737060f53fd9804ea6f9583f">cutlass::gemm::GemmSharedLoadTileBTraits::ImmediateOffsetStrides</a></div><div class="ttdeci">Shape&lt; TileWithSkew::kW *Warps::kD, 0, kWarps *kThreadsPerWarp *kAccessSize, 0 &gt; ImmediateOffsetStrides</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:237</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_html_a645f65f7d8f123936b286521df470224"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits.html#a645f65f7d8f123936b286521df470224">cutlass::gemm::GemmSharedStoreTileAbTraits::Delta</a></div><div class="ttdeci">Shape&lt; 0, Threads::kH *ShapeCount&lt; Tile &gt;::kWc, Threads::kW *kAccessSize &gt; Delta</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:63</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a8325bc9d56155ecb6f2ddbd56f4ed23d"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a8325bc9d56155ecb6f2ddbd56f4ed23d">cutlass::gemm::GemmSharedLoadTileDTraits::kThreads</a></div><div class="ttdeci">static int const kThreads</div><div class="ttdoc">The number of threads. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:360</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_aaff4a5e0f9e4256f184a22cad0ce8cf4"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#aaff4a5e0f9e4256f184a22cad0ce8cf4">cutlass::gemm::GemmSharedLoadTileATraits::Warps</a></div><div class="ttdeci">Warps_ Warps</div><div class="ttdoc">The number of warps. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:146</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_ae5a07814b9cfe9a64f69bac0f0772f20"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#ae5a07814b9cfe9a64f69bac0f0772f20">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::kMemorySpace</a></div><div class="ttdeci">static MemorySpace::Kind const kMemorySpace</div><div class="ttdoc">The memory space. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:97</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_a7007093a4abf79a0b4bfb3fc85a02620"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#a7007093a4abf79a0b4bfb3fc85a02620">cutlass::gemm::GemmSharedLoadTileBTraits::kMemorySpace</a></div><div class="ttdeci">static MemorySpace::Kind const kMemorySpace</div><div class="ttdoc">The memory space. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:226</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset_html_a1e357fe5bc1daef333e6be776a21a2ca"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileAbTraits_1_1ThreadOffset.html#a1e357fe5bc1daef333e6be776a21a2ca">cutlass::gemm::GemmSharedStoreTileAbTraits::ThreadOffset::operator()</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 4 &gt; operator()() const</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:70</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset_html_a51a325b435b9a53effaa003b3670e410"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_1_1ThreadOffset.html#a51a325b435b9a53effaa003b3670e410">cutlass::gemm::GemmSharedLoadTileATraits::ThreadOffset::operator()</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 4 &gt; operator()() const</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:172</div></div>
<div class="ttc" id="structcutlass_1_1Shape_html_a19086a5567d6c710ec853e35a7f29c25"><div class="ttname"><a href="structcutlass_1_1Shape.html#a19086a5567d6c710ec853e35a7f29c25">cutlass::Shape::kD</a></div><div class="ttdeci">static int const kD</div><div class="ttdoc">The depth of the cube. </div><div class="ttdef"><b>Definition:</b> shape.h:66</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_1_1ThreadOffset.html">cutlass::gemm::GemmSharedStoreTileDTraits::ThreadOffset</a></div><div class="ttdoc">Computes the thread offset in (H, W) based on thread ID. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:305</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a4764f70691cb3fee91ce47653363aa4f"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a4764f70691cb3fee91ce47653363aa4f">cutlass::gemm::GemmSharedLoadTileDTraits::Warps</a></div><div class="ttdeci">Warps_ Warps</div><div class="ttdoc">The warps in the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:347</div></div>
<div class="ttc" id="structcutlass_1_1ReshapeTile_html_a8d57fe6422aa920d9815a66e5a85b5f5"><div class="ttname"><a href="structcutlass_1_1ReshapeTile.html#a8d57fe6422aa920d9815a66e5a85b5f5">cutlass::ReshapeTile::Tile</a></div><div class="ttdeci">Tile_ Tile</div><div class="ttdef"><b>Definition:</b> reshape_tile.h:43</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_html_afd691b764b7d105a1ed41dada6049e71"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits.html#afd691b764b7d105a1ed41dada6049e71">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::Delta</a></div><div class="ttdeci">Shape&lt; 0, ShapeCount&lt; Tile &gt;::kWc, Threads::kH *kAccessSize &gt; Delta</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:102</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a3b1a461c1dfbcd3817ab2d57bd0da9f1"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a3b1a461c1dfbcd3817ab2d57bd0da9f1">cutlass::gemm::GemmSharedLoadTileDTraits::kIterationsH</a></div><div class="ttdeci">static int const kIterationsH</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:377</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a5587ef22f419ab9a7c6117917cc99c57"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a5587ef22f419ab9a7c6117917cc99c57">cutlass::gemm::GemmSharedStoreTileDTraits::Delta</a></div><div class="ttdeci">Shape&lt; 0, 0, Warps::kW *ThreadsPerWarp::kW *kAccessSize &gt; Delta</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:300</div></div>
<div class="ttc" id="structcutlass_1_1GemmOperand_html_ab209ea3de198efabe8e8707dfe8e0a0c"><div class="ttname"><a href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0c">cutlass::GemmOperand::Kind</a></div><div class="ttdeci">Kind</div><div class="ttdef"><b>Definition:</b> matrix_traits.h:357</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_aaffe67e519e919bf561142e05da6e6c8"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#aaffe67e519e919bf561142e05da6e6c8">cutlass::gemm::GemmSharedLoadTileATraits::kSkew</a></div><div class="ttdeci">static int const kSkew</div><div class="ttdoc">The skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:153</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a9022ffc49b32503fd3639341e7e291a3"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a9022ffc49b32503fd3639341e7e291a3">cutlass::gemm::GemmSharedLoadTileDTraits::ThreadsPerWarp</a></div><div class="ttdeci">ThreadsPerWarp_ ThreadsPerWarp</div><div class="ttdoc">The threads in the warps. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:349</div></div>
<div class="ttc" id="structcutlass_1_1GemmOperand_html_ab209ea3de198efabe8e8707dfe8e0a0cac2b9fe9e3679a059d1a6c946b2a2c31a"><div class="ttname"><a href="structcutlass_1_1GemmOperand.html#ab209ea3de198efabe8e8707dfe8e0a0cac2b9fe9e3679a059d1a6c946b2a2c31a">cutlass::GemmOperand::kA</a></div><div class="ttdef"><b>Definition:</b> matrix_traits.h:357</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a1e72b69cf2147e4d194893a64417b920"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1e72b69cf2147e4d194893a64417b920">cutlass::gemm::GemmSharedLoadTileDTraits::Pointer</a></div><div class="ttdeci">Scalar_ * Pointer</div><div class="ttdoc">The pointer. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:343</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a05039ba8b7d9890903064b1a834dcd3e"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a05039ba8b7d9890903064b1a834dcd3e">cutlass::gemm::GemmSharedStoreTileDTraits::kThreads</a></div><div class="ttdeci">static int const kThreads</div><div class="ttdoc">The number of threads. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:291</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_ad012add21d9393d136720f609467e121"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#ad012add21d9393d136720f609467e121">cutlass::gemm::GemmSharedLoadTileATraits::ImmediateOffsetStrides</a></div><div class="ttdeci">Shape&lt; TileWithSkew::kW *Warps::kD, 0, kWarps *kThreadsPerWarp *kAccessSize, 0 &gt; ImmediateOffsetStrides</div><div class="ttdoc">The strides in each dimension between different loads/stores. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:167</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_aed92656a074e915d97a1b6a990aeba66"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aed92656a074e915d97a1b6a990aeba66">cutlass::gemm::GemmSharedLoadTileBTraits::ThreadsPerWarp</a></div><div class="ttdeci">ThreadsPerWarp_ ThreadsPerWarp</div><div class="ttdoc">The threads in a warp. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:220</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_a9a2218b570dada2f1e3ccd8004c47856"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#a9a2218b570dada2f1e3ccd8004c47856">cutlass::gemm::GemmSharedStoreTileDTraits::Scalar</a></div><div class="ttdeci">platform::remove_const&lt; Scalar_ &gt;::type Scalar</div><div class="ttdoc">The scalar. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:272</div></div>
<div class="ttc" id="structcutlass_1_1ShapeCount_html"><div class="ttname"><a href="structcutlass_1_1ShapeCount.html">cutlass::ShapeCount</a></div><div class="ttdoc">Compute derived counted of a Layout Concept based class. </div><div class="ttdef"><b>Definition:</b> shape.h:79</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits_html_a72e0214f86cf8b3711d006dcd69d7a17"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileATraits.html#a72e0214f86cf8b3711d006dcd69d7a17">cutlass::gemm::GemmSharedLoadTileATraits::TileWithSkew</a></div><div class="ttdeci">Shape&lt; kStages_, TileWithoutSkew_::kH, TileWithoutSkew_::kW+kSkew_ &gt; TileWithSkew</div><div class="ttdoc">The tile with skew. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:140</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html_af4597927405d8bb1ad2c464fad064703"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html#af4597927405d8bb1ad2c464fad064703">cutlass::gemm::GemmSharedStoreTileDTraits::Warps</a></div><div class="ttdeci">Warps_ Warps</div><div class="ttdoc">The warps in the tile. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:278</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset_html_a4e35f0b2ca63a6b981230b73f843f726"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreWithSkewTileAbTraits_1_1ThreadOffset.html#a4e35f0b2ca63a6b981230b73f843f726">cutlass::gemm::GemmSharedStoreWithSkewTileAbTraits::ThreadOffset::operator()</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 4 &gt; operator()() const</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:107</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_html_a1b025cb056729706f36469e74a9799dc"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits.html#a1b025cb056729706f36469e74a9799dc">cutlass::gemm::GemmSharedLoadTileDTraits::Scalar</a></div><div class="ttdeci">platform::remove_const&lt; Scalar_ &gt;::type Scalar</div><div class="ttdoc">The scalar. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:341</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset_html_ace1b936cab289c6884e673312283d422"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileDTraits_1_1ThreadOffset.html#ace1b936cab289c6884e673312283d422">cutlass::gemm::GemmSharedLoadTileDTraits::ThreadOffset::operator()</a></div><div class="ttdeci">CUTLASS_HOST_DEVICE Coord&lt; 4 &gt; operator()() const</div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:396</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits_html"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedStoreTileDTraits.html">cutlass::gemm::GemmSharedStoreTileDTraits</a></div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:270</div></div>
<div class="ttc" id="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits_html_aa41cc5dc82fe08457d103545f8f63081"><div class="ttname"><a href="structcutlass_1_1gemm_1_1GemmSharedLoadTileBTraits.html#aa41cc5dc82fe08457d103545f8f63081">cutlass::gemm::GemmSharedLoadTileBTraits::kAccessSize</a></div><div class="ttdeci">static int const kAccessSize</div><div class="ttdoc">The number of scalars per LDG/STG. </div><div class="ttdef"><b>Definition:</b> gemm_shared_tile.h:222</div></div>
</div><!-- fragment --></div><!-- contents -->
<!-- start footer part -->
<hr class="footer"/><address class="footer"><small>
Generated on Fri Oct 26 2018 14:53:33 for Cutlass by &#160;<a href="http://www.doxygen.org/index.html">
<img class="footer" src="doxygen.png" alt="doxygen"/>
</a> 1.8.14
</small></address>
</body>
</html>