<ahref="batched__reduction__traits_8h.html">Go to the documentation of this file.</a><divclass="fragment"><divclass="line"><aname="l00001"></a><spanclass="lineno"> 1</span> <spanclass="comment">/***************************************************************************************************</span></div><divclass="line"><aname="l00002"></a><spanclass="lineno"> 2</span> <spanclass="comment">* Copyright (c) 2017-2019, NVIDIA CORPORATION. All rights reserved.</span></div><divclass="line"><aname="l00003"></a><spanclass="lineno"> 3</span> <spanclass="comment">*</span></div><divclass="line"><aname="l00004"></a><spanclass="lineno"> 4</span> <spanclass="comment">* Redistribution and use in source and binary forms, with or without modification, are permitted</span></div><divclass="line"><aname="l00005"></a><spanclass="lineno"> 5</span> <spanclass="comment">* provided that the following conditions are met:</span></div><divclass="line"><aname="l00006"></a><spanclass="lineno"> 6</span> <spanclass="comment">* * Redistributions of source code must retain the above copyright notice, this list of</span></div><divclass="line"><aname="l00007"></a><spanclass="lineno"> 7</span> <spanclass="comment">* conditions and the following disclaimer.</span></div><divclass="line"><aname="l00008"></a><spanclass="lineno"> 8</span> <spanclass="comment">* * Redistributions in binary form must reproduce the above copyright notice, this list of</span></div><divclass="line"><aname="l00009"></a><spanclass="lineno"> 9</span> <spanclass="comment">* conditions and the following disclaimer in the documentation and/or other materials</span></div><divclass="line"><aname="l00010"></a><spanclass="lineno"> 10</span> <spanclass="comment">* provided with the distribution.</span></div><divclass="line"><aname="l00011"></a><spanclass="lineno"> 11</span> <spanclass="comment">* * Neither the name of the NVIDIA CORPORATION nor the names of its contributors may be 
used</span></div><divclass="line"><aname="l00012"></a><spanclass="lineno"> 12</span> <spanclass="comment">* to endorse or promote products derived from this software without specific prior written</span></div><divclass="line"><aname="l00013"></a><spanclass="lineno"> 13</span> <spanclass="comment">* permission.</span></div><divclass="line"><aname="l00014"></a><spanclass="lineno"> 14</span> <spanclass="comment">*</span></div><divclass="line"><aname="l00015"></a><spanclass="lineno"> 15</span> <spanclass="comment">* THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR</span></div><divclass="line"><aname="l00016"></a><spanclass="lineno"> 16</span> <spanclass="comment">* IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND</span></div><divclass="line"><aname="l00017"></a><spanclass="lineno"> 17</span> <spanclass="comment">* FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL NVIDIA CORPORATION BE LIABLE</span></div><divclass="line"><aname="l00018"></a><spanclass="lineno"> 18</span> <spanclass="comment">* FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING,</span></div><divclass="line"><aname="l00019"></a><spanclass="lineno"> 19</span> <spanclass="comment">* BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS;</span></div><divclass="line"><aname="l00020"></a><spanclass="lineno"> 20</span> <spanclass="comment">* OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT,</span></div><divclass="line"><aname="l00021"></a><spanclass="lineno"> 21</span> <spanclass="comment">* STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE</span></div><divclass="line"><aname="l00022"></a><
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_ab35edeae5cd8767bd376fad5f6680e25"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#ab35edeae5cd8767bd376fad5f6680e25">cutlass::reduction::BatchedReductionTraits::kThreads</a></div><divclass="ttdeci">static int const kThreads</div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:122</div></div>
<divclass="ttc"id="namespacecutlass_html_a7419519fa453a121dfa5f26bf87318d9"><divclass="ttname"><ahref="namespacecutlass.html#a7419519fa453a121dfa5f26bf87318d9">cutlass::make_Coord</a></div><divclass="ttdeci">CUTLASS_HOST_DEVICE Coord< 1 > make_Coord(int _0)</div><divclass="ttdoc">Helper to make a 1-element coordinate. </div><divclass="ttdef"><b>Definition:</b> coord.h:387</div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_ae7e468b1d372b4b807e2e1089af885ec"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#ae7e468b1d372b4b807e2e1089af885ec">cutlass::reduction::BatchedReductionTraits::ScalarAccum</a></div><divclass="ttdeci">ScalarAccum_ ScalarAccum</div><divclass="ttdoc">The type for accumulation. </div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:109</div></div>
<divclass="ttc"id="reduction_2threadblock__swizzle_8h_html"><divclass="ttname"><ahref="reduction_2threadblock__swizzle_8h.html">threadblock_swizzle.h</a></div><divclass="ttdoc">Defines functors for mapping blockIdx to partitions of the batched reduction computation. </div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_1_1Params_html_a5d1463d473d4226b0d19c581b16ed3b2"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits_1_1Params.html#a5d1463d473d4226b0d19c581b16ed3b2">cutlass::reduction::BatchedReductionTraits::Params::reduction_stride</a></div><divclass="ttdeci">long long int reduction_stride</div><divclass="ttdoc">Stride between two elements that will be summed. </div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:146</div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_a00c71c9a18aaad84f4a48023dbbb454e"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#a00c71c9a18aaad84f4a48023dbbb454e">cutlass::reduction::BatchedReductionTraits::ReductionSize</a></div><divclass="ttdeci">static const int ReductionSize</div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:115</div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_af11a3284195a24e580d2f379f179f05a"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#af11a3284195a24e580d2f379f179f05a">cutlass::reduction::BatchedReductionTraits::maxInReg</a></div><divclass="ttdeci">static int const maxInReg</div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:124</div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_ab4f5f457dbfa6bd250a4c34e1d573a85"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#ab4f5f457dbfa6bd250a4c34e1d573a85">cutlass::reduction::BatchedReductionTraits::ThreadShapeMultiple2</a></div><divclass="ttdeci">static const bool ThreadShapeMultiple2</div><divclass="ttdoc">Checks whether threadShape is a multiple of 2. </div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:117</div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_ac28e31791c5888bbe7b04abe6376a422"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#ac28e31791c5888bbe7b04abe6376a422">cutlass::reduction::BatchedReductionTraits::maxOutReg</a></div><divclass="ttdeci">static int const maxOutReg</div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:126</div></div>
<divclass="ttc"id="batched__reduction_8h_html"><divclass="ttname"><ahref="batched__reduction_8h.html">batched_reduction.h</a></div><divclass="ttdoc">Implements a software-pipelined efficient batched reduction. D = alpha * Reduction(A) + beta * C...</div></div>
<divclass="ttc"id="cutlass_8h_html"><divclass="ttname"><ahref="cutlass_8h.html">cutlass.h</a></div><divclass="ttdoc">Basic include for CUTLASS. </div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_1_1Params_html_ac27f42beb3625c5183b76b26677c0cb0"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits_1_1Params.html#ac27f42beb3625c5183b76b26677c0cb0">cutlass::reduction::BatchedReductionTraits::Params::initialize</a></div><divclass="ttdeci">CUTLASS_HOST_DEVICE int initialize(Index m_, Index n_, ScalarAlphaBeta alpha_, ScalarAlphaBeta beta_, long long int reduction_stride_, ScalarA const *d_a_, Index lda_, ScalarC const *d_c_, Index ldc_, ScalarD *d_d_, Index ldd_)</div><divclass="ttdoc">Initialize the parameters for 2D output tensor. </div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:162</div></div>
<divclass="ttc"id="structcutlass_1_1reduction_1_1BatchedReductionTraits_html_a085c72d54426f5eb60f5bffa9c383229"><divclass="ttname"><ahref="structcutlass_1_1reduction_1_1BatchedReductionTraits.html#a085c72d54426f5eb60f5bffa9c383229">cutlass::reduction::BatchedReductionTraits::KernelClass</a></div><divclass="ttdeci">cutlass::reduction::BatchedReduction< This_ > KernelClass</div><divclass="ttdoc">The struct that consumes this Traits. </div><divclass="ttdef"><b>Definition:</b> batched_reduction_traits.h:93</div></div>
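The entries above describe a reduction of the form D = alpha * Reduction(A) + beta * C, where the summed elements of A lie `reduction_stride` apart. A minimal host-side sketch of that arithmetic may help make the parameters concrete. It is not CUTLASS code: the function name `reference_batched_reduction`, the `reduction_size` argument (standing in for the compile-time `ReductionSize`), and the row-major indexing through `lda`, `ldc`, and `ldd` are all assumptions made for illustration, and the real kernel may lay out its operands differently.

```cpp
#include <cassert>
#include <vector>

// Hypothetical host-side reference, not part of CUTLASS.
// Computes D = alpha * Reduction(A) + beta * C for an m-by-n output, where
// Reduction sums reduction_size slices of A spaced reduction_stride elements
// apart. Row-major storage is ASSUMED for illustration only.
template <typename T>
void reference_batched_reduction(int m, int n, int reduction_size,
                                 T alpha, T beta,
                                 long long reduction_stride,
                                 T const *a, int lda,
                                 T const *c, int ldc,
                                 T *d, int ldd) {
  for (int i = 0; i < m; ++i) {
    for (int j = 0; j < n; ++j) {
      // Sum the same (i, j) position across every slice of A.
      T accum = T(0);
      for (int k = 0; k < reduction_size; ++k) {
        accum += a[k * reduction_stride + static_cast<long long>(i) * lda + j];
      }
      // Scale the sum and blend in the source tensor C.
      d[static_cast<long long>(i) * ldd + j] =
          alpha * accum + beta * c[static_cast<long long>(i) * ldc + j];
    }
  }
}
```

With two 2-by-2 slices stored back to back, `reduction_stride` would be 4, so element (0, 0) of the output combines `a[0]` and `a[4]`.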