Yujia Zhai 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							cc3c29a81a 
							
						 
					 
					
						
						
							
							CUTLASS 3.6.0 ( #1850 )  
						
						... 
						
						
						
						* v3.6
* update changelog
* update readme
* fix typo
* fixing typos
* hopper gemm with weight prefetch
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com> 
						
					 
					
						2024-10-09 15:33:27 -04:00 
						 
				 
			
				
					
						
							
							
								reed 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2991ce18d3 
							
						 
					 
					
						
						
							
							Add print_svg for mma ( #1733 )  
						
						... 
						
						
						
						* add print_svg for mma
* correct the code indentation 
						
					 
					
						2024-09-18 10:37:24 -04:00 
						 
				 
			
				
					
						
							
							
								Vijay Thakkar 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							be60a0b272 
							
						 
					 
					
						
						
							
							CUTLASS 3.5.1 ( #1623 )  
						
						... 
						
						
						
						* CUTLASS 3.5.1
* updates, optimizations, fixes 
						
					 
					
						2024-07-29 08:46:24 -04:00 
						 
				 
			
				
					
						
							
							
								Andy Lo 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							81b06ee0e0 
							
						 
					 
					
						
						
							
							Fix B operand variable name and comments ( #1458 )  
						
						
						
					 
					
						2024-07-10 11:06:29 -04:00 
						 
				 
			
				
					
						
							
							
								Vijay Thakkar 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							7d49e6c7e2 
							
						 
					 
					
						
						
							
							Updates for CUTLASS 3.5.0 ( #1468 )  
						
						
						
					 
					
						2024-04-11 21:33:40 -04:00 
						 
				 
			
				
					
						
							
							
								Vijay Thakkar 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							629f4653c3 
							
						 
					 
					
						
						
							
							CUTLASS 3.5.0 ( #1411 )  
						
						
						
					 
					
						2024-03-19 17:51:04 -04:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							bbe579a9e3 
							
						 
					 
					
						
						
							
							Updates for CUTLASS 3.4.1 ( #1346 )  
						
						... 
						
						
						
						* Updates for CUTLASS 3.4.1
* minor epi change 
						
					 
					
						2024-02-15 15:48:34 -05:00 
						 
				 
			
				
					
						
							
							
								reed 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							092f14db05 
							
						 
					 
					
						
						
							
							fix tile_size_mnk compilation warning ( #1294 )  
						
						
						
					 
					
						2024-01-29 21:21:15 -05:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							751eb9a885 
							
						 
					 
					
						
						
							
							Update license year ( #1306 )  
						
						
						
					 
					
						2024-01-16 14:37:22 -05:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2f589ffa76 
							
						 
					 
					
						
						
							
							Updates for 3.4 release. ( #1305 )  
						
						
						
					 
					
						2024-01-16 13:42:51 -05:00 
						 
				 
			
				
					
						
							
							
								Pradeep Ramani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							8236f30675 
							
						 
					 
					
						
						
							
							CUTLASS 3.4.0 ( #1286 )  
						
						... 
						
						
						
						* CUTLASS 3.4.0
* Update CHANGELOG.md
---------
Co-authored-by: Pradeep Ramani <prramani@nvidia.com> 
						
					 
					
						2023-12-29 15:21:31 -05:00 
						 
				 
			
				
					
						
							
							
								Christian Sigg 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							2375a07d01 
							
						 
					 
					
						
						
							
							Qualify calls to make_fragment_? from templated base class. ( #1196 )  
						
						... 
						
						
						
						Fixes clang build error. 
						
					 
					
						2023-12-01 09:52:57 -05:00 
						 
				 
			
				
					
						
							
							
								Pradeep Ramani 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							c008b4aea8 
							
						 
					 
					
						
						
							
							CUTLASS 3.3.0 ( #1167 )  
						
						... 
						
						
						
						* Release 3.3.0
Adds support for mixed precision GEMMs On Hopper and Ampere
Adds support for < 16B aligned GEMMs on Hopper
Enhancements to EVT
Enhancements to Python interface
Enhancements to Sub-byte type handling in CuTe
Several other bug-fixes and performance improvements.
* minor doc update 
						
					 
					
						2023-11-02 11:09:05 -04:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							90d3b0fb18 
							
						 
					 
					
						
						
							
							CUTLASS 3.2.1 ( #1113 )  
						
						... 
						
						
						
						* Updates for 3.2.1 release.
* Minor fix in gemm op profiler for raster order.
* Add scheduler mapping for raster order in the kernels. 
						
					 
					
						2023-09-26 17:24:26 -04:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							4575443d44 
							
						 
					 
					
						
						
							
							CUTLASS 3.2 ( #1024 )  
						
						... 
						
						
						
						* CUTLASS 3.2 
						
					 
					
						2023-08-07 20:50:32 -04:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							f079619f5e 
							
						 
					 
					
						
						
							
							More updates for 3.1 ( #958 )  
						
						... 
						
						
						
						* Updates for 3.1
* Minor change
* doc link fix
* Minor updates 
						
					 
					
						2023-05-24 10:17:16 -04:00 
						 
				 
			
				
					
						
							
							
								ANIKET SHIVAM 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							d572cc1aab 
							
						 
					 
					
						
						
							
							CUTLASS 3.1 ( #915 )  
						
						... 
						
						
						
						Co-authored-by: Aniket Shivam <ashivam@nvidia.com> 
						
					 
					
						2023-04-14 23:19:34 -04:00 
						 
				 
			
				
					
						
							
							
								Vijay Thakkar 
							
						 
					 
					
						
						
							
							
						
						
						
							
						
						
							277bd6e537 
							
						 
					 
					
						
						
							
							CUTLASS 3.0.0 ( #786 )  
						
						... 
						
						
						
						* CUTLASS 3.0.0 
						
					 
					
						2023-01-23 20:55:28 -05:00