ANIKET SHIVAM
|
66d9cddc83
|
New updates for 2.11 (#775)
* New updates.
* Minor profiler updates
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2023-01-20 16:32:57 -05:00 |
|
Aditya Atluri
|
c975e2ccbb
|
releaase 2.11 (#703)
|
2022-11-19 09:02:15 -05:00 |
|
Yujia Zhai
|
b1d3f9b2fd
|
upstream internal updates (#616)
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2022-09-04 23:05:09 -04:00 |
|
ANIKET SHIVAM
|
b72cbf957d
|
CUTLASS 2.10 (#615)
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2022-09-03 18:48:46 -04:00 |
|
Yujia Zhai
|
04a9777b87
|
Softmax (#546)
* add test layernorm g-mem version
* Delete include/configure directory
* Delete examples/test_layernorm directory
* Update gemm_with_softmax.h
* Update gemm_softmax.cu
* Update linear_combination.h
* Update fast_math.h
* remove redundant vars
Co-authored-by: yujia.zhai <yujia.zhai@bytedance.com>
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2022-07-02 01:19:18 -04:00 |
|
Andrew Kerr
|
12f4108ac2
|
CUTLASS 2.9 (#468)
|
2022-04-23 15:02:38 -04:00 |
|