|
aqlm
|
AQLM CUDA support (#3287)
|
2024-04-23 13:59:33 -04:00 |
|
awq
|
Refactor 2 awq gemm kernels into m16nXk32 (#2723)
|
2024-02-12 11:02:17 -08:00 |
|
fp8
|
[Kernel] Make static FP8 scaling more robust (#4570)
|
2024-05-06 17:39:28 -07:00 |
|
marlin
|
[Bugfix] Fix marlin kernel crash on H100 (#4218)
|
2024-04-24 10:35:01 -07:00 |