Co-authored-by: Robert Irvine <robert@seamlessml.com> Co-authored-by: root <rirv938@gmail.com> Co-authored-by: Casper <casperbh.96@gmail.com> Co-authored-by: julian-q <julianhquevedo@gmail.com> |
||
|---|---|---|
| .. | ||
| attention | ||
| quantization/awq | ||
| activation_kernels.cu | ||
| activation.cpp | ||
| attention.cpp | ||
| cache_kernels.cu | ||
| cache.cpp | ||
| dispatch_utils.h | ||
| layernorm_kernels.cu | ||
| layernorm.cpp | ||
| pos_encoding_kernels.cu | ||
| pos_encoding.cpp | ||
| quantization.cpp | ||
| reduction_utils.cuh | ||