Co-authored-by: Chen Shen <scv119@gmail.com> Co-authored-by: Shreyas Krishnaswamy <shrekris@anyscale.com> Co-authored-by: Avnish Narayan <avnish@anyscale.com> |
||
|---|---|---|
| .. | ||
| attention | ||
| punica | ||
| quantization | ||
| activation_kernels.cu | ||
| cache_kernels.cu | ||
| cache.h | ||
| cuda_compat.h | ||
| cuda_utils_kernels.cu | ||
| cuda_utils.h | ||
| dispatch_utils.h | ||
| layernorm_kernels.cu | ||
| ops.h | ||
| pos_encoding_kernels.cu | ||
| pybind.cpp | ||
| reduction_utils.cuh | ||