| activation_kernels.cu | Optimize data movement (#20) | 2023-04-02 00:30:17 -07:00 |
| activation.cpp | Optimize data movement (#20) | 2023-04-02 00:30:17 -07:00 |
| attention_kernels.cu | Support block size 32 (#35) | 2023-04-09 23:07:18 -07:00 |
| cache_kernels.cu | Memcpy kernel for flash attention (#29) | 2023-04-10 18:22:49 -07:00 |
| cache.cpp | Memcpy kernel for flash attention (#29) | 2023-04-10 18:22:49 -07:00 |
| pos_encoding_kernels.cu | Optimize data movement (#20) | 2023-04-02 00:30:17 -07:00 |
| pos_encoding.cpp | Optimize data movement (#20) | 2023-04-02 00:30:17 -07:00 |