vllm/csrc at 404422f42ed9c59ee816dacd9b54196a59ae65b2 - vllm

History

Woosuk Kwon 404422f42e [Model] Add support for MPT (#334 )		2023-07-03 16:47:53 -07:00
..
attention	[Model] Add support for MPT (#334 )	2023-07-03 16:47:53 -07:00
activation_kernels.cu	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
activation.cpp	Optimize data movement (#20 )	2023-04-02 00:30:17 -07:00
attention.cpp	Add support for BLOOM (#331 )	2023-07-03 13:12:35 -07:00
cache_kernels.cu	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
cache.cpp	Memcpy kernel for flash attention (#29 )	2023-04-10 18:22:49 -07:00
layernorm_kernels.cu	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
layernorm.cpp	Add custom kernel for RMS normalization (#16 )	2023-04-01 00:51:22 +08:00
pos_encoding_kernels.cu	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
pos_encoding.cpp	Add support for GPT-NeoX (Pythia) (#50 )	2023-04-28 00:32:10 -07:00
reduction_utils.cuh	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00