vllm/kernels at 96b6f475dda40a0c7d557f73c36fe09c07be2e9c - vllm

History

Kunshang Ji 96b6f475dd Remove hardcoded `device="cuda"` to support more devices (#2503 ) Co-authored-by: Jiang Li <jiang1.li@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>		2024-02-01 15:46:39 -08:00
..
conftest.py	Support FP8-E5M2 KV Cache (#2279 )	2024-01-28 16:43:54 -08:00
test_activation.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00
test_attention.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00
test_cache.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00
test_layernorm.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00
test_moe.py	Add unit test for Mixtral MoE layer (#2677 )	2024-01-31 14:34:17 -08:00
test_pos_encoding.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00
test_prefix_prefill.py	Remove hardcoded `device="cuda"` to support more devices (#2503 )	2024-02-01 15:46:39 -08:00