vllm/tests/kernels
Lily Liu 43c413ec57
[Kernel] Use flashinfer for decoding (#4353)
Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>
2024-05-03 15:51:27 -07:00
allclose_default.py | [ROCm] Fix some kernels failed unit tests (#2498) | 2024-02-05 14:25:36 -08:00
conftest.py | [Kernel] Use flashinfer for decoding (#4353) | 2024-05-03 15:51:27 -07:00
test_activation.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00
test_attention.py | [Core][Model runner refactoring 1/N] Refactor attn metadata term (#4518) | 2024-05-03 10:20:12 -07:00
test_cache.py | [Kernel] Use flashinfer for decoding (#4353) | 2024-05-03 15:51:27 -07:00
test_layernorm.py | [Kernel] Layernorm performance optimization (#3662) | 2024-03-30 14:26:38 -07:00
test_moe.py | [Core] Set linear_weights directly on the layer (#3977) | 2024-04-11 16:35:51 -04:00
test_pos_encoding.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00
test_prefix_prefill.py | [Core][Model runner refactoring 1/N] Refactor attn metadata term (#4518) | 2024-05-03 10:20:12 -07:00
test_rand.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00
test_sampler.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00