vllm/tests/basic_correctness
Lily Liu 7041de4384
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>
2024-06-28 15:28:49 -07:00
..
__init__.py [CI/Build] Move test_utils.py to tests/utils.py (#4425) 2024-05-13 23:50:09 +09:00
test_basic_correctness.py [Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628) 2024-06-28 15:28:49 -07:00
test_chunked_prefill.py [CI/Test] improve robustness of test (vllm_runner) (#5357) 2024-06-08 08:59:20 +00:00
test_preemption.py [CI/Test] improve robustness of test (vllm_runner) (#5357) 2024-06-08 08:59:20 +00:00