vllm/tests/basic_correctness
2024-08-12 22:47:41 +00:00
..
__init__.py [CI/Build] Move test_utils.py to tests/utils.py (#4425) 2024-05-13 23:50:09 +09:00
test_basic_correctness.py [core][distributed] simplify code to support pipeline parallel (#6406) 2024-07-14 21:20:51 -07:00
test_chunked_prefill.py [Core/Bugfix] Add FP8 K/V Scale and dtype conversion for prefix/prefill Triton Kernel (#7208) 2024-08-12 22:47:41 +00:00
test_cpu_offload.py [Bugfix] Fix GPTQ and GPTQ Marlin CPU Offloading (#7225) 2024-08-06 18:34:26 -07:00
test_preemption.py [Core] Pipeline Parallel Support (#4412) 2024-07-02 10:58:08 -07:00