vllm/tests/basic_correctness
Luka Govedič 7937009a7e
[Kernel] Replaced blockReduce[...] functions with cub::BlockReduce (#7233)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-21 20:18:00 -04:00
__init__.py | [CI/Build] Move test_utils.py to tests/utils.py (#4425) | 2024-05-13 23:50:09 +09:00
test_basic_correctness.py | [core][distributed] simplify code to support pipeline parallel (#6406) | 2024-07-14 21:20:51 -07:00
test_chunked_prefill.py | [Kernel] Replaced blockReduce[...] functions with cub::BlockReduce (#7233) | 2024-08-21 20:18:00 -04:00
test_cpu_offload.py | [CI] Move quantization cpu offload tests out of fastcheck (#7574) | 2024-08-15 21:16:20 -07:00
test_preemption.py | [Core] Optimize SPMD architecture with delta + serialization optimization (#7109) | 2024-08-18 17:57:20 -07:00