vllm/tests/distributed
2024-05-12 17:47:59 -07:00
test_basic_distributed_correctness.py [Kernel] Use flashinfer for decoding (#4353) 2024-05-03 15:51:27 -07:00
test_chunked_prefill_distributed.py [Core][5/N] Fully working chunked prefill e2e (#3884) 2024-04-10 17:56:48 -07:00
test_comm_ops.py [Core][Distributed] refactor custom allreduce to support multiple tp groups (#4754) 2024-05-12 17:47:59 -07:00
test_custom_all_reduce.py [Core][Distributed] refactor custom allreduce to support multiple tp groups (#4754) 2024-05-12 17:47:59 -07:00
test_pynccl_library.py [Core] nccl integrity check and test (#4155) 2024-04-17 22:28:52 -07:00
test_pynccl.py [Core][Distributed] refactor custom allreduce to support multiple tp groups (#4754) 2024-05-12 17:47:59 -07:00