vllm/tests/distributed
2024-05-10 15:14:40 -07:00
..
test_basic_distributed_correctness.py [Kernel] Use flashinfer for decoding (#4353) 2024-05-03 15:51:27 -07:00
test_chunked_prefill_distributed.py [Core][5/N] Fully working chunked prefill e2e (#3884) 2024-04-10 17:56:48 -07:00
test_comm_ops.py [Core][Distributed] support cpu&device in broadcast tensor dict (#4660) 2024-05-07 19:34:47 -07:00
test_custom_all_reduce.py [Core][Test] fix function name typo in custom allreduce (#4750) 2024-05-10 15:14:40 -07:00
test_pynccl_library.py [Core] nccl integrity check and test (#4155) 2024-04-17 22:28:52 -07:00
test_pynccl.py [Core][Distributed] refactor pynccl (#4591) 2024-05-09 19:48:43 -07:00