vllm/distributed at 8344f7742b794ca6ec9bcb891c178cd0551f23d0 - vllm

History

Lily Liu 43c413ec57 [Kernel] Use flashinfer for decoding (#4353 ) Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>		2024-05-03 15:51:27 -07:00
..
test_basic_distributed_correctness.py	[Kernel] Use flashinfer for decoding (#4353 )	2024-05-03 15:51:27 -07:00
test_chunked_prefill_distributed.py	[Core][5/N] Fully working chunked prefill e2e (#3884 )	2024-04-10 17:56:48 -07:00
test_comm_ops.py	[Core][Refactor] move parallel_utils into vllm/distributed (#3950 )	2024-04-10 15:33:30 -07:00
test_custom_all_reduce.py	[Core][Refactor] move parallel_utils into vllm/distributed (#3950 )	2024-04-10 15:33:30 -07:00
test_pynccl_library.py	[Core] nccl integrity check and test (#4155 )	2024-04-17 22:28:52 -07:00
test_pynccl.py	[Core][Distributed] enable allreduce for multiple tp groups (#4566 )	2024-05-02 17:32:33 -07:00