vllm/tests at 9d9072a069202e7892a40ef94e9085019e73f370 - vllm

History

Zhuohan Li 9d9072a069 Implement prompt logprobs & Batched topk for computing logprobs (#1328 ) Co-authored-by: Yunmo Chen <16273544+wanmok@users.noreply.github.com>		2023-10-16 10:56:50 -07:00
..
async_engine	Implement prompt logprobs & Batched topk for computing logprobs (#1328 )	2023-10-16 10:56:50 -07:00
distributed	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )	2023-10-02 15:36:09 -07:00
engine	TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )	2023-10-02 15:36:09 -07:00
kernels	Implement PagedAttention V2 (#1348 )	2023-10-16 00:59:57 -07:00
models	Add tests for models (#922 )	2023-09-01 11:19:43 +09:00
samplers	Implement prompt logprobs & Batched topk for computing logprobs (#1328 )	2023-10-16 10:56:50 -07:00
conftest.py	Implement prompt logprobs & Batched topk for computing logprobs (#1328 )	2023-10-16 10:56:50 -07:00