vllm/tests/samplers
2024-03-25 04:39:33 +00:00
test_beam_search.py | [Core] Refactor Attention Take 2 (#3462) | 2024-03-25 04:39:33 +00:00
test_logprobs.py | Re-enable the 80 char line width limit (#3305) | 2024-03-10 19:49:14 -07:00
test_rejection_sampler.py | Remove hardcoded device="cuda" to support more devices (#2503) | 2024-02-01 15:46:39 -08:00
test_sampler.py | Migrate logits computation and gather to model_runner (#3233) | 2024-03-20 23:25:01 +00:00
test_seeded_generate.py | Support per-request seed (#2514) | 2024-02-21 11:47:00 -08:00