squall/vllm
44dcb52e39
vllm/tests/basic_correctness
Latest commit: Michael Goin, 460c1884e3, [Bugfix] Support cpu offloading with fp8 quantization (#6960), 2024-07-31 12:47:46 -07:00
File | Last commit | Date
__init__.py | [CI/Build] Move test_utils.py to tests/utils.py (#4425) | 2024-05-13 23:50:09 +09:00
test_basic_correctness.py | [core][distributed] simplify code to support pipeline parallel (#6406) | 2024-07-14 21:20:51 -07:00
test_chunked_prefill.py | [CI/Build] Reuse code for checking output consistency (#5988) | 2024-06-30 11:44:25 +08:00
test_cpu_offload.py | [Bugfix] Support cpu offloading with fp8 quantization (#6960) | 2024-07-31 12:47:46 -07:00
test_preemption.py | [Core] Pipeline Parallel Support (#4412) | 2024-07-02 10:58:08 -07:00