This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
2,182
Commits
1
Branch
0
Tags
24
MiB
fb2c1c86c1
Commit Graph
3 Commits
Author
SHA1
Message
Date
Michael Goin
460c1884e3
[Bugfix] Support cpu offloading with fp8 quantization (
#6960
)
2024-07-31 12:47:46 -07:00
Matt Wong
06d6c5fe9f
[Bugfix][CI/Build][Hardware][AMD] Fix AMD tests, add HF cache, update CK FA, add partially supported model notes (
#6543
)
2024-07-20 09:39:07 -07:00
youkaichao
f53b8f0d05
[ci][test] add correctness test for cpu offloading (
#6549
)
2024-07-18 23:41:06 +00:00