This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
1,360
Commits
1
Branch
0
Tags
24
MiB
9a31a817a8
Commit Graph
3 Commits
Author
SHA1
Message
Date
Cody Yu
c833101740
[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (
#4535
)
2024-05-09 18:04:17 -06:00
Simon Mo
c7f2cf2b7f
[CI] Reduce wheel size by not shipping debug symbols (
#4602
)
2024-05-04 21:28:58 -07:00
Simon Mo
021b1a2ab7
[CI] check size of the wheels (
#4319
)
2024-05-04 20:44:36 +00:00