This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
2,110
Commits
1
Branch
0
Tags
24
MiB
60d1c6e584
Commit Graph
3 Commits
Author
SHA1
Message
Date
Lucas Wilkinson
55712941e5
[Bug Fix] Illegal memory access, FP8 Llama 3.1 405b (
#6852
)
2024-07-27 02:27:44 +00:00
Tyler Michael Smith
703475f6c2
[Kernel] Fix CUTLASS 3.x custom broadcast load epilogue (
#5516
)
2024-06-14 09:30:15 -07:00
Tyler Michael Smith
260d119e86
[Kernel] Refactor CUTLASS kernels to always take scales that reside on the GPU (
#5137
)
2024-06-01 06:45:32 +00:00