This website requires JavaScript.
Explore
Help
Register
Sign In
squall
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
1
Packages
Projects
Releases
Wiki
Activity
8eadcf0b90
vllm
/
vllm
/
model_executor
History
Isotr0py
daef218b55
[Model] Initialize Phi-3-vision support (
#4986
)
2024-06-17 19:34:33 -07:00
..
guided_decoding
[Frontend][Core] Update Outlines Integration from
FSM
to
Guide
(
#4109
)
2024-06-05 16:49:12 -07:00
layers
[Speculative Decoding 1/2 ] Add typical acceptance sampling as one of the sampling techniques in the verifier (
#5131
)
2024-06-17 21:29:09 -05:00
model_loader
[mypy] Enable type checking for test directory (
#5017
)
2024-06-15 04:45:31 +00:00
models
[Model] Initialize Phi-3-vision support (
#4986
)
2024-06-17 19:34:33 -07:00
__init__.py
[Core] Refactor Attention Take 2 (
#3462
)
2024-03-25 04:39:33 +00:00
custom_op.py
[Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (
#3814
)
2024-06-17 11:01:25 -07:00
pooling_metadata.py
[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (
#3734
)
2024-05-11 11:30:37 -07:00
sampling_metadata.py
[Core] Avoid copying prompt/output tokens if no penalties are used (
#5289
)
2024-06-06 18:12:00 -07:00
utils.py
[Hardware][Neuron] Refactor neuron support (
#3471
)
2024-03-22 01:22:17 +00:00