| Name | Last commit | Date |
|---|---|---|
| attention | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| core | [mypy] Enable type checking for test directory (#5017) | 2024-06-15 04:45:31 +00:00 |
| distributed | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| engine | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| entrypoints | [Fix] Use utf-8 encoding in entrypoints/openai/run_batch.py (#5606) | 2024-06-17 23:16:10 +00:00 |
| executor | [bugfix][distributed] fix 16 gpus local rank arrangement (#5604) | 2024-06-17 21:35:04 +00:00 |
| logging | [MISC] Rework logger to enable pythonic custom logging configuration to be provided (#4273) | 2024-05-01 17:34:40 -07:00 |
| lora | [mypy] Enable type checking for test directory (#5017) | 2024-06-15 04:45:31 +00:00 |
| model_executor | [Model] Initialize Phi-3-vision support (#4986) | 2024-06-17 19:34:33 -07:00 |
| multimodal | [Model] Initialize Phi-3-vision support (#4986) | 2024-06-17 19:34:33 -07:00 |
| spec_decode | [mypy] Enable type checking for test directory (#5017) | 2024-06-15 04:45:31 +00:00 |
| transformers_utils | [mypy] Enable type checking for test directory (#5017) | 2024-06-15 04:45:31 +00:00 |
| usage | [Misc] Add vLLM version getter to utils (#5098) | 2024-06-13 11:21:39 -07:00 |
| worker | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| __init__.py | [Misc] Add vLLM version getter to utils (#5098) | 2024-06-13 11:21:39 -07:00 |
| _custom_ops.py | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| _ipex_ops.py | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| block.py | [misc][typo] fix typo (#5620) | 2024-06-17 20:54:57 -07:00 |
| config.py | [Bugfix] Fix KV head calculation for MPT models when using GQA (#5142) | 2024-06-17 15:26:41 -07:00 |
| envs.py | [Hardware] Initial TPU integration (#5292) | 2024-06-12 11:53:03 -07:00 |
| inputs.py | [Bugfix] TYPE_CHECKING for MultiModalData (#5444) | 2024-06-12 14:08:52 -07:00 |
| logger.py | [Misc] add logging level env var (#5045) | 2024-05-24 23:49:49 -07:00 |
| outputs.py | [Core] Consolidate prompt arguments to LLM engines (#4328) | 2024-05-28 13:29:31 -07:00 |
| pooling_params.py | [Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734) | 2024-05-11 11:30:37 -07:00 |
| py.typed | Add py.typed so consumers of vLLM can get type checking (#1509) | 2023-10-30 14:50:47 -07:00 |
| sampling_params.py | [Core]: Option To Use Prompt Token Ids Inside Logits Processor (#4985) | 2024-05-23 22:04:24 +00:00 |
| sequence.py | [mypy] Enable type checking for test directory (#5017) | 2024-06-15 04:45:31 +00:00 |
| utils.py | [Hardware][Intel GPU] Add Intel GPU(XPU) inference backend (#3814) | 2024-06-17 11:01:25 -07:00 |
| version.py | bump version to v0.5.0.post1 (#5522) | 2024-06-13 19:42:06 -07:00 |