vllm/core at 8279078e218833b357f7c5076850e3688714d570 - vllm

History

Zhuohan Li 8279078e21 [Bugfix] Remove deprecated @abstractproperty (#5174 )		2024-06-01 22:40:25 +00:00
..
block	[Core] Cross-attention KV caching and memory-management (towards eventual encoder/decoder model support) (#4837 )	2024-05-29 16:09:13 +00:00
__init__.py	Change the name to vLLM (#150 )	2023-06-17 03:07:40 -07:00
block_manager_v1.py	[Core] Cross-attention KV caching and memory-management (towards eventual encoder/decoder model support) (#4837 )	2024-05-29 16:09:13 +00:00
block_manager_v2.py	[Core] Cross-attention KV caching and memory-management (towards eventual encoder/decoder model support) (#4837 )	2024-05-29 16:09:13 +00:00
embedding_model_block_manager.py	[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734 )	2024-05-11 11:30:37 -07:00
evictor_v1.py	[Bugfix] Remove deprecated @abstractproperty (#5174 )	2024-06-01 22:40:25 +00:00
evictor_v2.py	[Bugfix] Remove deprecated @abstractproperty (#5174 )	2024-06-01 22:40:25 +00:00
interfaces.py	[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734 )	2024-05-11 11:30:37 -07:00
policy.py	[Chunked Prefill][4/n] Chunked prefill scheduler. (#3853 )	2024-04-05 10:17:58 -07:00
scheduler.py	[Core] Fix scheduler considering "no LoRA" as "LoRA" (#4897 )	2024-05-20 17:48:32 -07:00