vllm/vllm/model_executor/model_loader
2024-06-15 04:45:31 +00:00
..
__init__.py [Misc] Enhance attention selector (#4751) 2024-05-13 10:47:25 -07:00
loader.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
neuron.py [Typing] Mypy typing part 2 (#4043) 2024-04-17 17:28:43 -07:00
tensorizer.py [Frontend] [Core] Support for sharded tensorized models (#4990) 2024-06-12 14:13:52 -07:00
utils.py [Kernel] FP8 support for MoE kernel / Mixtral (#4244) 2024-04-24 01:18:23 +00:00
weight_utils.py [mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00