vllm/cacheflow/parallel_utils
tensor_parallel Optimize tensor parallel execution speed (#17) 2023-04-01 00:51:08 +08:00
__init__.py Support tensor parallel (#2) 2023-03-21 13:45:42 -07:00
parallel_state.py Support tensor parallel (#2) 2023-03-21 13:45:42 -07:00
README.md Support tensor parallel (#2) 2023-03-21 13:45:42 -07:00
utils.py Support tensor parallel (#2) 2023-03-21 13:45:42 -07:00

The files in this folder are ported from Megatron-LM. Only the code needed for inference is kept.