vllm/vllm/triton_utils
Thomas Parnell eaec4b9153
[Bugfix] Add custom Triton cache manager to resolve MoE MP issue (#6140)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Chih-Chieh-Yang <chih.chieh.yang@ibm.com>
2024-07-15 10:12:47 -07:00
..
__init__.py [Bugfix] Add custom Triton cache manager to resolve MoE MP issue (#6140) 2024-07-15 10:12:47 -07:00
custom_cache_manager.py [Bugfix] Add custom Triton cache manager to resolve MoE MP issue (#6140) 2024-07-15 10:12:47 -07:00