| distributed | Implement custom all reduce kernels (#2192) | 2024-01-27 12:46:35 -08:00 |
| engine | Migrate linter from pylint to ruff (#1665) | 2023-11-20 11:58:01 -08:00 |
| entrypoints | Support Batch Completion in Server (#2529) | 2024-01-24 17:11:07 -08:00 |
| kernels | Support FP8-E5M2 KV Cache (#2279) | 2024-01-28 16:43:54 -08:00 |
| lora | [Experimental] Add multi-LoRA support (#1804) | 2024-01-23 15:26:37 -08:00 |
| models | Add StableLM3B model (#2372) | 2024-01-16 20:32:40 -08:00 |
| samplers | [Experimental] Add multi-LoRA support (#1804) | 2024-01-23 15:26:37 -08:00 |
| worker | [Experimental] Add multi-LoRA support (#1804) | 2024-01-23 15:26:37 -08:00 |