|
engine
|
Use TGI-like incremental detokenization (#984)
|
2023-09-13 13:38:01 -07:00 |
|
kernels
|
Use FP32 in RoPE initialization (#1004)
|
2023-09-11 00:26:35 -07:00 |
|
models
|
Add tests for models (#922)
|
2023-09-01 11:19:43 +09:00 |
|
conftest.py
|
Use queue for finished requests (#957)
|
2023-09-05 19:27:23 -07:00 |