| Name | Last commit message | Last commit date |
| --- | --- | --- |
| http_frontend | New weight loader without np copy (#52) | 2023-05-03 15:32:04 +08:00 |
| master | Add a system logger (#85) | 2023-05-08 23:03:35 -07:00 |
| models | Add a system logger (#85) | 2023-05-08 23:03:35 -07:00 |
| worker | Replace FlashAttention with xformers (#70) | 2023-05-05 02:01:08 -07:00 |
| block.py | Support beam search & parallel generation (#7) | 2023-03-10 09:58:21 -08:00 |
| logger.py | Add a system logger (#85) | 2023-05-08 23:03:35 -07:00 |
| sampling_params.py | FastAPI-based working frontend (#10) | 2023-03-29 14:48:56 +08:00 |
| utils.py | FastAPI-based working frontend (#10) | 2023-03-29 14:48:56 +08:00 |