|
__init__.py
|
Support tensor parallel (#2)
|
2023-03-21 13:45:42 -07:00 |
|
activation.py
|
Optimize data movement (#20)
|
2023-04-02 00:30:17 -07:00 |
|
attention.py
|
Optimize data movement (#20)
|
2023-04-02 00:30:17 -07:00 |
|
input_metadata.py
|
Optimize data movement (#20)
|
2023-04-02 00:30:17 -07:00 |
|
llama.py
|
Optimize data movement (#20)
|
2023-04-02 00:30:17 -07:00 |
|
model_utils.py
|
Implement LLaMA (#9)
|
2023-03-30 12:25:32 +08:00 |
|
opt.py
|
Optimize data movement (#20)
|
2023-04-02 00:30:17 -07:00 |
|
utils.py
|
FastAPI-based working frontend (#10)
|
2023-03-29 14:48:56 +08:00 |