* Align top_p and top_k with huggingface
* remove _get_prompt_and_output_tokens
* rename _apply_top_p_top_k
* compare top_p top_k with hf
* fix test errors

| Name | | |
|---|---|---|
| async_engine | ||
| distributed | ||
| engine | ||
| kernels | ||
| models | ||
| prompts | ||
| samplers | ||
| worker | ||
| __init__.py | ||
| conftest.py | ||
| test_regression.py | ||