vllm/tests/kernels
2024-09-18 10:38:11 +00:00
..
__init__.py [CI/Build] Move test_utils.py to tests/utils.py (#4425) 2024-05-13 23:50:09 +09:00
allclose_default.py [ROCm] Fix some kernels failed unit tests (#2498) 2024-02-05 14:25:36 -08:00
conftest.py [Kernel] Use flashinfer for decoding (#4353) 2024-05-03 15:51:27 -07:00
quant_utils.py [Feature][Hardware][Amd] Add fp8 Linear Layer for Rocm (#7210) 2024-08-16 10:06:30 -07:00
test_activation.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_attention_selector.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_attention.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_awq_triton.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_blocksparse_attention.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_cache.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_causal_conv1d.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_cutlass.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_encoder_decoder_attn.py [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) 2024-08-06 16:51:47 -04:00
test_flash_attn.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_flashinfer.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_fp8_quant.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_gguf.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_int8_quant.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_layernorm.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_machete_gemm.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_mamba_ssm.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_marlin_gemm.py [Kernel][Misc] register ops to prevent graph breaks (#6917) 2024-09-11 12:52:19 -07:00
test_moe.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_pos_encoding.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
test_prefix_prefill.py [CI/Build] Avoid CUDA initialization (#8534) 2024-09-18 10:38:11 +00:00
utils.py [CI/Build] Reorganize models tests (#7820) 2024-09-13 10:20:06 -07:00