vllm.v1.attention.ops.flashmla ¶
is_flashmla_dense_supported ¶
Return: is_supported_flag, unsupported_reason (optional).
Source code in vllm/v1/attention/ops/flashmla.py
is_flashmla_sparse_supported ¶
Return: is_supported_flag, unsupported_reason (optional).