vllm.model_executor.models.glm4_moe_lite

Inference-only GLM-4.7-Flash model compatible with HuggingFace weights.