vllm.model_executor.models.paligemma ¶
PaliGemmaImageEmbeddingInputs ¶
Bases: TensorSchema
Dimensions
- bn: Batch size * number of images
- ifs: Image feature size
- hs: Hidden size (must match language model backbone)
Source code in vllm/model_executor/models/paligemma.py
PaliGemmaImagePixelInputs ¶
Bases: TensorSchema
Dimensions
- bn: Batch size * number of images
- c: Number of channels (3)
- h: Height
- w: Width