vllm.model_executor.layers.pooler
Modules:
| Name | Description |
|---|---|
| abstract | |
| activations | |
| common | |
| seqwise | Poolers that produce an output aggregating all tokens in the sequence. |
| special | |
| tokwise | Poolers that produce an output for each token in the sequence. |
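The difference between the seqwise and tokwise descriptions can be illustrated with a small sketch. It uses plain Python lists rather than tensors, and the helper names `mean_pool_sequence` and `scale_each_token` are hypothetical stand-ins, not functions from vLLM. Mean pooling is chosen only as one example of an aggregation.

```python
from typing import List

Vector = List[float]


def mean_pool_sequence(token_states: List[Vector]) -> Vector:
    """Sequence-wise: aggregate all token vectors into a single vector."""
    n = len(token_states)
    dim = len(token_states[0])
    return [sum(tok[d] for tok in token_states) / n for d in range(dim)]


def scale_each_token(token_states: List[Vector], factor: float) -> List[Vector]:
    """Token-wise: produce one output vector per input token."""
    return [[x * factor for x in tok] for tok in token_states]


# Three tokens, hidden size 2 (illustrative values only).
hidden = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

seq_out = mean_pool_sequence(hidden)     # one vector for the whole sequence
tok_out = scale_each_token(hidden, 2.0)  # one vector per token
```

Here `seq_out` has a single entry for the sequence, while `tok_out` keeps the same number of entries as there are tokens.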
DispatchPooler
Bases: Pooler
Dispatches calls to a sub-pooler based on the pooling task.
Source code in vllm/model_executor/layers/pooler/special.py
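The dispatch idea can be sketched without vLLM as follows. This is a simplified illustration under stated assumptions: `ToyDispatchPooler` is a hypothetical class, tasks are plain strings, sub-poolers are plain callables, and the task is passed directly to `forward` instead of being carried by a `PoolingMetadata` object. The real class may resolve the task and combine outputs differently.

```python
from typing import Callable, List, Mapping, Set

Vector = List[float]
# Hypothetical stand-in for a sub-pooler: maps token states to some output.
PoolFn = Callable[[List[Vector]], object]


class ToyDispatchPooler:
    """Routes each call to the sub-pooler registered for the given task."""

    def __init__(self, poolers_by_task: Mapping[str, PoolFn]) -> None:
        self.poolers_by_task = dict(poolers_by_task)

    def get_supported_tasks(self) -> Set[str]:
        # The supported tasks are exactly the keys of the mapping.
        return set(self.poolers_by_task)

    def forward(self, hidden_states: List[Vector], task: str) -> object:
        # Assumption: the task is given explicitly in this sketch.
        try:
            pooler = self.poolers_by_task[task]
        except KeyError:
            raise ValueError(f"Unsupported task: {task!r}") from None
        return pooler(hidden_states)


def last_token(states: List[Vector]) -> Vector:
    return states[-1]


def token_count(states: List[Vector]) -> int:
    return len(states)


# Task names here are illustrative strings.
pooler = ToyDispatchPooler({"embed": last_token, "classify": token_count})
embedding = pooler.forward([[1.0], [2.0]], "embed")
```

Keeping the mapping as the single source of truth means the set of supported tasks and the dispatch logic cannot drift apart.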
__init__
__init__(
    poolers_by_task: Mapping[PoolingTask, Pooler],
) -> None
for_embedding classmethod
for_embedding(pooler_config: PoolerConfig)
for_seq_cls classmethod
for_seq_cls(
    pooler_config: PoolerConfig,
    *,
    pooling: SequencePoolingMethod | SequencePoolingFn | None = None,
    classifier: ClassifierFn | None = None,
)
forward
forward(
    hidden_states: Tensor, pooling_metadata: PoolingMetadata
) -> PoolerOutput
get_pooling_updates
get_pooling_updates(
    task: PoolingTask,
) -> PoolingParamsUpdate
get_supported_tasks
get_supported_tasks() -> Set[PoolingTask]
IdentityPooler
Bases: Pooler
Source code in vllm/model_executor/layers/pooler/special.py
forward
forward(
    hidden_states: Tensor, pooling_metadata: PoolingMetadata
) -> PoolerOutput
get_supported_tasks
get_supported_tasks() -> Set[PoolingTask]
Pooler
The interface required for all poolers used in pooling models in vLLM.
Source code in vllm/model_executor/layers/pooler/abstract.py
forward abstractmethod
forward(
    hidden_states: Tensor, pooling_metadata: PoolingMetadata
) -> PoolerOutput
get_pooling_updates
get_pooling_updates(
    task: PoolingTask,
) -> PoolingParamsUpdate
Construct the updated pooling parameters to use for a supported task.
get_supported_tasks abstractmethod
get_supported_tasks() -> Set[PoolingTask]
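The shape of this interface (two abstract methods plus one overridable method) can be sketched with Python's `abc` module. Everything below is a hypothetical stand-in: `ToyPooler`, `ToyUpdate`, and `FirstTokenPooler` are not vLLM classes, `forward` omits the `pooling_metadata` argument, and the default return value of `get_pooling_updates` is an assumption.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List, Set

Vector = List[float]


@dataclass
class ToyUpdate:
    """Hypothetical stand-in for a pooling-parameter update."""
    requires_token_ids: bool = False


class ToyPooler(ABC):
    """Hypothetical stand-in mirroring the method names listed above."""

    @abstractmethod
    def get_supported_tasks(self) -> Set[str]:
        """Return the tasks this pooler can serve."""

    def get_pooling_updates(self, task: str) -> ToyUpdate:
        # Assumption: the default requests no changes to the parameters.
        return ToyUpdate()

    @abstractmethod
    def forward(self, hidden_states: List[Vector]) -> object:
        """Compute the pooled output from the hidden states."""


class FirstTokenPooler(ToyPooler):
    """Concrete example: returns the first token's hidden state."""

    def get_supported_tasks(self) -> Set[str]:
        return {"embed"}

    def forward(self, hidden_states: List[Vector]) -> Vector:
        return hidden_states[0]


pooled = FirstTokenPooler().forward([[0.5, 1.5], [2.0, 3.0]])
```

A subclass that omits either abstract method cannot be instantiated, which is how the interface requirement is enforced in this sketch.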
PoolingParamsUpdate dataclass
Source code in vllm/model_executor/layers/pooler/common.py
requires_token_ids class-attribute instance-attribute
requires_token_ids: bool = False
Set this flag to enable get_prompt_token_ids for your pooler.
__or__
__or__(other: PoolingParamsUpdate) -> PoolingParamsUpdate
apply
apply(params: PoolingParams) -> None
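One plausible reading of `__or__` and `apply` is sketched below. The semantics are assumptions, since the page gives only the signatures: `__or__` is taken to merge two updates by OR-ing their flags, and `apply` is taken to copy the flag onto the parameters object in place. `ToyParamsUpdate` and `ToyPoolingParams` are hypothetical stand-ins for the real classes.

```python
from dataclasses import dataclass


@dataclass
class ToyPoolingParams:
    """Hypothetical stand-in for the parameters object being updated."""
    requires_token_ids: bool = False


@dataclass(frozen=True)
class ToyParamsUpdate:
    """Hypothetical stand-in for an update with a single boolean flag."""
    requires_token_ids: bool = False

    def __or__(self, other: "ToyParamsUpdate") -> "ToyParamsUpdate":
        # Assumption: merged update needs token IDs if either side does.
        return ToyParamsUpdate(
            requires_token_ids=self.requires_token_ids or other.requires_token_ids
        )

    def apply(self, params: ToyPoolingParams) -> None:
        # Assumption: apply mutates the params in place and returns nothing.
        params.requires_token_ids = self.requires_token_ids


# Merge the updates requested by two sub-poolers, then apply the result.
merged = ToyParamsUpdate() | ToyParamsUpdate(requires_token_ids=True)
params = ToyPoolingParams()
merged.apply(params)
```

Under these assumptions, combining updates with `|` lets a pooler that wraps several sub-poolers request the union of their requirements.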