vllm.model_executor.layers.pooler.tokwise.heads
TokenClassifierPoolerHead
Bases: TokenPoolerHead
Source code in vllm/model_executor/layers/pooler/tokwise/heads.py
__init__
__init__(
    classifier: ClassifierFn | None = None,
    logit_bias: float | None = None,
    head_dtype: dtype | str | None = None,
    activation: ActivationFn | None = None,
) -> None
forward_chunk
forward_chunk(
    pooled_data: TokenPoolingMethodOutputItem,
    pooling_param: PoolingParams,
) -> TokenPoolerHeadOutputItem
get_supported_tasks
get_supported_tasks() -> Set[PoolingTask]
TokenEmbeddingPoolerHead
Bases: TokenPoolerHead
__init__
__init__(
    head_dtype: dtype | str | None = None,
    projector: ProjectorFn | None = None,
    activation: ActivationFn | None = None,
) -> None
forward_chunk
forward_chunk(
    pooled_data: TokenPoolingMethodOutputItem,
    pooling_param: PoolingParams,
) -> TokenPoolerHeadOutputItem
get_supported_tasks
get_supported_tasks() -> Set[PoolingTask]
TokenPoolerHead
forward
forward(
    pooled_data: list[TokenPoolingMethodOutputItem],
    pooling_metadata: PoolingMetadata,
) -> list[TokenPoolerHeadOutputItem]
forward_chunk (abstractmethod)
forward_chunk(
    pooled_data: TokenPoolingMethodOutputItem,
    pooling_param: PoolingParams,
) -> TokenPoolerHeadOutputItem
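The signatures above suggest a template-method layout: the base class exposes a list-level forward, and each subclass implements the per-item forward_chunk. The following is a minimal, framework-free sketch of that pattern, not the vLLM implementation. It assumes that forward pairs each pooled item with one per-request parameter object and delegates to forward_chunk; the page does not confirm this. SketchTokenPoolerHead, ScalingHead, FakePoolingParams, and the scale field are hypothetical names invented for illustration, and plain nested lists stand in for tensors.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class FakePoolingParams:
    """Hypothetical stand-in for PoolingParams; `scale` is invented for this demo."""

    scale: float = 1.0


class SketchTokenPoolerHead(ABC):
    """Hypothetical mirror of the TokenPoolerHead interface using plain lists."""

    def forward(self, pooled_data, pooling_params):
        # Assumption: there is exactly one parameter object per pooled item.
        if len(pooled_data) != len(pooling_params):
            raise ValueError("expected one pooling parameter object per item")
        # Delegate each (item, params) pair to the subclass hook.
        return [
            self.forward_chunk(item, param)
            for item, param in zip(pooled_data, pooling_params)
        ]

    @abstractmethod
    def forward_chunk(self, pooled_data, pooling_param):
        """Process the token-wise data of a single request."""


class ScalingHead(SketchTokenPoolerHead):
    """Toy subclass: multiplies every token vector by the request's scale."""

    def forward_chunk(self, pooled_data, pooling_param):
        return [[x * pooling_param.scale for x in row] for row in pooled_data]


head = ScalingHead()
outputs = head.forward(
    [[[1.0, 2.0]], [[3.0]]],  # two requests, one token vector each
    [FakePoolingParams(scale=2.0), FakePoolingParams(scale=0.5)],
)
print(outputs)
```

Because forward_chunk is abstract, instantiating SketchTokenPoolerHead directly raises TypeError; only concrete subclasses such as the toy ScalingHead can be constructed.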