[Feature]: TPU Embedding models support?

### 🚀 The feature, motivation and pitch

Hi, I want to run recent embedding models, e.g. `Qwen/Qwen3-Embedding-0.6B`, on TPU nodes. Although vLLM supports TPU, it does not really support embedding models there, because the only attention implementation available on TPU is `PALLAS`, which implements `DECODER` attention only. https://github.com/vllm-project/vllm/blob/99b4f080d83ae284941b01922d7fe3b9a39034fd/vllm/v1/attention/backends/pallas.py#L164-L168

Meanwhile, Qwen3 Embedding uses `ENCODER_ONLY` attention, so it can't run on TPU. https://github.com/vllm-project/vllm/blob/99b4f080d83ae284941b01922d7fe3b9a39034fd/vllm/model_executor/models/qwen3.py#L166-L173

It would be nice if we could support Qwen3 Embedding on TPU.

### Alternatives

I am currently running Qwen3 Embedding via `transformers`, but it's not as performant as vLLM.

### Additional context

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

	if attn_type != AttentionType.DECODER:
	    raise NotImplementedError("Encoder self-attention and "
	                              "encoder/decoder cross-attention "
	                              "are not implemented for "
	                              "PallasAttentionBackendImpl")

	# By default, Qwen3 uses causal attention as it is a decoder-only model.
	# You can override the HF config with `is_causal=False` to enable
	# bidirectional attention, which is used in some embedding models
	# (e.g. Alibaba-NLP/gte-Qwen3-7B-instruct)
	if getattr(config, "is_causal", True):
	    attn_type = AttentionType.DECODER
	else:
	    attn_type = AttentionType.ENCODER_ONLY
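
To make the interaction between the two excerpts above concrete, here is a minimal, self-contained sketch. It does not import vLLM: `AttentionType` is a local stand-in enum, and `select_attn_type`, `pallas_check`, and `runs_on_pallas` are hypothetical helper names that only mirror the logic of the quoted lines. It shows that a config with `is_causal=False` selects `ENCODER_ONLY` and is then rejected by the Pallas check, while the default (causal) config passes.

```python
from enum import Enum
from types import SimpleNamespace


class AttentionType(Enum):
    """Local stand-in for vLLM's AttentionType (assumption: only two values needed)."""
    DECODER = "decoder"
    ENCODER_ONLY = "encoder_only"


def select_attn_type(config) -> AttentionType:
    # Mirrors the Qwen3 excerpt: causal unless the HF config sets is_causal=False.
    if getattr(config, "is_causal", True):
        return AttentionType.DECODER
    return AttentionType.ENCODER_ONLY


def pallas_check(attn_type: AttentionType) -> None:
    # Mirrors the Pallas excerpt: anything other than DECODER is rejected.
    if attn_type != AttentionType.DECODER:
        raise NotImplementedError("Encoder self-attention and "
                                  "encoder/decoder cross-attention "
                                  "are not implemented for "
                                  "PallasAttentionBackendImpl")


def runs_on_pallas(config) -> bool:
    # Hypothetical helper: True if the selected attention type passes the check.
    try:
        pallas_check(select_attn_type(config))
    except NotImplementedError:
        return False
    return True


default_cfg = SimpleNamespace()                       # no is_causal attribute
bidirectional_cfg = SimpleNamespace(is_causal=False)  # bidirectional attention

print(runs_on_pallas(default_cfg))        # True
print(runs_on_pallas(bidirectional_cfg))  # False
```

Under these assumptions, supporting such models on TPU would mean handling the `ENCODER_ONLY` case in the Pallas backend instead of raising.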

[Feature]: TPU Embedding models support? #20869
