Qwen3-VL training with flashattn+vllm

### Question

Hi, I ran into a training issue when using rLLM with a VL model backend stack and wanted to check whether this is a known compatibility problem on the rLLM side.

`RuntimeError: This flash attention build does not support headdim not being a multiple of 32.`



### Context

[https://github.com/vllm-project/vllm/issues/26989](url) I have read about this one, but it doses not seem to work at rllm

### Relevant Code / Config

```python

```

### Environment

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qwen3-VL training with flashattn+vllm #464

Question

Context

Relevant Code / Config

Environment

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Qwen3-VL training with flashattn+vllm #464

Description

Question

Context

Relevant Code / Config

Environment

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions