feat(recipe): add qwen3-vl-8b NPU recipe with DAPO by yidingdou · Pull Request #89 · verl-project/verl-recipe

yidingdou · 2026-04-23T11:08:57Z

Title: Adds Qwen3-VL-8B vision-language reasoning NPU recipe with DAPO on GEO3K

Why this recipe?

verl-recipe currently lacks vision-language (VL) reasoning recipes on NPU, despite the growing importance of multimodal reasoning.

This recipe is designed as VL reasoning with verifiable rewards:

Shows how run Qwen3-VL-8B dapo on NPU

Recipe Design

Model: Qwen3-VL-8B-Instruct (full-weight, NPU A2 * 2 nnodes)
Dataset: geo3k

Results

100 steps of DAPO on 2 A2(910B3) nnodes 64GB (~53 h):

算法: DAPO
Baseline (mean@1): 0.438
Test acc best (mean@1): 0.715

gemini-code-assist

Code Review

This pull request introduces a new shell script run_dapo_qwen3_vl_8b_fsdp2_npu.sh to configure and execute DAPO training for the Qwen3-VL-8B model on NPU hardware. The script defines various hyperparameters, environment variables, and model configurations. Feedback identifies critical syntax errors where spaces were incorrectly inserted into configuration keys (e.g., actor rollout_ref instead of actor_rollout_ref) and a minor formatting inconsistency regarding line continuation characters.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

yidingdou added 3 commits April 23, 2026 16:56

add qwen3_vl_8b dapo

dd4333d

fix: update parameter of dapo script for npu

f3f645d

fix: update parameter of dapo script for npu

141c7f2

gemini-code-assist Bot reviewed Apr 23, 2026

View reviewed changes

Comment thread dapo/run_dapo_qwen3_vl_8b_fsdp2_npu.sh Outdated

Comment thread dapo/run_dapo_qwen3_vl_8b_fsdp2_npu.sh Outdated

yidingdou and others added 2 commits April 23, 2026 19:11

Update dapo/run_dapo_qwen3_vl_8b_fsdp2_npu.sh

33daf2a

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update dapo/run_dapo_qwen3_vl_8b_fsdp2_npu.sh

44e8591

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(recipe): add qwen3-vl-8b NPU recipe with DAPO#89

feat(recipe): add qwen3-vl-8b NPU recipe with DAPO#89
yidingdou wants to merge 5 commits into
verl-project:mainfrom
yidingdou:main

yidingdou commented Apr 23, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yidingdou commented Apr 23, 2026

Why this recipe?

Recipe Design

Results

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant