Skip to content

feat(recipe): add qwen3-vl-8b NPU recipe with DAPO#89

Open
yidingdou wants to merge 5 commits into
verl-project:mainfrom
yidingdou:main
Open

feat(recipe): add qwen3-vl-8b NPU recipe with DAPO#89
yidingdou wants to merge 5 commits into
verl-project:mainfrom
yidingdou:main

Conversation

@yidingdou

Copy link
Copy Markdown

Title: Adds Qwen3-VL-8B vision-language reasoning NPU recipe with DAPO on GEO3K

Why this recipe?

verl-recipe currently lacks vision-language (VL) reasoning recipes on NPU, despite the growing importance of multimodal reasoning.

This recipe is designed as VL reasoning with verifiable rewards:

Shows how run Qwen3-VL-8B dapo on NPU

Recipe Design

Model: Qwen3-VL-8B-Instruct (full-weight, NPU A2 * 2 nnodes)
Dataset: geo3k

Results

100 steps of DAPO on 2 A2(910B3) nnodes 64GB (~53 h):

算法: DAPO
Baseline (mean@1): 0.438
Test acc best (mean@1): 0.715

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a new shell script run_dapo_qwen3_vl_8b_fsdp2_npu.sh to configure and execute DAPO training for the Qwen3-VL-8B model on NPU hardware. The script defines various hyperparameters, environment variables, and model configurations. Feedback identifies critical syntax errors where spaces were incorrectly inserted into configuration keys (e.g., actor rollout_ref instead of actor_rollout_ref) and a minor formatting inconsistency regarding line continuation characters.

Comment thread dapo/run_dapo_qwen3_vl_8b_fsdp2_npu.sh Outdated
Comment thread dapo/run_dapo_qwen3_vl_8b_fsdp2_npu.sh Outdated
yidingdou and others added 2 commits April 23, 2026 19:11
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant