[recipe] feat: add TRTLLM FP8-E2E recipe for Qwen3-30B-A3B DAPO on GB200 by Superjomn · Pull Request #112 · verl-project/verl-recipe

Superjomn · 2026-06-16T04:46:41Z

The results should roughly align with the [vLLM recipe's experiment](https://verl.readthedocs.io/en/latest/low_precision/fp8.html#experiments-and-results).

gemini-code-assist

Code Review

This pull request introduces a new end-to-end FP8 training recipe for Qwen3-30B-A3B using Megatron and TRT-LLM, along with updated documentation in the README. Feedback on the new shell script includes wrapping the logger list in double quotes to prevent bash globbing issues and addressing an unused variable (`rollout_token_veto_threshold`) that is defined but not mapped to the algorithm configuration.
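The globbing concern can be reproduced in isolation. The sketch below is only an illustration, not code from the PR's script: the override name `trainer.logger` and the values `console,wandb` are assumptions chosen for the demo. It shows that an unquoted bracket list is treated by bash as a glob character class, so it can be silently replaced by a matching filename in the working directory.

```shell
#!/usr/bin/env bash
# Minimal demo of why an unquoted bracket list is fragile in bash.
# The key "trainer.logger" and its values are illustrative assumptions.
set -u

workdir="$(mktemp -d)"
cd "$workdir" || exit 1

# A stray file whose name happens to match the glob pattern below.
touch 'trainer.logger=c'

# Unquoted: [console,wandb] is a glob character class, so bash replaces
# the whole word with the matching filename.
set -- trainer.logger=[console,wandb]
unquoted="$1"

# Double-quoted: the word is passed through literally.
set -- "trainer.logger=[console,wandb]"
quoted="$1"

printf 'unquoted: %s\n' "$unquoted"
printf 'quoted:   %s\n' "$quoted"

# Clean up the temporary directory.
cd / && rm -rf "$workdir"
```

With no matching file present, bash's default behavior leaves the unquoted word unchanged, which is why the problem tends to surface only occasionally; quoting removes the dependence on directory contents and on shell options such as `nullglob` or `failglob`.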



sophiayyya · 2026-06-23T16:02:16Z

Hi @Superjomn, it seems that the entropy is higher than the entropy reported in https://verl.readthedocs.io/en/latest/low_precision/fp8.html#qwen3-30b-a3b-moe-model. Have you applied TIS? It would also be better to include a comparison between FP8 and BF16.

Superjomn marked this pull request as draft June 16, 2026 04:46

gemini-code-assist Bot reviewed Jun 16, 2026


Comment thread low_precision/run_dapo_qwen3_moe_30b_megatron_trtllm_fp8e2e.sh Outdated

Comment thread low_precision/run_dapo_qwen3_moe_30b_megatron_trtllm_fp8e2e.sh Outdated

Superjomn changed the title ~~[recipe] feat: add TRT-LLM FP8-E2E recipe for Qwen3-30B-A3B DAPO on B…~~ [recipe] feat: add TRTLLM FP8-E2E recipe for Qwen3-30B-A3B DAPO on GB200 Jun 16, 2026

[recipe] feat: add TRT-LLM FP8-E2E recipe for Qwen3-30B-A3B DAPO on B…

1022973

…lackwell Signed-off-by: Chunwei Yan <chunweiy@nvidia.com>

Superjomn force-pushed the chunweiy/low-precision-trtllm-fp8e2e branch from 6a8c0b1 to 1022973 Compare June 16, 2026 06:15

Superjomn marked this pull request as ready for review June 22, 2026 03:23


Superjomn wants to merge 1 commit into verl-project:main from Superjomn:chunweiy/low-precision-trtllm-fp8e2e

2 participants
