feat: add Qwen3.5 Dense Model EAGLE3 training support by 36330 · Pull Request #516 · sgl-project/SpecForge

36330 · 2026-03-28T03:27:26Z

Add qwen3_5_eagle_patch.py: Monkey patch for SGLang's Qwen3.5 models
- Captures aux_hidden_states from 3 layers (layer 2, mid, layer-3)
- Supports both Qwen3_5ForCausalLM and Qwen3_5ForConditionalGeneration
- Environment variable QWEN35_EAGLE3_ENABLE=1 to enable
Add qwen3.5-4b-eagle3.json: Draft model config for Qwen3.5-4B
Modify train_eagle3.py: Auto-apply patch on initialization
Modify eagle3_target_model.py: Auto-detect and patch Qwen3.5 models
Modify llama3_eagle.py: Handle 'default' RoPE scaling type for Qwen3.5

Tested:

Hidden states generation: ✓
Training with TTT: ✓
Qwen3.5-4B Dense model: ✓

example

# generate hidden states
torchrun --nproc_per_node=1 scripts/prepare_hidden_states.py \
    --target-model-path Qwen/Qwen3.5-4B \
    --enable-aux-hidden-states

# train
torchrun --nproc_per_node=4 scripts/train_eagle3.py \
    --target-model-path Qwen/Qwen3.5-4B \
    --draft-model-config configs/qwen3.5-4b-eagle3.json

- Add qwen3_5_eagle_patch.py: Monkey patch for SGLang's Qwen3.5 models - Captures aux_hidden_states from 3 layers (layer 2, mid, layer-3) - Supports both Qwen3_5ForCausalLM and Qwen3_5ForConditionalGeneration - Environment variable QWEN35_EAGLE3_ENABLE=1 to enable - Add qwen3.5-4b-eagle3.json: Draft model config for Qwen3.5-4B - Modify train_eagle3.py: Auto-apply patch on initialization - Modify eagle3_target_model.py: Auto-detect and patch Qwen3.5 models - Modify llama3_eagle.py: Handle 'default' RoPE scaling type for Qwen3.5 Tested: - Hidden states generation: ✓ - Training with TTT: ✓ - Qwen3.5-4B Dense model: ✓

gemini-code-assist · 2026-03-28T03:27:29Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

36330 · 2026-03-28T03:43:41Z

jiapingW · 2026-03-31T03:18:53Z

We have implemented the qwen3.5 eagle3 training in PR.

36330 requested review from FlamingoPg, FrankLeeeee, shuaills and sleepcoo as code owners March 28, 2026 03:27

Add example script for Qwen3.5-4B EAGLE3 training

eb3f4f6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Qwen3.5 Dense Model EAGLE3 training support#516

feat: add Qwen3.5 Dense Model EAGLE3 training support#516
36330 wants to merge 2 commits intosgl-project:mainfrom
36330:feat/qwen3-5-eagle3-support

36330 commented Mar 28, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Mar 28, 2026

Uh oh!

36330 commented Mar 28, 2026

Uh oh!

jiapingW commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

36330 commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

example

Uh oh!

gemini-code-assist bot commented Mar 28, 2026

Uh oh!

36330 commented Mar 28, 2026

Uh oh!

jiapingW commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

36330 commented Mar 28, 2026 •

edited

Loading