[contrib] Add Qwen2-Audio-7B NeuronX port#99

Draft
lutfanm-aws wants to merge 1 commit into main from contrib/Qwen2-Audio-7B
Conversation

@lutfanm-aws
Summary

  • Adds NeuronX Distributed Inference implementation of Qwen/Qwen2-Audio-7B
  • Multimodal audio-to-text model (~8.2B params): audio encoder + Qwen2 7B language model
  • Both audio encoder and language model run entirely on Neuron hardware
  • Validated with speech transcription, audio captioning, and text-only generation

Model Details

  • Architecture: Multimodal encoder-decoder (Whisper-style audio encoder + Qwen2 decoder)
  • Parameters: ~8.2B (audio encoder ~600M + LM ~7.6B)
  • TP Degree: 2
  • Precision: BF16
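
To make the deployment settings above easier to reason about, here is a minimal sketch that bundles them and derives per-rank figures. This is a hypothetical helper for illustration only, not the actual NeuronX Distributed Inference config classes; the totals (~8.2B parameters, BF16 at 2 bytes per parameter) come from this PR's description.

```python
# Hypothetical config sketch (NOT the real NxDI config API): collects the
# settings reported in this PR and derives rough per-rank numbers.
from dataclasses import dataclass

@dataclass(frozen=True)
class PortConfig:
    tp_degree: int = 2        # tensor-parallel degree used for validation
    batch_size: int = 1
    seq_len: int = 1024
    bytes_per_param: int = 2  # BF16

    def params_per_rank(self, total_params: float = 8.2e9) -> float:
        """Approximate parameters sharded onto each NeuronCore rank."""
        return total_params / self.tp_degree

    def weight_bytes_per_rank(self, total_params: float = 8.2e9) -> float:
        """Approximate weight memory per rank in bytes."""
        return self.params_per_rank(total_params) * self.bytes_per_param

cfg = PortConfig()
print(cfg.params_per_rank())       # ~4.1e9 parameters per rank
print(cfg.weight_bytes_per_rank()) # ~8.2e9 bytes (~8.2 GB) of weights per rank
```

With TP=2 each rank holds roughly half the model, so BF16 weights alone occupy about 8.2 GB per rank before activations and KV cache.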

Validation

  • Speech transcription: exact match ✅
  • Audio captioning: correct caption ✅
  • Text-only generation: correct response ✅
  • Configuration: TP=2, batch_size=1, seq_len=1024

Performance

  • Token generation: 15-16 tok/s on trn1.32xlarge (TP=2)
  • Audio encoding: ~60ms for 3-4s audio
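
A back-of-envelope latency model from the numbers above: encode the audio once (~60 ms), then decode autoregressively at the measured token rate. The constants are this PR's reported measurements, not guaranteed figures, and the function itself is illustrative.

```python
# Rough end-to-end latency estimate from this PR's reported numbers
# (~15-16 tok/s decode, ~60 ms audio encode); illustrative only.
def estimate_latency_s(n_tokens: int, tok_per_s: float = 15.5,
                       audio_encode_s: float = 0.060) -> float:
    """One audio-encode pass plus autoregressive decoding of n_tokens."""
    return audio_encode_s + n_tokens / tok_per_s

print(round(estimate_latency_s(64), 2))  # 64 generated tokens: ~4.19 s
```

For short transcriptions the audio encoder is a negligible fraction of total latency; token generation dominates.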

Files

  • contrib/models/Qwen2-Audio-7B/src/modeling_qwen2_audio.py — Model implementation
  • contrib/models/Qwen2-Audio-7B/src/configuration_qwen2_audio.py — Config classes
  • contrib/models/Qwen2-Audio-7B/test/ — Integration tests
  • contrib/models/Qwen2-Audio-7B/README.md — Documentation

