Add openai-gpt (GPT-1) contrib model port by dhwanw · Pull Request #87 · aws-neuron/neuronx-distributed-inference

dhwanw · 2026-03-18T01:44:43Z

Description

Adds NeuronX port of OpenAI GPT-1 (117M) to the contrib models collection.

Model Information

Field	Value
Model	openai-community/openai-gpt
Architecture	Decoder-only transformer (GPT-1)
Parameters	117M
TP Degree	1
Precision	BF16

Checklist

Model compiles successfully on Neuron
Token matching validated (100% match, 200/200 tokens)
Performance profiled (449.3 tok/s)
README with architecture details, usage, validation results
Integration tests included

Folder Structure

contrib/models/openai-gpt/
├── README.md
├── src/
│   ├── __init__.py
│   └── modeling_openai_gpt.py
└── test/
    └── integration/
        └── test_model.py

Testing

Token Match: 100.0% (200/200 tokens, 10 prompts)
Throughput: 449.3 tok/s (TP=1, BS=1, seq_len=128)

Compatibility

Neuron SDK: 2.22+
Instance: trn1.32xlarge

🤖 Generated with Claude Code

Validated with 100% token match (200/200 tokens across 10 prompts). Performance: 437.4 tokens/sec on trn1 with TP=1, seq_len=128, bf16. Key architecture: post-norm transformer, Conv1D weights, no final LayerNorm, fused QKV attention, standard GELU activation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Not needed — validation instructions are in README. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

dhwanw and others added 3 commits March 16, 2026 21:11

Add performance profiling metrics to README

0aaec31

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Remove validation shell script from contrib package

332e8f7

Not needed — validation instructions are in README. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

dhwanw marked this pull request as ready for review March 19, 2026 22:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add openai-gpt (GPT-1) contrib model port#87

Add openai-gpt (GPT-1) contrib model port#87
dhwanw wants to merge 3 commits intomainfrom
contrib/openai-gpt

dhwanw commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dhwanw commented Mar 18, 2026

Description

Model Information

Checklist

Folder Structure

Testing

Compatibility

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant