It would be great to add native support for HyperCLOVA X SEED Think 14B to the Transformers library, so users can load it without `trust_remote_code=True`. In addition, this model is intended to serve as the backbone for future multimodal models to be released on the Hugging Face Hub. Without native Transformers support, every new model variant must bundle its own copy of `modeling_hyperclovax.py`, leading to code duplication and an increased maintenance burden.
Model description
HyperCLOVA X SEED Think 14B is a 14.74B-parameter reasoning LLM developed by NAVER Cloud. It is a LLaMA-style decoder-only transformer with two architectural modifications not present in standard LLaMA:
- Peri-Layer Normalization: an extra RMSNorm is applied after each sub-layer output (in addition to the standard pre-norm), controlled by a `use_post_norm` config flag.
- Maximal Update Parametrization (μP): per-config scaling factors (`attention_multiplier`, `residual_multiplier`, `embedding_multiplier`, `logits_scaling`) replace the standard fixed scaling, enabling stable training across model sizes.
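The two modifications can be sketched together in a toy sub-layer forward pass. This is illustrative only, not the actual modeling code: the function and argument names are assumptions, and the exact norm placement in the official implementation may differ.

```python
import numpy as np

def rms_norm(x, weight, eps=1e-6):
    # RMSNorm: divide by root-mean-square over the hidden dim, then scale
    rms = np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps)
    return (x / rms) * weight

def peri_norm_sublayer(x, sublayer, pre_w, post_w, residual_multiplier=1.0):
    """One sub-layer (attention or MLP) with peri-layer normalization:
    pre-norm -> sublayer -> extra post-norm -> scaled residual add.
    The extra post-norm is what `use_post_norm=True` would enable, and
    `residual_multiplier` stands in for the μP residual scaling factor."""
    h = sublayer(rms_norm(x, pre_w))    # standard pre-norm path
    h = rms_norm(h, post_w)             # extra RMSNorm on the sub-layer output
    return x + residual_multiplier * h  # μP-scaled residual connection

# toy usage: identity "sublayer" on a 2-token, 4-dim input
d = 4
x = np.ones((2, d))
out = peri_norm_sublayer(x, lambda t: t, np.ones(d), np.ones(d),
                         residual_multiplier=0.5)
print(out.shape)  # (2, 4)
```

In a standard pre-norm LLaMA block, only the first `rms_norm` call would be present and the residual multiplier would be fixed at 1.0; the sketch shows where the two config-controlled additions slot in.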
The model supports dual-mode reasoning: Think (chain-of-thought before answering) and Non-Think (direct answer), switchable via `apply_chat_template(force_reasoning=True/False)`. It also supports function calling via a custom ChatML dialect. The model is supported in vLLM as of March 2026.
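To make the mode switch concrete, here is a minimal sketch of how a dual-mode ChatML-style template could branch on a `force_reasoning` flag. The real template ships with the model's tokenizer config as Jinja; the tag names used here (`<|im_start|>`, `<think>`) are assumptions for illustration.

```python
def render_chat(messages, force_reasoning):
    """Toy renderer for a dual-mode ChatML-style prompt (illustrative;
    not the model's actual chat template)."""
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")
    if force_reasoning:
        # Think mode: pre-open a reasoning block so generation starts
        # with chain-of-thought before the final answer
        parts.append("<think>\n")
    return "".join(parts)

prompt = render_chat([{"role": "user", "content": "What is 2+2?"}],
                     force_reasoning=True)
print(prompt.endswith("<think>\n"))  # True
```

With native support, users would pass the flag straight through, e.g. `tokenizer.apply_chat_template(messages, force_reasoning=False, tokenize=False)`, and the bundled template would handle the branching.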
I checked that no existing PR covers this. I have also prepared a draft PR (#44956) in case it is helpful for the discussion or review.
Open source status
Provide useful links for the implementation