Feature Request: Add NVFP4 tensor mapping for GEMMA4 architecture

### Prerequisites
- [x] I am running the latest code.
- [x] I carefully followed the README.md.
- [x] I searched using keywords relevant to my issue.
- [x] I reviewed the Discussions.

### Feature Description
The `GEMMA4` architecture implementation in `llama.cpp` (specifically the C++ loader) does not yet map `GGML_TYPE_NVFP4` weights to internal architectural slots. Even with a valid NVFP4 GGUF, the loader fails with a tensor count mismatch because it only recognizes the FP32/BF16 tensors.

### Motivation
NVFP4 is the native 4-bit format for **NVIDIA Blackwell** GPUs. Supporting this mapping in the `GEMMA4` architecture is essential for leveraging hardware acceleration on RTX 50-series and B200 hardware.

### Possible Implementation
Update the architectural registry for `GEMMA4` to accept `GGML_TYPE_NVFP4` for weights and their associated scale tensors.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Add NVFP4 tensor mapping for GEMMA4 architecture #21777

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature Request: Add NVFP4 tensor mapping for GEMMA4 architecture #21777

Description

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions