wandb · johndmulhausen · Apr 9, 2026 · Apr 8, 2026
@@ -13,6 +13,7 @@ W&B Inference provides access to several open-source foundation models. Each mod
 | Model                           | Model ID (for API usage)                       | Type         | Context Window | Parameters                | Description                                                                                                                                       |
 | ------------------------------- | ---------------------------------------------- | ------------ | -------------- | ------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------- |
 | DeepSeek V3.1                   | `deepseek-ai/DeepSeek-V3.1`                    | Text         | 161k           | 37B-671B (Active-Total)   | A large hybrid model that supports both thinking and non-thinking modes via prompt templates.                                                     |
+| Google Gemma 4 31B              | `google/gemma-4-31B-it`                        | Text, Vision | 262k           | 31B (Total)               | Dense model designed for advanced reasoning, agentic workflows, and long-context tasks, natively trained on 140+ languages.                       |
 | Meta Llama 3.3 70B              | `meta-llama/Llama-3.3-70B-Instruct`            | Text         | 128k           | 70B (Total)               | Multilingual model excelling in conversational tasks, detailed instruction-following, and coding.                                                 |
 | Meta Llama 3.1 70B              | `meta-llama/Llama-3.1-70B-Instruct`            | Text         | 128k           | 70B (Total)               | Efficient conversational model optimized for responsive multilingual chatbot interactions.                                                        |
 | Meta Llama 3.1 8B               | `meta-llama/Llama-3.1-8B-Instruct`             | Text         | 128k           | 8B (Total)                | Efficient conversational model optimized for responsive multilingual chatbot interactions.                                                        |