Summary
The ollama service in @lab/ll-KNOWRAG/compose.yml is pinned to ollama/ollama:0.5.4. That release predates the qwen3-embedding model family — pulling qwen3-embedding:4b fails with:
Error: pull model manifest: 412: The model you are attempting to pull requires a newer version of Ollama.
Change
- Bump the pinned image: ollama/ollama:0.5.4 → ollama/ollama:0.30.0-rc20.
- Update the Current pins list in @lab/ll-KNOWRAG/.claude/rules/docker-infra.md to match.
Acceptance criteria
- docker compose --env-file .env.example config -q validates.
- docker exec knowrag-ollama nvidia-smi still reports the RTX 5060 (GPU reservation intact).
- docker exec knowrag-ollama ollama pull qwen3-embedding:4b succeeds.
- Existing models (nomic-embed-text, qwen2.5:*) remain listed.
Zone
@lab: disposable sandbox. 0.30.0-rc20 is a release candidate, chosen deliberately for the latest model support.
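The acceptance criteria could be checked in one pass with a small shell helper. This is a minimal sketch, not part of the repository: the function name verify_ollama_bump is hypothetical, the "RTX 5060" match string is an assumption about what nvidia-smi prints, and using ollama list to confirm the existing models is an assumption about how that criterion would be checked. It expects to be run from the directory containing compose.yml with the stack already up.

```shell
# Hypothetical helper (not in the repo): runs the acceptance checks in order
# and stops at the first failure.
verify_ollama_bump() {
  local container="knowrag-ollama"  # container name taken from the criteria
  local models

  # 1. The compose file must still validate (-q prints nothing on success).
  docker compose --env-file .env.example config -q \
    || { echo "FAIL: compose config did not validate" >&2; return 1; }

  # 2. The GPU must still be visible inside the container.
  #    Assumption: nvidia-smi output contains the string "RTX 5060".
  docker exec "$container" nvidia-smi | grep -q "RTX 5060" \
    || { echo "FAIL: RTX 5060 not reported by nvidia-smi" >&2; return 1; }

  # 3. The pull that failed on 0.5.4 must now succeed.
  docker exec "$container" ollama pull qwen3-embedding:4b \
    || { echo "FAIL: could not pull qwen3-embedding:4b" >&2; return 1; }

  # 4. Previously pulled models must still be listed.
  models="$(docker exec "$container" ollama list)" \
    || { echo "FAIL: ollama list failed" >&2; return 1; }
  printf '%s\n' "$models" | grep -q "nomic-embed-text" \
    || { echo "FAIL: nomic-embed-text missing" >&2; return 1; }
  printf '%s\n' "$models" | grep -q "qwen2.5:" \
    || { echo "FAIL: no qwen2.5 model listed" >&2; return 1; }

  echo "OK: all acceptance checks passed"
}
```

To use it, source the file and call verify_ollama_bump. A non-zero exit status together with a FAIL line on stderr identifies the first criterion that did not hold.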