feat(valkey-mcp-server): V2 — GLIDE migration, 12 focused tools by Jonathan-Improving · Pull Request #7 · Bit-Quill/mcp

Jonathan-Improving · 2026-04-24T15:39:28Z

Valkey MCP Server V2 — Focused Search + JSON + Command Runner Tools

Major rewrite of the Valkey MCP server: migrates from valkey-py to Valkey-GLIDE, replaces ~40+ generic data-type tools with 12 purpose-built tools (4 AI Search + 5 JSON Intelligence + 3 Command Runners), and adds a multi-provider embeddings layer.

What Changed

Architecture:

Migrated from valkey-py to valkey-glide (async-native GLIDE client)
Replaced ~40 per-data-type tool wrappers with 12 focused tools
Added multi-provider embeddings (Bedrock, OpenAI, Ollama, Hash) with automatic fallback
Native stdout fd redirect to prevent GLIDE's Rust logger from corrupting MCP stdio transport
Configurable GLIDE client logging via VALKEY_GLIDE_LOG_LEVEL env var

New Tools:

#	Tool	Category
1	`manage_index`	AI Search — FT.CREATE / DROP / INFO / _LIST with structured JSON input
2	`add_documents`	AI Search — Embed → binary pack → HSET pipeline, auto-creates index
3	`search`	AI Search — Unified semantic, text, hybrid, find-similar with mode auto-detection
4	`aggregate`	AI Search — FT.AGGREGATE structured pipeline builder
5	`json_get`	JSON Intelligence — Typed JSON.GET with path
6	`json_set`	JSON Intelligence — Typed JSON.SET with path + optional TTL
7	`json_arrappend`	JSON Intelligence — Append to JSON array
8	`json_arrpop`	JSON Intelligence — Pop from JSON array
9	`json_arrtrim`	JSON Intelligence — Trim JSON array
10	`valkey_read`	Command Runner — Read-only commands (GET, HGETALL, LRANGE, etc.)
11	`valkey_write`	Command Runner — Mutating commands (SET, HSET, LPUSH, etc.)
12	`valkey_admin`	Command Runner — Destructive commands (FLUSHDB, DEBUG, etc.) — opt-in via `VALKEY_ADMIN_ENABLED`

3-Tier Command Safety:

valkey_read — always available, read-only commands
valkey_write — available unless readonly mode is active
valkey_admin — disabled by default, requires VALKEY_ADMIN_ENABLED=true

Removed: All generic data-type tools (Strings, Lists, Sets, Sorted Sets, Hashes, Streams, Bitmaps, HyperLogLog, misc, server_management) — these performed worse than raw CLI in benchmarks.

Safety & Reliability

@tool_errors decorator catches RequestError only — programming errors propagate for debugging
@readonly_guard decorator enforces readonly mode on mutating tools
asyncio.Lock on GLIDE client singleton (double-check pattern)
threading.Lock on embeddings provider singleton
Defensive atexit shutdown handler for GLIDE client + Ollama httpx client
Deferred STS credential validation (runs in executor on first Bedrock embedding call)
File-based logging with loguru for post-mortem analysis

Testing

137 unit tests (mocked)
55 live integration tests against real Valkey + Ollama
TLS integration tests with CA cert verification and password auth
Error case tests covering all previously-discovered crash scenarios
Tests organized: tests/ (unit) and tests/integration/ (live)
pytest-timeout to prevent hanging tests

Documentation

README fully rewritten with quickstart, configuration tables, troubleshooting
Tool docstrings with Args/Returns/Examples
JSON.SET TTL non-atomicity documented

Code Review: Valkey MCP Server V2 — GLIDE Migration + 9 Focused Tools

Summary

This is a well-structured major rewrite that replaces ~40 generic data-type tools with 9 purpose-built tools for AI Search and JSON operations, migrating from valkey-py to valkey-glide. The architecture is significantly cleaner, the tool surface is more focused, and the pre-validation pattern to prevent GLIDE native crashes is a smart defensive measure.

🟢 What's Good

Pre-validation pattern — Checking index_exists() and _require_array() before GLIDE calls prevents native crashes. This is well-tested in test_error_cases_live.py.
Structured JSON input — The aggregate pipeline builder eliminates DSL syntax errors from AI agents. The LOAD field auto-detection is particularly well done.
Multi-provider embeddings — Clean factory pattern with Bedrock/OpenAI/Ollama/Hash providers. Good separation of concerns.
Test coverage — 110 unit tests + 35 live integration tests. Error case tests specifically targeting previously-discovered crash scenarios is excellent.
README rewrite — Troubleshooting table, quickstart, and configuration tables are much more useful than before.
decode_value utility — Recursive bytes decoding handles GLIDE's raw responses cleanly.

🔴 Issues to Address

connection.py — Global singleton race condition (see inline comment)
- get_client() has no locking. Concurrent coroutines can create duplicate GLIDE clients.
main.py — atexit handler will crash (see inline comment)
- asyncio.run(close_client()) fails when an event loop is already running at shutdown.

🟡 Issues to Consider

providers.py — OllamaEmbeddings httpx client never closed — Resource leak over time.
providers.py — BedrockEmbeddings.__init__ makes synchronous STS call — Blocks event loop at startup.
search_aggregate.py — Wildcard * query rejected — This is a valid FT.AGGREGATE pattern. The restriction limits usability.
search_query.py — _find_similar pagination bug — limit + 1 propagated to offset causes off-by-one when paginating.
json.py — JSON.SET + EXPIRE not atomic — TTL race condition if process crashes between the two calls.
json.py — Bare Exception catching — Swallows programming errors. Consider catching RequestError specifically.
providers.py — HashEmbeddings cycles 32 bytes for dimensions > 32 — Produces repeated patterns in test vectors.
connection.py — Synchronous file read for CA cert — Minor, but blocks event loop if cert is on network storage.

📝 Minor Notes

The docs/AEA-355-plan.md is a great internal document but probably shouldn't ship in the published package. Consider adding it to .gitignore or a docs/internal/ directory.
pyproject.toml adds boto3, openai, and httpx as hard dependencies. Users who only need JSON tools or text search will pull in AWS SDK and OpenAI SDK unnecessarily. Consider making these optional extras: pip install awslabs.valkey-mcp-server[bedrock], [openai], [ollama].
The hybrid_weight parameter in search() is documented as "advisory — reserved for future weighted scoring" but it's used for mode auto-detection (!= 0.5 triggers hybrid). This is confusing — either implement weighted scoring or remove the auto-detection behavior.

Verdict

The architecture and tool design are solid. The two 🔴 issues (singleton race condition and atexit crash) should be fixed before merge. The 🟡 items are worth addressing but aren't blockers.

…n`) and their respective unit tests. Update AEA-355 plan and main imports to reflect phase completion.

…hared `client` fixture to `conftest.py`

…-RPC transport corruption. Add inline documentation explaining the issue and pre-validation adjustments.

… checks in MCP server and tests. Simplify error handling paths.

…ough environment variables. Refactor JSON operations to utilize `glide_json` for consistency and improved maintainability.

… and embeddings logic

…level.

Co-authored-by: Animesh Barua <animebar@amazon.com>

…` injection in search queries.

edlng

Once the note is added to the docstring for hybrid search, lgtm!

…ine default dimension logic, and refactor AWS credential validation.

Jonathan-Improving · 2026-04-30T14:02:20Z

Closing, superceded by duplicated PR on AWS Labs.

jbrinkman reviewed Apr 24, 2026

View reviewed changes

Jonathan-Improving force-pushed the feature/mcp/v2 branch from 5ba8694 to b57e4e3 Compare April 24, 2026 18:14

github-advanced-security AI found potential problems Apr 24, 2026

View reviewed changes

Comment thread src/valkey-mcp-server/awslabs/valkey_mcp_server/embeddings/providers.py Fixed

Jonathan-Improving force-pushed the feature/mcp/v2 branch 2 times, most recently from 9107f21 to 78127b8 Compare April 24, 2026 18:31

github-advanced-security AI found potential problems Apr 24, 2026

View reviewed changes

Comment thread src/valkey-mcp-server/awslabs/valkey_mcp_server/main.py Fixed

jduo reviewed Apr 24, 2026

View reviewed changes

Comment thread src/valkey-mcp-server/awslabs/valkey_mcp_server/common/connection.py Outdated

Comment thread src/valkey-mcp-server/awslabs/valkey_mcp_server/common/connection.py Outdated

Comment thread src/valkey-mcp-server/awslabs/valkey_mcp_server/common/connection.py Outdated

Jonathan-Improving changed the title ~~feat(valkey-mcp-server): V2 — GLIDE migration, 9 focused search + JSON tools~~ feat(valkey-mcp-server): V2 — GLIDE migration, 12 focused tools Apr 24, 2026

Jonathan-Improving added 15 commits April 24, 2026 14:58

Introducing MCP server v2

cf0be8a

Round of clean-up / refactoring

4cafa2d

Add command runner tools (valkey_read, valkey_write, `valkey_admi…

985d32b

…n`) and their respective unit tests. Update AEA-355 plan and main imports to reflect phase completion.

Add inline documentation to explain some code

82da5df

Add "read-only" and exception handling guards

e626ee4

Remove redundant async def client fixture from tests and refactor s…

670c13f

…hared `client` fixture to `conftest.py`

Redirect GLIDE Rust logger output to stderr to prevent MCP stdio JSON…

3d370e3

…-RPC transport corruption. Add inline documentation explaining the issue and pre-validation adjustments.

Remove redundant index_exists utility and associated pre-validation…

ccb5047

… checks in MCP server and tests. Simplify error handling paths.

Add support for configurable vector algorithm and distance metric thr…

d13dced

…ough environment variables. Refactor JSON operations to utilize `glide_json` for consistency and improved maintainability.

Address PR review comments

72d5408

Lazy-load AWS STS credentials upon embedding creation

d6ef03c

Add nosec comments for security-related false positives in shutdown…

7c113e3

… and embeddings logic

Add TLS integration test

c9ff891

Move integration tests to their own scope

dc78525

Make reset_client async and add support for configurable GLIDE log …

7464928

…level.

Jonathan-Improving force-pushed the feature/mcp/v2 branch from fc380fb to 7464928 Compare April 24, 2026 21:00

edlng reviewed Apr 24, 2026

View reviewed changes

Jonathan-Improving and others added 5 commits April 24, 2026 16:30

Minor tidy-up

52eea22

Add Strands Agents SDK documentation support to README (awslabs#3298)

1867306

Co-authored-by: Animesh Barua <animebar@amazon.com>

Address latest comments, tidy-up code, and security hardening

4dd7466

Merge branch 'main' into feature/mcp/v2

e809c9b

Sanitize filter_expression and vector_field inputs to prevent `=>…

374381e

…` injection in search queries.

edlng approved these changes Apr 27, 2026

View reviewed changes

Bump version to 0.2.0, improve thread safety in reset_provider, ref…

21c307a

…ine default dimension logic, and refactor AWS credential validation.

Jonathan-Improving closed this Apr 30, 2026

Conversation

Jonathan-Improving commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Valkey MCP Server V2 — Focused Search + JSON + Command Runner Tools

What Changed

Safety & Reliability

Testing

Documentation

Related

Uh oh!

jbrinkman left a comment

Choose a reason for hiding this comment

Code Review: Valkey MCP Server V2 — GLIDE Migration + 9 Focused Tools

Summary

🟢 What's Good

🔴 Issues to Address

🟡 Issues to Consider

📝 Minor Notes

Verdict

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

edlng left a comment

Choose a reason for hiding this comment

Uh oh!

Jonathan-Improving commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Jonathan-Improving commented Apr 24, 2026 •

edited

Loading