sync: rebase on CPA v7.0.2 upstream by hpylsy · Pull Request #29 · Ve-ria/CLIProxyAPIPlus

hpylsy · 2026-05-11T10:01:28Z

What

Rebased CPAP onto CPA v7.0.2 (upstream latest stable). All 12 third-party providers preserved, all CPA v7.0.2 new features gained.

Why

CPAP was forked from CPA ~v6.0.x and never synced back. This caused:

/v1/images/generations broken (streaming response path missing image data)
Missing CPA features: home auth session reuse, Redis queue protocol, antigravity credits, protocol multiplexer, image generation mode control
Accumulating tech debt making future syncs harder

Approach

CPA v7.0.2 as base + CPAP extensions ported on top (not a merge — a clean rebase). CPA core code is untouched; CPAP extensions are additive.

All 12 CPAP providers preserved

Kiro, CodeBuddy, CodeBuddy AI, CodeArts, JoyCode, Cursor, GitHub Copilot, GitLab, iFlow, Kilo, BT, Qoder

Including: executors, translators, auth handlers, login commands, model definitions (static + dynamic fetch), management UI web panel, kiro-quota endpoint, usage statistics.

Gained from CPA v7.0.2

Working /v1/images/generations + /v1/images/edits
Home auth session reuse with scoped caching
Redis queue protocol (RESP interface)
Protocol multiplexer (buffered conn + mux listener)
Antigravity credits fallback
Codex image generation tool handling
Improved websocket responses handler
DisableCooling per-credential option
api-key-usage management endpoint

Testing

13/13 acceptance tests pass on production (154.201.92.23:8317):

Model list: 62 models (exact match)
Chat: Kiro, Codex, Gemini, Antigravity, alias models
Streaming: confirmed
Image generation: base64 response confirmed
Management: usage, auth-files, kiro-quota

Future CPA sync

git checkout main && git pull origin main
git checkout feat/cpap-integration && git rebase main

8 additive commits — minimal rebase conflicts expected.

- Simplify project ID selection to always use the backend project ID returned by Gemini onboarding - Update Gemini CLI version from 0.31.0 to 0.34.0 - Add 'terminal' to User-Agent string for better client identification Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ment Both branches assign finalProjectID = responseProjectID, so move the assignment outside the conditional and keep only the logging inside.

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

After a Codex CLI compact, the client sends a full conversation transcript (with compaction items or assistant messages) as input. Previously, normalizeResponseSubsequentRequest() unconditionally merged this with stale lastRequest/lastResponseOutput, breaking function_call/function_call_output pairings and causing 400 errors ("No tool output found for function call"). Add inputContainsFullTranscript() heuristic that detects compaction items (type=compaction/compaction_summary) or assistant messages in the input array, and bypasses the merge when a full transcript is present. Fixes router-for-me#2207

Custom headers configured under openai-compatibility (and any other provider passing through applyCustomHeaders) were silently dropped for the Host key, because Go's net/http reads the wire Host from req.Host, not req.Header["Host"]. As a result, virtual-host routed upstreams (e.g. LiteLLM behind an ingress) saw the base-url's host instead of the user-configured override and returned 404. Detect the Host key with http.CanonicalHeaderKey and assign it to req.Host so it is actually written on the wire. Other headers continue to use Header.Set as before. Fixes router-for-me#2833

…r tests

Addressing the P1 note from the Codex reviewer: applyCustomHeaders is also called with a synthetic &http.Request{Header: ...} from the websockets executors (aistudio_executor.go, codex_websockets_executor.go), which forward only the header map. The previous continue meant a custom Host was dropped from that map, regressing virtual-host overrides on those flows. Mirror the value to both r.Host (for real net/http) and r.Header (for header-map-only consumers).

…ni, Claude, Codex, OpenAI, and Vertex

feat(api): integrate auth index into key retrieval endpoints for Gemi…

…dex-GPT and Codex-Gemini flows - Introduced `LastImageHashByItemID` in Codex-GPT and `LastImageHashByID` in Codex-Gemini for deduplication of generated images. - Added support for handling `partial_image` and `image_generation_call` types, with inline data embedding for Gemini and URL payload conversion for GPT. - Extended unit tests to verify image handling in both streaming and non-streaming modes.

…outer-for-me#2866) Anthropic has moved the 1M-context-window feature to General Availability, so the context-1m-2025-08-07 beta flag is no longer accepted and now causes 400 Bad Request errors when forwarded upstream. Remove the X-CPA-CLAUDE-1M detection and the corresponding injection of the now-invalid beta header. Also drop the unused net/textproto import that was only needed for the header-key lookup.

…remove-obsolete-1m-beta-header fix(executor): drop obsolete context-1m-2025-08-07 beta header

… links - Added VisionCoder sponsorship information to `README.md`, `README_CN.md`, and `README_JA.md`. - Updated external links to include `target="_blank"` for improved user experience. - Added new logo asset `visioncoder.png` for README use.

- Introduced `refreshIneffectiveBackoff` to prevent tight-looping in auto-refresh when token refresh fails to update expiry. - Adjusted refresh logic to apply backoff when `shouldRefresh` evaluates true. Closes: router-for-me#2830

- Refactored `/healthz` handler to support `HEAD` requests alongside `GET`. - Updated tests to include validation for `HEAD` requests with expected status and empty body. Closes: router-for-me#2929

…m-output-backfill fix(codex): backfill streaming response output

…t-host-header fix(util): forward custom Host header to upstream

- Added `GPT-Image-2` as a built-in model to avoid dependency on remote updates for Codex. - Updated model tier functions (`CodexFree`, `CodexTeam`, etc.) to include built-in models via `WithCodexBuiltins`. - Introduced new handlers for image generation and edit operations under `OpenAIAPIHandler`. - Extended tests to validate 503 response for unsupported image model requests.

… image handlers

…AI image handlers

Codex CLI gates the built-in image_generation tool behind AuthMode::Chatgpt (OAuth only). When clients connect via API key auth through CPA, the tool is absent from requests, making image generation unavailable through the reverse proxy. Changes: 1. Inject image_generation tool (codex_executor.go): Add ensureImageGenerationTool() that appends {"type":"image_generation","output_format":"png"} to the tools array if not already present. Applied to all three execution paths: Execute, executeCompact, and ExecuteStream. 2. Route aliases for Codex CLI direct access (server.go): Add /backend-api/codex/responses routes that map to the same OpenAI Responses API handlers as /v1/responses. This allows Codex CLI to connect via chatgpt_base_url config while keeping AuthMode::Chatgpt, which enables the built-in image_generation tool on the client side. 3. Unit tests (codex_executor_imagegen_test.go): Cover no-tools, existing tools, already-present, empty array, and mixed built-in tool scenarios.

…mplify session affinity logic

fix(executor): ignore null OpenAI stream usage chunks

fix: apply default auth-dir when config value is empty

- Updated `FileTokenStore` and related stores (`objectstore`, `gitstore`, `postgresstore`) to include the `disabled` flag in metadata for token storage. - Adjusted `Auth` metadata handling to initialize empty maps when absent. - Refined logic in `auto_refresh_loop` and `conductor` to exclude `disabled` tokens from refresh checks. - Added comprehensive unit tests to verify proper handling of the `disabled` flag in storage and retrieval operations.

…dling - Updated `EnsureIndex` logic to incorporate absolute and cleaned file paths when generating auth indexes. - Refined metadata handling to include OAuth type in auth index seed. - Improved compatibility for `json` file paths as sources in auth attributes. - Added unit tests to validate correct auth index behavior for various path and type scenarios.

…s-tool-output-context Fix Responses WebSocket tool output context repair

- Introduced `SetHomeEnabled` to enable/disable request-log forwarding to the home control plane. - Implemented `forwardRequestLogToHome` for non-streaming logs and `homeStreamingLogWriter` for real-time streaming logs. - Enhanced `FileRequestLogger` to bypass local logging when home forwarding is enabled. - Updated server configuration to dynamically toggle home request-log forwarding based on changes. - Added corresponding unit tests to ensure correct forwarding behavior and fallback mechanisms.

…ble usage statistics - Added `count` attribute to `homeAuthCount` requests to improve home message batching. - Enabled usage statistics for home mode by default and added config-level enforcement. - Adjusted failure logging to include detailed metadata in `UsageReporter`. - Updated multiple executors to pass error details to `PublishFailure` for better debugging. - Enhanced unit tests to validate `count` behavior and usage statistics enforcement across components.

…ve error handling in home mode - Introduced `shouldReturnLastErrorOnPickFailure` to streamline error return logic during provider selection. - Added `isHomeRequestRetryExceededError` for better home-specific error classification. - Updated fallback conditions to enhance error handling clarity in `pickNextMixed`.

…upport - Introduced `homeRuntimeAuths` to cache home auths for websocket session reuse. - Updated `pickNextViaHome` to prioritize cached auths for pinned websocket sessions. - Implemented automatic clearing of cached home auths when home mode is disabled. - Added unit tests to validate caching behavior, clearing logic, and fallback scenarios.

…ef counting - Added `homeRuntimeAuthSessions` and `homeRuntimeAuthRefs` for scoped caching of home auths per session. - Updated `pickNextViaHome` to prevent reuse of already-tried pinned auths during session retries. - Implemented reference counting for shared auths across multiple sessions to improve memory management. - Enhanced session cleanup logic to clear cached auths only when all referencing sessions are closed. - Added unit tests to validate scoped caching, retry logic, and session cleanup behavior.

…counting - Consolidated `homeRuntimeAuths` to store a map of session-scoped auth maps, replacing `homeRuntimeAuthSessions` and `homeRuntimeAuthRefs`. - Adjusted session cleanup logic to directly remove session-scoped auths without reference counting. - Added `GetExecutionSessionAuthByID` to retrieve auths scoped to a specific execution session. - Updated tests to reflect the new session-scoped caching behavior.

- Introduced `logHomeConfigChanges` to compare old and new configs, logging detected differences. - Leveraged `diff.BuildConfigChangeDetails` for structured change detection. - Adjusted logging behavior to enable debug-level logs dynamically when required.

… /v6 -> /v7 Copied 87 CPAP-exclusive paths (12 third-party providers: bt, codearts, codebuddy, codebuddy_ai, copilot, cursor, gitlab, iflow, joycode, kilo, kiro, qoder) plus management UI web assets and provider-specific handlers. Rewrote all /v6 import paths to /v7 (95 files affected). Added 4 provider constants (Kiro, Kilo, CodeArts, JoyCode) to internal/constant/constant.go needed by copied translator init files. Not yet wired into CPA core files — next batch handles registration in main.go, service.go, conductor.go, server.go, model_definitions.go, etc.

- internal/config/config.go: added KiroKey, BTKey, KiroFingerprintConfig structs, KiroFingerprint/KiroPreferredEndpoint/IncognitoBrowser fields, SanitizeKiroKeys/SanitizeBTKeys methods - internal/browser/browser.go: added SetIncognitoMode, IsIncognitoMode, CloseBrowser - internal/registry/model_registry.go: added SupportedEndpoints field to ModelInfo - internal/registry/model_definitions.go: added GetCodeBuddyModels, GetCodeBuddyAIModels, GetBTModels, GetCursorModels, GetGitHubCopilotModels, GetKiroModels, GetAmazonQModels, GetQoderModels, extended GetStaticModelDefinitionsByChannel + LookupStaticModelInfo - internal/tui/client.go: added GetUsage method - internal/api/handlers/management/handler.go: added sdkAuthManager field + SetSDKAuthManager - internal/runtime/executor/helps/token_helpers.go: added CountClaudeChatTokens, CollectOpenAIContent, collectClaudeMessages/Content/Tools, estimateImageTokens - internal/util/gemini_schema.go: added CleanupOrphanedRequiredInTools - sdk/auth/manager.go: added ProviderInfo, providerMetadata, ListProviders, SaveAuth - Fixed ApplyPayloadConfigWithRoot calls (added requestPath arg) in bt/iflow/qoder/compat_helpers Result: go build ./... passes with 0 errors

Replaced with CPAP version that includes Kiro, BT, CodeBuddy, Cursor, GitLab, GitHub Copilot, JoyCode, CodeArts, Qoder, Kilo examples. recent_requests and kiro-quota already in CPA v7.0.2 base (PR router-for-me#24 merged upstream).

- sdk/cliproxy/service.go: added 12 provider cases to registerModelsForAuth switch (kiro, kilo, cursor, github-copilot, codebuddy, codebuddy-ai, codearts, joycode, gitlab, bt, qoder, iflow) - sdk/cliproxy/service.go: added 12 executor registrations (NewKiroExecutor, etc.) Server binary builds successfully. Kiro models should now appear in /v1/models.

… providers - internal/api/server.go: register /kiro-quota route + StartKiroQuotaRefresher - sdk/cliproxy/auth/oauth_model_alias.go: add kiro, github-copilot, gitlab, cursor, codebuddy, codebuddy-ai, codearts, joycode, qoder, kilo, bt, iflow to OAuthModelAliasChannel switch (enables fork aliases in model list)

Ported from CPAP service.go: - fetchKiroModels: fetches models from Kiro API with fallback to static list - extractKiroTokenData: extracts token from auth attributes/metadata - convertKiroAPIModels: converts API response to ModelInfo - normalizeKiroModelID, formatKiroDisplayName: formatting helpers - generateKiroAgenticVariants: creates -agentic model variants Result: 62 models on test server (matches production exactly)

- internal/api/handlers/management/usage.go: added GetUsageStatistics, ExportUsageStatistics, ImportUsageStatistics handlers - internal/api/handlers/management/handler.go: added usageStats field + SetUsageStats - internal/api/server.go: registered /usage, /usage/export, /usage/import routes All 13 acceptance tests pass (models, chat, streaming, images, aliases, management)

Copilot

Copilot wasn't able to review this pull request because it exceeds the maximum number of files (300). Try reducing the number of changed files and requesting a review from Copilot again.

github-actions · 2026-05-11T10:01:40Z

This pull request targeted main.

The base branch has been automatically changed to dev.

The usageStats field was added to Handler but never initialized in NewHandler, causing GetUsageStatistics to always return an empty snapshot (total_requests: 0). Now initialized with usage.GetRequestStatistics() global singleton.

kslamph and others added 30 commits March 29, 2026 23:52

refactor(gemini-cli): simplify redundant if/else in project ID assign…

91387ca

…ment Both branches assign finalProjectID = responseProjectID, so move the assignment outside the conditional and keep only the logging inside.

fix(claude-auth): dedupe OAuth refresh and honor 429 backoff

6431cec

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(executor): route Claude refresh through retry-aware auth

29e32aa

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

fix(websocket): narrow compact replay detection

d2d0e6f

fix(websocket): gate compact replay by downstream support

4ca00f7

fix(tests): update model lookup references and enhance Claude executo…

d9a3b3e

…r tests

fix(tests): update Gemini family test case numbers for consistency

da43f63

feat(api): integrate auth index into key retrieval endpoints for Gemi…

894baad

…ni, Claude, Codex, OpenAI, and Vertex

fix(management): stabilize auth-index mapping

c26936e

fix(tests): remove obsolete config_auth_index_test file

a64141a

Merge pull request router-for-me#2892 from router-for-me/fix-provider

c6baa64

feat(api): integrate auth index into key retrieval endpoints for Gemi…

Merge pull request router-for-me#2898 from octo-patch/fix/issue-2866-…

e05abec

…remove-obsolete-1m-beta-header fix(executor): drop obsolete context-1m-2025-08-07 beta header

fix(codex): backfill streaming response output

bb8408c

perf(codex): avoid repeated output patch writes

b6781d6

feat(api): add support for HEAD requests to /healthz endpoint

1716a84

- Refactored `/healthz` handler to support `HEAD` requests alongside `GET`. - Updated tests to include validation for `HEAD` requests with expected status and empty body. Closes: router-for-me#2929

Merge pull request router-for-me#2939 from stringer07/fix/codex-strea…

3444820

…m-output-backfill fix(codex): backfill streaming response output

Merge pull request router-for-me#2834 from muzhi1991/fix/openai-compa…

8ced7a5

…t-host-header fix(util): forward custom Host header to upstream

feat(models): add Kimi K2.6 model entry to registry JSON

4fc2c61

fix(handlers): remove handling of unsupported n parameter in OpenAI…

fd71960

… image handlers

fix(handlers): remove references to unsupported n parameter in Open…

a188159

…AI image handlers

luispater and others added 23 commits May 9, 2026 10:51

refactor(cliproxy): remove ClaudeCodeSessionAffinity support and si…

0dcb8bd

…mplify session affinity logic

Merge pull request router-for-me#3221 from mochenya/main

09ac8a1

fix(executor): ignore null OpenAI stream usage chunks

Merge pull request router-for-me#3292 from lihan3238/fix-3272

68fddaa

fix: apply default auth-dir when config value is empty

Merge pull request router-for-me#3283 from wuchulonly/fix/responses-w…

3cbd587

…s-tool-output-context Fix Responses WebSocket tool output context repair

Merge branch 'v7' into dev

a44e5eb

Merge remote-tracking branch 'origin/pr/3239'

1fca942

chore: ignore .porting scratch dir

b936068

Copilot AI review requested due to automatic review settings May 11, 2026 10:01

Copilot AI reviewed May 11, 2026

View reviewed changes

github-actions Bot changed the base branch from main to dev May 11, 2026 10:01

hpylsy closed this May 12, 2026

hpylsy deleted the feat/cpap-integration branch May 12, 2026 05:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sync: rebase on CPA v7.0.2 upstream#29

sync: rebase on CPA v7.0.2 upstream#29
hpylsy wants to merge 185 commits into
Ve-ria:devfrom
hpylsy:feat/cpap-integration

hpylsy commented May 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

hpylsy commented May 11, 2026

What

Why

Approach

All 12 CPAP providers preserved

Gained from CPA v7.0.2

Testing

Future CPA sync

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants