repo·evals
· 2026-05-04 ·v6.10.4 (main@HEAD)

CLIProxyAPI

router-for-me/CLIProxyAPI

🛠78 / 100
🎯

🛠
🗺
01Research调研02Plan & design计划与设计03Code & review开发与评审04Package打包发布05Maintain维护
📍
📍
🧬

🛑
0–29
⚠️
30–49
🛠
50–79
🏭
80–100
78
🛠· 78 / 100
  • 6 claims passed, no critical failures
  • MIT / Apache / etc., installable per deployment.install_methods
  • release_pipeline_score=3 + pushed in 90-day window
  • multilingual_readme=true
  • static-only eval; live e2e pending

#1👤
#2🎯
#3🧭
#4

Your tool / IDE你的工具 / IDE(OpenAI-shaped request)(OpenAI 形状请求)HTTP listener + parseHTTP 监听 + 解析(OpenAI / Gemini / Claude)(OpenAI / Gemini / Claude)Account rotator账号调度(round-robin / failover)(轮询 / 故障切换)Provider forwardProvider 转发(Claude / Codex / Gemini CLI)(Claude / Codex / Gemini CLI)Response translator响应翻译(back to client format)(转回客户端格式)OpenAI-shaped responseOpenAI 形状响应(your tool consumes)(你的工具消费)

Download release binary (8 OS/arch combinations)macOS / Linux / Windows / FreeBSD × aarch64+amd64easy
go install / go buildany (Go 1.26+)moderate
Embed via Go SDKany Go servicemoderate
  • 🌐
Claude Code / ChatGPT Codex / Gemini CLI / Antigravity
Upstream OAuth sessions to wrap
Wraps existing CLI subscriptions — no new keys needed
Kimi / Vertex (extra OAuth flows)
Additional model providers
Optional providers
· 7
4 2 1
+40
+17
+12
+9
0
0

6 / 7
passed claim-001

passed claim-002

passed claim-003

passed claim-004

passed claim-005

untested claim-006

passed claim-007

input_contract
output_contract
determinism
idempotence
no_skill_callouts
failure_mode_clarity

workflow_correctness
declared_call_graph
stop_conditions
handoff_points
atom_evidence
error_propagation
partial_failure_handling

  • core user-facing layer untested → capped at 'usable'
  • evidence_completeness='partial' (not portable) → capped at 'usable'

  • only 3/4 critical claims covered

archetype: api-servicecore_layer_tested? Falseevidence: partialrecommended: usablefinal: usable
ceiling 1 · core user-facing layer untested → capped at 'usable'
ceiling 2 · evidence_completeness='partial' (not portable) → capped at 'usable'

claim-0017 个上游 protocol translator 真实存在criticalprotocol-coverage● passed
claim-0028 平台 release 二进制全员到齐criticaldistribution● passed
claim-0037 家 OAuth login flow 真实存在criticaloauth● passed
claim-004多账号 round-robin 配置在 config.example 真实出现highload-balancing◐ partial
claim-005Reusable Go SDK 真有文档 + 子包highsdk● passed
claim-006端到端 happy path:装 + 登录 + 一次 OpenAI 兼容请求criticalend-to-end○ untested
claim-007README 赞助段不冒充功能highmarketing-honesty◐ partial

0%
0.00s
0

run-static-checks

2026-05-04
0% tokens in ? / out ?

run-static-checks

2026-05-04
0% tokens in ? / out ?
# CLIProxyAPI — final verdict (2026-05-04)

## Repo

- **Name:** router-for-me/CLIProxyAPI
- **Branch evaluated:** main@HEAD (release v6.10.4)
- **Archetype:** api-service
- **Layer:** **molecule** — config-driven routing across predefined
  translators
- **Eval framework:** repo-evals layer model v1 (f9ed1e9)

## Bucket

**`usable`** — structurally mature OSS proxy, strong static
evidence on all 6 audit dimensions. Molecule rule caps `usable`
without a logged live API round-trip.

## What was evaluated

### Atom + molecule level (static, this run)

| Claim | Status | Notes |
|---|---|---|
| 001 7 translators | passed | antigravity / claude / codex / common / gemini-cli / gemini / openai under internal/translator/ |
| 002 8 release binaries | passed | darwin/freebsd/linux/windows × aarch64+amd64 + checksums.txt |
| 003 7 OAuth flows | passed | internal/auth/: 6 real providers + `empty` placeholder; includes Kimi + Vertex not headlined in EN README |
| 004 multi-account config | passed_with_concerns | 406-line config.example.yaml; concept exists; deeper schema read recommended |
| 005 Reusable Go SDK | passed | docs/sdk-usage.md 163 lines + 9 sdk/ packages + 4 docs in EN+CN |
| 007 sponsor disclosure | passed_with_concerns | 5+ sponsors front-load README; Overview section cleanly separates OSS features from sponsor offerings |

### Molecule level (deferred)

| Claim | Status | Required |
|---|---|---|
| 006 live OpenAI-compatible call | untested | Start binary, do one OAuth login, curl `/v1/chat/completions`, log response |

## Real findings worth surfacing

1. **Cross-platform release pipeline is mature.** 8 OS/arch
   combinations + checksums shipped consistently. FreeBSD coverage
   is unusual and signals goreleaser-style automation.

2. **Code is richer than docs.** Two OAuth providers (Kimi, Vertex)
   in `internal/auth/` not headlined in the English README; the
   community probably knows but a search would miss them.

3. **SDK story is unusually strong.** Most OSS proxies stop at "run
   the binary". This one ships a 9-package Go SDK with 4 doc files
   in EN+CN — implies the project expects to be embedded into other
   Go services, not just run standalone.

4. **README sponsorship section is heavy.** ~50 lines of sponsor
   tables before the "Overview" — honest disclosure but front-loaded.
   A casual reader skimming the top might mistake the sponsors for
   official providers; the Overview cleanly separates the OSS feature
   set from sponsor offerings, but a reader who stops at the top
   misses that.

## Why not higher

`usable` is the right ceiling because:

- No live API round-trip logged on this evaluator's machine. (Wendy
  already runs CLIProxyAPI per memory record, but the eval is
  independent static-only.)
- claim-004 needs a deeper config-schema audit to confirm the
  multi-account / round-robin claim is fully wired (not just keyword-
  level present).

## Path to `reusable`

1. Start binary, perform one OAuth login (claude / gemini / codex —
   pick one).
2. Send a `/v1/chat/completions` request, verify the structured
   response.
3. Run an OAuth-expired scenario; verify clear error.
4. Read config.example.yaml end-to-end and confirm multi-account
   list semantics.
5. Update claim-006 → `passed`, claim-004 → `passed` if the schema
   is fully wired. Re-run verdict_calculator.

## Recommended

```yaml
current_bucket: usable
status: evaluated
```