repo·evals
· ·main@HEAD

Agent-Reach

Panniantong/Agent-Reach

🛠79 / 100
🎯

📝
🗺
01Signal scanning信号发现02Content acquisition内容获取03Content understanding内容理解04Topic curation选题决策05Content production内容生产06Creative assembly创意组装07Distribution & feedback分发反馈08Learning学习
📍
🧬

🛑
0–29
⚠️
30–49
🛠
50–79
🏭
80–100
79
🛠· 79 / 100
  • 6 claims passed, no critical failures
  • MIT / Apache / etc., installable per deployment.install_methods
  • release_pipeline_score=2 + pushed in 90-day window
  • multilingual_readme=true
  • static-only eval; live e2e pending

#1👤
#2🎯
#3🧭
#4

Query / URL查询 / URLfrom agent从 agent 来Platform identify平台识别Auth / cookie鉴权 / cookieload加载Per-platform reader各平台 reader(search / fetch)(搜 / 抓)Structured JSON结构化 JSONfor LLM给 LLMAgent quotes /Agent 引用 /cites with sources标来源

git clone + uv syncany (Python 3.10+)easy
  • 🌐
Per-platform public web (Twitter / Reddit / YouTube / GitHub / Bilibili / 小红书)
Source of read content
Public access; some platforms need cookie/token (X especially); zero per-call fees
AI agent harness (Claude Code / Codex / etc.)
Caller — invokes agent-reach as a tool
Standard agent-side cost; agent-reach itself adds no LLM cost
· 7
5 1 1
+40
+15
+15
+9
0
0

6 / 7
passed claim-001

passed claim-002

passed claim-003

passed claim-004

passed claim-005

untested claim-006

passed claim-007

input_contract
output_contract
determinism
idempotence
no_skill_callouts
failure_mode_clarity

workflow_correctness
declared_call_graph
stop_conditions
handoff_points
atom_evidence
error_propagation
partial_failure_handling

  • core user-facing layer untested → capped at 'usable'
  • evidence_completeness='partial' (not portable) → capped at 'usable'

  • only 2/3 critical claims covered

archetype: pure-clicore_layer_tested? Falseevidence: partialrecommended: usablefinal: usable
ceiling 1 · core user-facing layer untested → capped at 'usable'
ceiling 2 · evidence_completeness='partial' (not portable) → capped at 'usable'

claim-001Multi-platform internet-reading skill — Twitter / Reddit / YouTube / GitHub / Bilibili / 小红书criticalplatform-coverage● passed
claim-002MIT LICENSE presentcriticallicensing● passed
claim-003Multi-language documentation (EN / 中文 / 日本語 / 한국어)highdocs-i18n● passed
claim-004Test suite + CONTRIBUTING + SECURITY disciplinehighproject-discipline● passed
claim-005Recently active developmenthighmaintenance● passed
claim-006Live e2e — actually read content from at least one platform via the CLIcriticalend-to-end○ untested
claim-007Zero API fees claim — uses public-web access not paid APIshighcost-architecture◐ partial

0%
0.00s
0