repo·evals
· 2026-05-04 ·main@HEAD (4.0.456, private package, not on npm registry)

@remotion/skills

remotion-dev/skills

🛠69 / 100
🎯

📝
🗺
01Signal scanning信号发现02Content acquisition内容获取03Content understanding内容理解04Topic curation选题决策05Content production内容生产06Creative assembly创意组装07Distribution & feedback分发反馈08Learning学习
📍video
🧬

🛑
0–29
⚠️
30–49
🛠
50–79
🏭
80–100
69
🛠· 69 / 100
  • 4 claims passed, no critical failures
  • README may claim a license but no LICENSE file exists
  • release_pipeline_score=2 + pushed in 90-day window
  • EN-only or ZH-only README
  • static-only eval; live e2e pending

#1👤
#2🎯
#3🧭
#4

Remotion taskRemotion 任务(write a component / scene / asset)(写组件 / 场景 / 素材)@remotion/skills@remotion/skills(SKILL.md + 35 rule docs)(SKILL.md + 35 个规则)Render-correct Remotion coderender 一次过的代码(first try)(first try 通过)

git clone + copy skills/remotion/ to ~/.claude/skills/anymoderate
Inside Remotion monorepo (intended path)anyeasy
  • 📡
  • 🔒
Anthropic Claude / Cursor / Codex
LLM that consumes the skill
Standard agent-side cost
Google Fonts CDN
Font loading at course render time
Optional — only used if rendered course imports fonts
· 5
4 1
+40
+9
+15
+3
+5
-3

4 / 5
passed claim-001

passed claim-002

passed claim-003

passed claim-004

untested claim-005

input_contract
output_contract
determinism
idempotence
no_skill_callouts
failure_mode_clarity

  • evidence_completeness='partial' (not portable) → capped at 'usable'

  • only 1/2 critical claims covered

archetype: prompt-skillcore_layer_tested? Trueevidence: partialrecommended: usablefinal: usable
ceiling 1 · evidence_completeness='partial' (not portable) → capped at 'usable'

claim-001SKILL.md 存在且具备 Claude Skills frontmattercriticalskill-shape● passed
claim-002rules/ 目录是 SKILL.md 的深度参考资料highdepth● passed
claim-003包是 internal / private,不通过 npm 公开发行highdistribution● passed
claim-004src/ 是真实可跑的 Remotion studio harnesshighharness● passed
claim-005端到端:把 SKILL.md 装到自己的 agent,写出符合规则的 Remotion 代码criticalend-to-end○ untested

0%
0.00s
0

run-static-checks

2026-05-04
0% tokens in ? / out ?

run-static-checks

2026-05-04
0% tokens in ? / out ?
# @remotion/skills — final verdict (2026-05-04)

## Repo

- **Name:** remotion-dev/skills (`@remotion/skills`)
- **Branch evaluated:** main@HEAD (4.0.456, private package)
- **Archetype:** prompt-skill
- **Layer:** **atom** — single SKILL.md with reference rules
- **Eval framework:** repo-evals layer model v1 (4acbd5d)

## Bucket

**`usable`** — sparse public README hides high-quality, well-curated
content. Atom rule caps `usable` until a live agent session
demonstrates the skill drives behaviour.

## What was evaluated

### Atom level (static, this run)

| Claim | Status | Notes |
|---|---|---|
| 001 SKILL.md shape | passed | Standard frontmatter (name + description + tags) + "When to use" + 340-line body with code examples |
| 002 rules depth | passed | 35 markdown rules, 30 over 1 KB; covers audio, captions, ffmpeg, lottie, mapbox, fonts, etc. |
| 003 internal-only distribution | passed | `package.json: { private: true }` + npm registry 404, consistent with README's "internal package" |
| 004 studio harness | passed | `src/index.ts` calls registerRoot; Root.tsx imports 3 rule assets that all exist; the maintainers self-test a few rules as live React |

### Atom level (deferred — live)

| Claim | Status | Required |
|---|---|---|
| 005 agent reads SKILL.md and generates Remotion code | untested | Copy SKILL.md + rules/ into a Claude Code session, ask for a Remotion task, verify the agent references the skill |

## Real findings worth surfacing

1. **README dramatically undersells the package.** One sentence
   ("internal package, no documentation") reads as abandonware, but
   the actual content is 110 KB of curated guidance authored by the
   official Remotion team. A maintainer-facing repo with an
   end-user-facing skill — the README is honest about the audience
   but misleading about the value.

2. **The studio harness is a self-test.** `src/Root.tsx` loads three
   rules/assets/*.tsx files as live Remotion compositions. That's a
   structural guarantee that at least three rules survive `npm run
   dev` — better than a "all rules are markdown only" claim.

3. **Internal package = clone-and-copy.** Users wanting this skill
   for their own AI agent need to git clone the repo and copy
   `skills/remotion/` into their skills directory. There's no `npm
   install` path, and the README doesn't tell you that — a small
   discoverability gap.

## Why not higher

`usable` is the right ceiling because:

- No live agent evidence on this machine. The skill's job is to
  influence LLM output; that can only be measured by running an LLM.
- Single-evaluator, single-day pass.

## Path to `reusable`

1. `git clone` and copy `skills/remotion/` to `~/.claude/skills/remotion/`.
2. Open Claude Code, ask: "use Remotion to write a 2-second fade-in
   animation".
3. Verify the agent uses `useCurrentFrame()` + `interpolate()` +
   Easing as the skill prescribes.
4. Repeat for an audio or caption topic; verify the agent loads the
   matching rules/<topic>.md.
5. Log under `runs/<date>/run-live-agent/`.

## Recommended

```yaml
current_bucket: usable
status: evaluated
```