CLI Reference
skillci <command> [options]
Invoke via the built binary (node dist/cli/index.js …), or npm run demo for
the offline run. After npm link / global install, the skillci binary is on
your PATH.
skillci run
Run a baseline-vs-candidate evaluation (or the offline demo) and print the report.
| Flag | Default | Description |
|---|---|---|
--agent <kind> |
claude-code |
Target agent: claude-code | cursor | codex. |
--baseline <dir> |
— | Directory holding the baseline (trusted) config. |
--candidate <dir> |
— | Directory holding the candidate (proposed) config. |
--tasks <dir> |
bundled samples | Directory of task definitions. |
--demo |
false |
Fully-offline run over the sample tasks via the mock adapter. |
--open-pr |
false |
Open a real PR when promotable (otherwise dry-run). |
--no-color |
— | Disable colorized output. |
Behavior notes
- If
--baselineor--candidateis omitted,runfalls back to demo mode. - The real agent adapter is used when available; otherwise it falls back to the offline mock (and says so).
- Judge backend: Anthropic SDK when
ANTHROPIC_API_KEYis set, else theclaude -pCLI judge when judging Claude and its binary is present, else skipped.
Exit codes
| Code | When |
|---|---|
0 |
Verdict improved or neutral. |
| non-zero | Verdict regressed (use as a CI gate). |
Examples
# Offline, deterministic
skillci run --demo
# Live, on a Claude subscription (no API key)
skillci run --agent claude-code --baseline ./cfg/base --candidate ./cfg/cand
# Custom task suite, open a PR if promotable
skillci run --agent claude-code --baseline ./base --candidate ./cand \
--tasks ./my-tasks --open-pr
skillci validate <dir>
Discover and validate the config artifacts for an agent in a directory — a quick sanity check that your skills/hooks/rules parse before you evaluate them.
| Flag | Description |
|---|---|
--agent <kind> |
Agent to interpret the directory as (default claude-code). |
--baseline <dir> |
Optional baseline to diff against (reports added/removed/modified artifacts). |
skillci validate ./my-config --agent claude-code
skillci validate ./pr-config --agent claude-code --baseline ./main-config
skillci tasks
List the available task definitions (bundled samples by default).
| Flag | Description |
|---|---|
--tasks <dir> |
List tasks from a custom directory instead of the samples. |
skillci tasks
skillci tasks --tasks ./my-tasks
Output shows each task's id, target agent, title, check count, and whether it's judged.
Global
skillci --help/skillci <command> --help— usage.skillci --version— version.