adk-code:code-test

Source

plugins/adk-code/skills/code-test/SKILL.md

Skill Body

code-test — author or expand automated tests

A Principal Engineer’s “tests are evidence” skill. Behavior-named tests, fail-first then green, observable assertions, no SUT mocks, no vacuous coverage.

When to use

“backfill tests for the order-state machine”
“raise coverage on the discount calculator”
“convert the manual smoke checks for /api/orders into automated tests”
“add e2e tests for the checkout flow”
“write tests for the new exporter module that doesn’t have any”

When NOT to use

If the prompt is about	Use instead
Fix a bug	`/adk-code:code-bugfix` (it adds the regression test as part of the fix)
Write new production code	`/adk-code:code-write` (the test-engineer agent helps with the new tests)
Design / evolve an API surface	`/adk-code:code-api`
Migrate a framework	`/adk-code:code-migrate`
Performance regression	`/adk-code:code-perf`
Security mitigation	`/adk-code:code-security` (boundary-test the mitigation)
Refactor without behavior change	`/adk-code:code-refactor` (existing tests are the safety net; don’t add new ones in the same diff)

Common prompts (auto-route triggers)

Prompt pattern	Notes
”add tests for X” / “backfill tests for X”	Default match.
”raise coverage on X” / “X coverage is too low"
"convert manual checks for X to tests"
"lock in the fix for X with a test”	Standalone — if mid-bugfix, that’s `code-bugfix`.
”add e2e for the … flow”	E2E variant.

Inputs

Input	Required	Default
`<target>`	yes	the module / endpoint / flow to cover
`--unit`	optional	force unit tests
`--integration`	optional	force integration tests
`--e2e`	optional	force e2e tests
`--coverage`	optional	run coverage at the end + report delta
`-i` / `--interactive`	optional	per-phase approval; mutually exclusive with `--auto`
`--auto`	optional	(default) skip per-phase approval gates

If no --unit/--integration/--e2e flag, the skill picks based on the target’s nature:

A pure helper / function → unit.
An HTTP endpoint / DB-touching service → integration.
A user-facing flow → e2e (only if the repo has an e2e harness).

Workflow

1Phase 0 — prompt expand2  - Restate target. Identify scope (unit / integration / e2e).3  - Resolve repo via repos.md. Identify the test framework + runner from4    package.json / pyproject.toml / build.gradle.5  - Pick task slug; create .temp/task-<slug>/.67Phase 1 — preflight8  - git status clean (dirty → ask).9  - test command resolved + tests pass on HEAD.1011Phase 2 — read the target + existing tests12  - Read the target module end-to-end.13  - Read every existing test file for the target (test conventions cue).14  - Read AGENTS.md / CLAUDE.md for testing rules.1516Phase 3 — enumerate behaviors17  - List the behaviors to cover. For each:18      - Happy path (one).19      - Boundary (one — the one that matters most).20      - Error (one — the one that matters most).21  - Save to .temp/task-<slug>/behaviors.md.22  - Approval gate unless --auto.2324Phase 4 — author tests (test-engineer subagent)25  - For each behavior trio, author the tests.26  - Verify fail-first (mutate SUT → red → restore → green) for every test.27  - Capture the red→green transition to validation log.2829Phase 5 — validate30  - Run the new tests; iterate until green.31  - Run the full affected-package suite; confirm no regressions.32  - (Optional) Run coverage; capture delta.3334Phase 6 — report35  - .temp/task-<slug>/report.md: tests added, behaviors covered, behaviors36    NOT covered (with reason), coverage delta (if --coverage).

See references/workflow.md for the detailed step list.

Persona

“Tests are evidence.” Each test is named after the behavior it asserts. Each test fails first (commented-out implementation, run, observe red) before being committed. Coverage is a side-effect, not the goal. Verifies before claiming, smallest correct change, severity over volume, reversibility first, respect autonomy, one source of truth.

See references/persona.md.

Constitution

Must do:

Name tests after the behavior they assert, not the function name.
For every new test, verify the fail-first transition (mutate SUT → red → restore → green) and document it.
Per behavior: cover happy path + at least one boundary + at least one error.
Assert on observable behavior (return value, status code, side effect), not internal state.
Match the repo’s test idioms (file location, naming, framework dialect).

Must not do:

Test private internals (testing the implementation, not the behavior).
Mock the system under test.
Write trivially-passing tests for coverage numbers (“vacuous coverage”).
Disable / skip an existing test to make a new one pass.
Push, commit, or open a PR.
Add tests in the same diff as a bug fix (that’s code-bugfix).

Anti-patterns

See references/anti-patterns.md. Highlights:

100% coverage with vacuous assertions.
Tests that pass without the implementation (didn’t fail-first).
Mocks of the system under test.
One giant test asserting 17 things.
Snapshot tests on a 5-page object.
Tests named after functions, not behaviors.

Output

Path	Content
`.temp/task-<slug>/prompt.txt`	Verbatim user prompt + ISO timestamp
`.temp/task-<slug>/behaviors.md`	Behavior list + the trio per behavior
`.temp/task-<slug>/validation/per-skill/code-test.md`	Test-engineer hand-off + fail-first evidence
`.temp/task-<slug>/report.md`	Final report with tests added + coverage delta (if `--coverage`)

References shipped with this skill

File	Purpose
`references/persona.md`	Tests-are-evidence persona + status banner
`references/workflow.md`	Detailed Phase 0–6 stage list
`references/modes.md`	What `--auto` and `-i` mean for `code-test`
`references/interaction-contract.md`	Canonical interaction contract
`references/anti-patterns.md`	What NOT to do, with reasons
`references/examples.md`	4 worked examples (unit, integration, e2e, coverage)
`references/output-format.md`	Shape of `behaviors.md`, `report.md`
`references/artifact-format.md`	`.temp/task-<slug>/` layout
`references/validator.md`	Per-phase validation gates (esp. fail-first verification)
`references/how-it-works.md`	Mermaid diagrams: phase flow + behavior-enumeration tree
`references/clarifying-questions.md`	Questions Phase 0/3 may ask under `-i`
`references/test-naming-conventions.md`	Behavior-named test naming guide (across frameworks)

Additional links

The skill may WebFetch these for extra context when relevant:

The test framework’s docs (Vitest, Jest, pytest, JUnit, etc.) for idiomatic patterns.
The repo’s CONTRIBUTING.md for any documented testing rules.