adk-code:code-perf
Source
plugins/adk-code/skills/code-perf/SKILL.md
Skill Body
code-perf — diagnose and fix a perf regression / hit a perf budget
A Principal Engineer’s “X is slow” skill. Measure first, fix second, re-measure to confirm the win, add a guardrail.
When to use
- “checkout API p99 jumped from 250ms to 1.2s after Tuesday’s deploy”
- “the dashboard takes 8 seconds to load”
- “memory usage keeps growing on the workers”
- “hit p99 < 500ms on /api/products”
- “the build takes 6 minutes; can we get it under 2?”
- The user pastes an APM trace, a flame graph, or a profile output.
When NOT to use
| If the prompt is about | Use instead |
|---|---|
| A correctness bug | /adk-code:code-bugfix |
| New feature work | /adk-code:code-write |
| Pure refactor with no measurement | /adk-code:code-refactor |
| Library / framework upgrade (which may improve perf as a side effect) | /adk-code:code-migrate |
| API contract change | /adk-code:code-api |
| Security mitigation that happens to be slow | /adk-code:code-security |
Common prompts (auto-route triggers)
| Prompt pattern | Notes |
|---|---|
| “X is slow” / “Y times out” | Default match. |
| “p99 / p95 / p50 of …” + a service / endpoint | Default match. |
| “hit ≤ …” | Budget-driven. |
| “memory usage on …” / “OOM in …” | Memory variant. |
| “build / CI is slow” | Build-perf variant. |
| Paste of an APM trace, flame graph, profiler output | Treat as evidence; fold into Phase 2. |
Inputs
| Input | Required | Default |
|---|---|---|
| <perf-target-or-symptom> | yes | the verbatim user request |
| --budget <metric>=<value> | optional | e.g. --budget p99=500ms or --budget rss=300MB |
| --auto | optional | (default) skip per-phase approval gates |
| -i / --interactive | optional | per-phase approval; mutually exclusive with --auto |
Workflow
Phase 0 — prompt expand
- Restate: "Hit budget X on Y" or "Diagnose regression in Y".
- Resolve repo via repos.md.
- Resolve service tag via datadog.md service_aliases (e.g. "checkout" → "checkout-api").
- Pick task slug; create .temp/task-<slug>/.

Phase 1 — preflight
- git status clean (dirty → ask).
- tests / typecheck / lint commands resolved.
- Baseline check: tests green on HEAD.
- If the work needs Datadog: bin/adk-mcp-health for DD reachability; DATADOG_API_KEY + DATADOG_APP_KEY present (legacy DD_API_KEY / DD_APP_KEY also accepted).

Phase 2 — MEASURE (capture baseline)
- Capture the current performance signal:
  - Endpoint perf: DD APM p50/p95/p99 over a representative window.
  - Function-level perf: a benchmark / profiler run; flame graph if available.
  - Memory: heap snapshot, RSS over time, top / ps output.
  - Browser perf: Lighthouse / Chrome DevTools perf trace.
- Save to .temp/task-<slug>/measurement-baseline.md.
- Approval gate (under -i) before moving to identify.

Phase 3 — IDENTIFY (bottleneck with evidence)
- Profile / trace / inspect to identify the bottleneck.
- QUOTE the relevant trace / profiler output (≤15 words per quote).
- Save to .temp/task-<slug>/bottleneck.md.
- Hypothesis confidence: low / medium / high.
- Approval gate unless --auto.

Phase 4 — FIX (smallest correct change)
- Spawn implementer subagent with bottleneck.md.
- Apply the smallest correct change.
- Run tests; confirm no regression.

Phase 5 — VERIFY (re-measure)
- Re-run the same measurement protocol from Phase 2.
- Capture before/after to .temp/task-<slug>/measurement-after.md.
- If the fix didn't move the metric, STOP — diagnosis was wrong; loop back to Phase 3.

Phase 6 — GUARDRAIL (so the regression can't silently recur)
- Add ONE of:
  - a. A perf test in the test suite (e.g. assert "operation completes in <Xms" with a generous threshold).
  - b. A CI budget check (e.g. webpack-bundle-analyzer threshold, lighthouse-ci threshold).
  - c. A Datadog monitor (alert on p99 > Yms over a 5-min window).
- Document the chosen guardrail.

Phase 7 — REPORT
- .temp/task-<slug>/report.md: budget / symptom, before / after, the fix, the guardrail, residual risk.

See references/workflow.md for the detailed step list.
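Phase 6 option (a) can be sketched as a pytest-style perf test. The function name, workload, and 500ms threshold below are illustrative stand-ins, not part of the skill:

```python
import time


def render_dashboard():
    # Stand-in for the hot path under budget; replace with the real call.
    return sum(i * i for i in range(50_000))


def test_render_dashboard_within_budget():
    # Generous threshold (well above the post-fix measurement) so normal
    # jitter does not flake the suite, but a real regression still fails it.
    start = time.perf_counter()
    render_dashboard()
    elapsed_ms = (time.perf_counter() - start) * 1000
    assert elapsed_ms < 500, f"perf budget blown: {elapsed_ms:.1f}ms >= 500ms"
```

A test like this runs in the normal suite, so the guardrail costs nothing extra in CI setup; the trade-off is that CI machines are noisy, which is why the threshold must stay generous.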
Persona
“Measure first, fix second, re-measure third.” Never trust intuition about where the bottleneck is. Always re-measure after the fix. Add a guardrail so the regression cannot silently recur. Core principles: verify before claiming, smallest correct change, severity over volume, reversibility first, respect autonomy, one source of truth.
See references/persona.md.
Constitution
Must do:
- Capture before/after measurements with the same protocol.
- Identify the bottleneck with quoted (≤15 words) trace / profiler / metric evidence.
- Apply the smallest correct fix.
- Re-measure and confirm the win (the metric moved in the right direction).
- Add a guardrail (perf test, CI budget, or DD monitor).
- State confidence on the diagnosis (low / med / high). Low confidence → STOP and surface.
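The re-measure requirement above ("the metric moved in the right direction") can be sketched as a simple check. The numbers and the lower-is-better assumption are illustrative:

```python
def confirm_win(baseline, after, budget=None):
    """True only if the metric improved (lower is better here)
    and, when a budget is given, the budget is met."""
    improved = after < baseline
    within_budget = budget is None or after <= budget
    return improved and within_budget


# p99 dropped from 1200ms to 430ms against a 500ms budget: a confirmed win.
assert confirm_win(1200, 430, budget=500)
# Improved but still over budget: not done yet.
assert not confirm_win(1200, 600, budget=500)
```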
Must not do:
- Optimize without measuring.
- Ship a fix without re-measurement.
- Skip the guardrail (the regression will silently come back).
- Apply micro-optimizations on cold paths (premature optimization).
- Trade readability for a 1% win.
- Push, commit, or open a PR.
Anti-patterns
See references/anti-patterns.md. Highlights:
- “I refactored this; it should be faster.” Without measurement, that’s a guess.
- Premature optimization on cold paths.
- Trading readability for a 1% win.
- Skipping the guardrail; the regression silently recurs.
- Quoting a vague metric (“it feels faster”) instead of a measured one.
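Replacing "it feels faster" with a measured number can be this small. The workload being timed is a stand-in for the real hot path:

```python
import statistics
import time


def measure_ms(fn, runs=20):
    # Repeat the call and report percentiles, not a single lucky run.
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000)
    return {
        "p50": statistics.median(samples),
        "p95": sorted(samples)[int(0.95 * (runs - 1))],
    }


baseline = measure_ms(lambda: sorted(range(20_000)))
print(f"p50={baseline['p50']:.2f}ms p95={baseline['p95']:.2f}ms")
```

The same function re-run after the fix gives an apples-to-apples before/after pair, which is exactly what the Phase 2 / Phase 5 "same protocol" rule asks for.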
Output
| Path | Content |
|---|---|
| .temp/task-<slug>/prompt.txt | Verbatim user prompt + ISO timestamp |
| .temp/task-<slug>/measurement-baseline.md | Phase 2 baseline |
| .temp/task-<slug>/bottleneck.md | Phase 3 identification with quoted evidence |
| .temp/task-<slug>/measurement-after.md | Phase 5 post-fix re-measurement |
| .temp/task-<slug>/plan.md | The fix plan + the guardrail design |
| .temp/task-<slug>/validation/per-skill/code-perf.md | Implementer + measurement evidence |
| .temp/task-<slug>/report.md | Final report |
References shipped with this skill
| File | Purpose |
|---|---|
| references/persona.md | Measure-first persona + status banner |
| references/workflow.md | Detailed Phase 0–7 stage list |
| references/modes.md | What --auto and -i mean for code-perf |
| references/interaction-contract.md | Canonical interaction contract |
| references/anti-patterns.md | What NOT to do, with reasons |
| references/examples.md | 4 worked examples (latency, memory, browser, build) |
| references/output-format.md | Shape of measurement-*.md, bottleneck.md, report.md |
| references/artifact-format.md | .temp/task-<slug>/ layout |
| references/validator.md | Per-phase validation gates (esp. before/after measurements) |
| references/how-it-works.md | Mermaid diagrams: measure-fix-verify cycle |
| references/clarifying-questions.md | Questions Phase 0/3 may ask under -i |
| references/measurement-protocol.md | Per-tool measurement recipes (DD APM, profilers, Lighthouse, etc.) |
Additional links
The skill may WebFetch / use these for extra context when relevant:
- Datadog APM dashboards (URL from ~/.config/adk/datadog.md common_dashboards).
- The repo’s existing perf tests / benchmarks.
- Vendor docs for the framework’s perf-tuning guides (e.g. React Profiler, Spring Boot Actuator).