/onyx Tune my PID controller gains, minimize error, the agent starts turning that goal into measured git history.
From Prompt to Branch
The agent first converts your prompt into branch metadata:| Field | Example |
|---|---|
| Name | tune-pid-gains |
| Metric | tracking_error |
| Unit | error or % |
| Direction | minimize |
| Description | Tune PID gains while keeping overshoot below 5%. |
onyx/{name}.
Baseline Setup
Before changing product code, the agent should create and commit:onyx/onyx.md: the research brief and steering file;onyx/eval.sh: the measurement script;- optionally
onyx/checks.sh: correctness checks.
Experiment Loop
Each attempt follows the same shape:What Gets Recorded
Every experiment points to an exact commit and includes:- status, such as
succeeded,failed, orchecks_failed; - primary metric value;
- secondary metrics emitted by
eval.sh; - output summary;
- optional checks result;
- agent notes about what was learned.
How to Watch Progress
In the Onyx app:- Graph view shows the branch and current best metric.
- Timeline shows every attempt and the best-so-far steps.
- File tree shows changed files plus
onyx/context. - Diff view compares an experiment to the previous best when possible.
Manual Debugging
The agent runs the CLI for you, but these commands are useful when debugging a setup:onyx/eval.sh first. It must print a numeric line such as: