Complete installation before continuing. You should be logged in with onyx login.
1. Grant Repository Access
Open the Onyx app, sign in, and make sure the Onyx GitHub App has access to the repository you want to optimize:
- Install or manage the GitHub App for the GitHub account or organization.
- Grant access to the repository.
- For monorepos, use projectPath when starting the branch so Onyx tracks the relevant subdirectory.
2. Install the Agent Skill
Run this once for your local agent environment:
The skill lets the agent create onyx/ files, run evals, log experiments, and push or sync results.
By default this installs the Claude Code personal skill at
~/.claude/skills/onyx/SKILL.md, so Claude Code can invoke it as /onyx.
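To confirm the install landed where the text above says it does, a quick check along these lines may help. This is a minimal sketch: the function name skill_installed is hypothetical, and only the default path ~/.claude/skills/onyx/SKILL.md comes from this page.

```shell
#!/bin/sh
# Check whether the Onyx skill file exists.
# skill_installed is a hypothetical helper, not part of the Onyx CLI.
# With no argument it checks the default Claude Code personal skill path.
skill_installed() {
  skill_file="${1:-$HOME/.claude/skills/onyx/SKILL.md}"
  [ -f "$skill_file" ]
}

if skill_installed; then
  echo "Onyx skill found"
else
  echo "Onyx skill not found; rerun the install step"
fi
```

If the skill was installed somewhere other than the default location, pass that path as the first argument.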
3. Start with /onyx
Open your repository in an Onyx-enabled agent such as Codex, Claude Code, Pi, or Cursor. Then give the agent a goal:
4. What the Agent Creates
The Onyx agent sets up the research surface for you.

| File or branch | What it does |
|---|---|
| onyx/{name} | The append-only git branch for this research direction. |
| onyx/onyx.md | The shared research brief and memory for future agent runs. |
| onyx/eval.sh | The measurement entry point. It prints METRIC name=value lines. |
| onyx/checks.sh | Optional correctness checks that run after a passing eval. |
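The table describes onyx/eval.sh as the measurement entry point that prints METRIC name=value lines. A minimal sketch of such a script might look like the following. The workload, the helper names emit_metric and run_workload, and the metric names duration_s and items_processed are all hypothetical; only the METRIC name=value line shape comes from this page, and the agent normally writes the real script for you.

```shell
#!/bin/sh
# Hypothetical onyx/eval.sh sketch: run a workload, then report metrics.
set -eu

# Print one metric in the "METRIC name=value" form.
emit_metric() {
  printf 'METRIC %s=%s\n' "$1" "$2"
}

# Placeholder workload: counts to 1000 and echoes the count.
# Replace with the real benchmark or test command for your project.
run_workload() {
  i=0
  while [ "$i" -lt 1000 ]; do
    i=$((i + 1))
  done
  echo "$i"
}

start=$(date +%s)
items=$(run_workload)
end=$(date +%s)

# Whole-second wall-clock duration plus a workload-specific count.
emit_metric duration_s "$((end - start))"
emit_metric items_processed "$items"
```

Because of set -eu, a failing workload makes the script exit with a non-zero status instead of printing misleading metrics.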
5. Steer the Loop with onyx.md
onyx/onyx.md is the best way to steer ongoing auto research.
Edit it when you want to change:
- the objective;
- the primary metric or secondary monitors;
- files the agent may edit;
- files or behaviors that are off limits;
- correctness constraints;
- stop conditions;
- lessons from what has already been tried.
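As an illustration of how those items could be laid out in a brief, the sketch below prints a sample template to standard output. The section headings, the example values, and the function name print_brief_template are all assumptions made for this example, not a format Onyx requires. It prints rather than writes, so it cannot overwrite the onyx/onyx.md the agent created.

```shell
#!/bin/sh
# Hypothetical helper: print an illustrative research brief layout.
# Headings mirror the list above; they are not an official onyx.md schema.
print_brief_template() {
  cat <<'EOF'
# Research brief (illustrative layout)

## Objective
Reduce p95 request latency without changing API behavior.

## Metrics
- Primary: latency_ms (lower is better)
- Monitors: error_rate, memory_mb

## Editable files
- src/server/

## Off limits
- migrations/
- public API signatures

## Correctness constraints
- onyx/checks.sh must pass.

## Stop conditions
- Stop after 20 experiments without improvement.

## Lessons so far
- (none yet)
EOF
}

print_brief_template
```

Compare the printed layout against your existing brief and copy over only the sections you want to change.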
After updating onyx.md, ask the agent to continue:
6. Watch Results in Onyx
As the agent runs, open the project in the app:
- Graph view shows research branches and best metrics.
- Branch timeline shows every experiment.
- File tree shows changed files plus the onyx/ context.
- Diff view compares attempts against previous best results.
Next Steps
- Use Onyx with agents: Learn the full agent runner contract behind /onyx.
- Understand onyx.md and eval.sh: Learn how the agent creates and uses the repo-side research files.