AI Coding Agent Orchestrator: loops until done.
Give it a spec. It writes tests, implements code, verifies quality, and retries until everything passes.
Why nax
Built around discipline, not convenience.
AGENT-AGNOSTIC
Works with any agent
Claude Code, Codex, Gemini CLI, or any ACP-compatible agent. nax drives them all through the same disciplined loop.
TDD-ENFORCED
Tests first, always
Acceptance tests must be RED before implementation starts. No shortcuts, no skipped steps — the loop enforces it.
LOOP UNTIL DONE
Verify, retry, escalate
Failed verify? Retry. Still failing? Escalate to a more powerful model tier. Loop until every test passes.
MONOREPO-READY
Per-package everything
Per-package config, per-package test commands, per-story working directories. Full monorepo isolation out of the box.
SEMANTIC REVIEW
LLM-powered quality gates
Behavioral review against story acceptance criteria. Catches stubs, placeholders, and out-of-scope changes before they merge.
ADVERSARIAL REVIEW
Probes edge cases
A second LLM pass that attacks the implementation — probing input handling, error paths, and abandoned implementations.
LANGUAGE-AWARE
Auto-detects your stack
Detects Go, Rust, Python, TypeScript from manifest files. Adapts test commands, mocking patterns, and context per language.
EXTENSIBLE
Plugin system
Hook into routing, review, reporting, and post-run actions. Drop a plugin into .nax/plugins/ and it is picked up automatically.
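As a rough illustration of manifest-based stack detection, the sketch below maps well-known manifest file names to a language and a default test command. The table contents, the `detectStacks` name, and the chosen test commands are assumptions made for this example, not nax's actual detection rules.

```typescript
// Hypothetical sketch: the manifest table and default test commands are
// assumptions for illustration, not nax's actual detection rules.
type Language = "go" | "rust" | "python" | "typescript";

interface StackInfo {
  language: Language;
  testCommand: string;
}

// A Map avoids accidental hits on inherited object properties.
const MANIFESTS = new Map<string, StackInfo>([
  ["go.mod", { language: "go", testCommand: "go test ./..." }],
  ["Cargo.toml", { language: "rust", testCommand: "cargo test" }],
  ["pyproject.toml", { language: "python", testCommand: "pytest" }],
  ["tsconfig.json", { language: "typescript", testCommand: "bun test" }],
]);

// Returns one entry per detected language, in the order the files are listed.
function detectStacks(files: string[]): StackInfo[] {
  const seen = new Set<Language>();
  const stacks: StackInfo[] = [];
  for (const file of files) {
    // Match on the base name so manifests inside nested packages count too.
    const base = file.split("/").pop() ?? file;
    const info = MANIFESTS.get(base);
    if (info !== undefined && !seen.has(info.language)) {
      seen.add(info.language);
      stacks.push(info);
    }
  }
  return stacks;
}
```

In a monorepo, calling a function like this once per package directory would give each package its own language and test command.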
How it works
A loop with teeth.
Plan
Optional. Generate prd.json from a spec file using an LLM. Stories are decomposed, sized, and sequenced automatically.
Acceptance setup
Generate acceptance tests and assert RED before implementation begins. No green tests allowed at this stage.
Route
Classify story complexity and select a model tier — fast, balanced, or powerful. Security-critical stories always escalate.
Execute
Run the agent session. Claude Code, Codex, Gemini CLI, or any ACP-compatible agent. Context and constitution are injected per story.
Verify
Run scoped tests. On failure, the rectification loop fires first and attempts a fix before escalating to the next model tier.
Review
Lint, typecheck, semantic review against ACs, and adversarial review for edge cases. Autofix runs before escalation.
Escalate & loop
On repeated failure, retry with a higher model tier. Repeat until all stories pass or cost / iteration limits are hit.
Regression gate
Full test suite runs once after all stories complete. nax retries failed suites; a gate failure does not block the feature.
Acceptance
Run the original acceptance tests against the finished feature. GREEN here means done.
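The verify, rectify, and escalate steps above can be sketched as a small control loop. This is a minimal sketch under stated assumptions: the tier names come from the Route step, but `runStory`, `LoopLimits`, the callback signatures, and the limit semantics are hypothetical and not nax's real interface. It is synchronous for brevity, where a real agent session would be asynchronous.

```typescript
// Hypothetical sketch of a verify -> rectify -> escalate loop. Tier names come
// from the text; every function, type, and limit is an illustrative assumption.
type Tier = "fast" | "balanced" | "powerful";
const TIERS: Tier[] = ["fast", "balanced", "powerful"];

interface LoopLimits {
  maxIterations: number;          // total agent attempts allowed for the story
  rectifyAttemptsPerTier: number; // extra fix attempts at a tier before escalating
}

interface StoryResult {
  passed: boolean;
  finalTier: Tier;
  iterations: number;
}

function runStory(
  startTier: Tier,
  runAgent: (tier: Tier, attempt: number) => void, // one agent session
  verify: () => boolean,                           // scoped tests pass?
  limits: LoopLimits,
): StoryResult {
  let iterations = 0;
  // Walk the tiers from the routed starting tier upward.
  for (const tier of TIERS.slice(TIERS.indexOf(startTier))) {
    // Attempt 0 is the initial run; later attempts are rectification passes.
    for (let attempt = 0; attempt <= limits.rectifyAttemptsPerTier; attempt++) {
      if (iterations >= limits.maxIterations) {
        return { passed: false, finalTier: tier, iterations };
      }
      iterations++;
      runAgent(tier, attempt);
      if (verify()) {
        return { passed: true, finalTier: tier, iterations };
      }
    }
    // Rectification exhausted at this tier: the outer loop escalates.
  }
  return { passed: false, finalTier: "powerful", iterations };
}
```

The loop stops in one of three ways: verification passes, the iteration cap is reached, or the most powerful tier runs out of attempts. A cost limit could be checked in the same place as the iteration cap.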
Agents
Bring your own agent.
nax doesn't care which model you use. Swap agents without touching your workflow.
Claude Code
ACP · CLI
Full ACP support with multi-turn continuity, token-cost tracking, and structured JSON-RPC. Recommended.
Codex / OpenCode
CLI
OpenAI Codex CLI adapter. Pass --agent opencode to activate.
Gemini CLI
CLI
Google's Gemini CLI adapter. Pass --agent gemini to activate.
Any ACP agent
ACP
Any agent implementing the Agent Client Protocol connects via acpx. Structured sessions, cost alignment, multi-session continuity.
Install
Requires Bun 1.3.7+ or Node 22+.
Git must be initialized in your project.
Install with npm or bun.
Quick start
cd your-project
nax init # Create .nax/ structure
nax features create my-feature # Scaffold a feature
# Write your spec, then run
nax run -f my-feature --plan --from spec.md
Passing --plan --from spec.md skips interactive planning, so planning and execution happen in a single command.