Autonomous Agent
FeaturesAutonomous Agent
The Autonomous Agent takes a GitHub milestone (or individual issue) and works through each issue automatically — creating branches, writing code, running quality checks, and opening pull requests.
How It Works
- Pick a milestone — Select a GitHub repo and one or more open milestones
- Review the estimate — See per-issue cost estimates before committing
- Create the job — The agent queues each issue in dependency order
- Execute — The agent processes issues one at a time inside a sandboxed container
- Review results — Each issue produces a PR with quality gate results
You (dashboard) --> Create Job --> Agent picks up Issue #1
|
+--> Creates branch
+--> Writes code
+--> Runs checks (typecheck, lint, test, build)
+--> Opens PR
+--> Moves to Issue #2 ...
Getting Started
Prerequisites
- A project with a linked GitHub repository
- Issues organized under a GitHub milestone
- Issues formatted with the Agent Issue Specification for best results
Step 1: Open the Jobs Dashboard
Navigate to your project, then click Autonomous Agent in the sidebar (or go to /app/project/:id/jobs).
Step 2: Select a Milestone
- Choose the GitHub repo (auto-populated if your project has a linked repo)
- Select one or more open milestones
- Click Estimate Cost to see a breakdown before committing
Step 3: Configure Options
| Option | Default | What it does |
|---|---|---|
| Auto-merge PRs | Off | Merge PRs automatically when all quality gates pass |
| Security scan | On | Run npm audit before committing |
| Deploy on complete | Off | Trigger a staging deploy after the last issue finishes |
| Pause per issue | Off | Pause after each issue so you can review before continuing |
Set budget limits to control maximum spend:
- Cost limit (USD) — the agent pauses when 80% consumed, stops at 100%
- Token limit — hard cap on total LLM tokens used
Step 4: Create and Execute
Click Create Job to queue the work. Then click Execute to start processing.
Monitoring a Job
Real-Time Dashboard
While a job runs you see:
- Progress bar — issues completed vs. total
- Budget tracker — tokens used, compute time, estimated cost
- Per-issue table with:
- Current stage (planning, building, testing, committing)
- Quality gates (TypeScript, lint, tests, build, security)
- Duration and token usage
- PR link when created
Updates stream in real time via server-sent events (SSE).
Job Statuses
| Status | Meaning |
|---|---|
| Queued | Created but not yet started |
| Running | Actively processing issues |
| Paused | Stopped at a budget threshold or by user request |
| Completed | All issues processed |
| Failed | Stopped due to an error or budget exceeded |
| Cancelled | Manually cancelled by user |
Issue Stages
Each issue within a job moves through:
Queued —> Running —> Planning —> Building —> Testing —> Committing —> Done
If a step fails the agent retries up to 3 times with a targeted fix prompt before marking the issue as Failed.
Budget and Cost
How Cost Is Calculated
| Component | Rate |
|---|---|
| LLM input tokens | 1 credit / 1,000 tokens |
| LLM output tokens | 4 credits / 1,000 tokens |
| Container compute | 1 credit / minute |
A 25% safety margin is applied to all estimates.
Budget Enforcement
- At 80% of your limit the job pauses automatically
- At 100% the job stops
- You can resume a paused job and optionally raise the limits
Complexity Tiers
The estimator classifies each issue into a tier based on its body, labels, and scope:
| Tier | Typical Work | Token Range | Compute |
|---|---|---|---|
| Trivial | Typo fix, config change | 5k-15k | 1-3 min |
| Small | Bug fix, simple feature | 15k-40k | 3-8 min |
| Medium | Feature with tests | 40k-100k | 8-20 min |
| Large | Multi-file refactor | 100k-250k | 20-45 min |
| Epic | Major system change | 250k-500k | 45-90 min |
Quality Gates
Every issue runs through up to five checks:
- TypeScript —
tsc --noEmit - Lint —
eslint - Tests —
npm test - Build —
npm run build - Security —
npm audit(when enabled)
Results appear as pass/fail icons on each issue row. If a gate fails, the agent attempts a self-healing retry.
Writing Good Issues
The agent works best with well-structured issues. Include:
- What — clear description of the change
- Why — motivation and context
- Requirements — specific acceptance criteria
- Technical Specification — files to create/modify, integration points
- Scope Constraints — complexity tier, line estimate
See the Agent Issue Specification for the full template.
Pause, Resume, and Cancel
- Pause — stops after the current issue finishes; no work is lost
- Resume — continues from where it left off; you can raise budget limits
- Cancel — marks remaining issues as skipped; completed PRs are kept
FAQ
Can I run multiple jobs at once? One job runs at a time per project. You can queue additional jobs.
What happens if my budget runs out? The job pauses automatically. You can increase the limit and resume.
Does auto-merge bypass code review? Auto-merge only triggers when all quality gates pass. You can disable it to review every PR manually.
Can I use this on a repo without milestones? Yes — create a single-issue job instead of a milestone job.