SIMPSEN

Self-IMProving Software ENgine

01

The Problem

Current coding agent orchestration systems require constant human intervention. Developers must manually identify issues, craft prompts, and trigger improvements. This creates a fundamental bottleneck: your software can only improve as fast as developers can observe problems, formulate solutions, and execute changes.

The hidden cost:

While your product runs 24/7, improvement happens only during working hours. Customer feedback accumulates. Issues pile up. Opportunities are missed. The gap between user needs and product evolution grows wider every day.

And even when developers are working, they lack the instrumentation to know which changes actually helped. You ship and hope. There's no structured system tracking how one change connects to another or how past decisions inform future ones. Product knowledge lives in people's heads, not in structured batch history.

02

The SIMPSEN Solution

SIMPSEN orchestrates end-to-end autonomous product building—from concept through creation through continuous improvement. It operates on two layers:

Intelligence layer: A structured batch history that tracks Trigger → Plan → Build → Test → Ship across the full orchestration lifecycle, accumulating context that compounds over time.

Execution layer: A fully autonomous pipeline from trigger evaluation through isolated sandbox execution to quality-gated deployment—shipping tested code without human intervention.

Instead of waiting weeks for iteration loops, SIMPSEN's CEO agent drives strategic product evolution grounded in your roadmap, mission, and accumulated batch history. Each cycle informs the next.

03

How It Works

SIMPSEN orchestrates autonomous coding agents driven by a CEO agent that plans what to build based on your product vision:

Trigger Sources

Today, three trigger sources drive the loop. The webhook endpoint supports additional integrations as they come online.

CEO Agent

Plans features from roadmap

Testing Bot

Browser-based bug detection

Webhooks

External event ingestion

Planned integrations:

Support Chats

Coming soon

Usage Analytics

Coming soon

Error Logs

Coming soon

The Intelligence Loop

1
Ingest
2
Evaluate
3
Execute
4
Validate
5
Learn
← every batch informs the next →
1

Ingest

The CEO agent reviews your source documents (roadmap, mission, personality) and recent batch history to decide what to build next. Trigger events can also arrive via the testing bot or webhooks. Each event is stored with its source, type, and payload in the trigger_events table.

2

Evaluate

Claude analyzes each trigger and decides whether action is warranted. When it is, the evaluator generates a structured PRD: dependency-ordered user stories with machine-verifiable acceptance criteria (every story requires “typecheck passes”) and priority classification.

3

Execute

A Vercel Sandbox clones the repository, creates a feature branch, and installs the agent SDK. The agent loop iterates through user stories, building context-rich prompts that include progress from previous iterations. Circuit breakers enforce safety: max 3 attempts per story, max 10 iterations per run. The agent has full tool access—read, write, edit, search, and bash—to implement each story.

4

Validate

After each story implementation, quality gates run: pnpm typecheck and pnpm test must both pass before a story is marked complete. Every iteration produces structured JSON logs—iteration starts, agent messages, pass/fail results—creating a full audit trail.

5

Learn

The CEO agent reviews test results and merge outcomes, deciding what worked and what needs another pass. Patterns emerge across batches—recurring test failures surface systemic issues, successful merges validate the approach. Every cycle makes the next one smarter, compounding product intelligence over time.

Pipeline Traceability

The batch pipeline organizes each run into five stages that form a chain:

Trigger
Plan
Build
Test
Ship

These stages span the full orchestration lifecycle—from trigger ingestion through CEO planning, parallel dev agents, automated merging, and browser-based testing—with cross-batch patterns that surface insights no individual run could reveal. The dashboard makes this intelligence accessible: batch history for exploring past decisions, job logs for tracing each agent's work, and test feedback for reviewing what the browser testing agent found.

04

Transparent Autonomy

Autonomous doesn't mean opaque. The natural objection to self-improving software is trust: how do you know what it's doing, and how do you know it's working? SIMPSEN is designed so that every autonomous action is individually visible, measured, and reversible.

Human oversight isn't bolted on as an afterthought. It's built into two layers: the dashboard shows what changed; the batch logs show why it changed—CEO rationale, dev agent output, and test feedback. You maintain full control—but intervention becomes the exception, not the rule, because the data gives you reasons to trust the system.

What you see:

Every batch is tracked end to end—the CEO's rationale for planning it, each dev agent's structured logs, merge results, and browser test feedback from three personas. Every change lands on its own branch, so git revert isolates the blast radius to a single PR.

05

From MVP to Production in Days, Not Months

Traditional Approach

Build MVP → Launch → Wait for feedback → Schedule sprint → Developer implements → Deploy → Repeat

Iteration cycle: 2-4 weeks

SIMPSEN Approach

Build rough MVP → Launch with SIMPSEN → Autonomous improvement begins immediately

Iteration cycle: Hours to days

Your MVP becomes production-grade through continuous autonomous iteration. The CEO agent plans features, dev agents build them, and browser tests validate each deployment—all while you focus on product strategy.

06

Key Benefits

Founders & Product Teams

  • Launch faster with rougher MVPs
  • Eliminate backlog grooming overhead
  • CEO agent makes autonomous decisions
  • Scale improvement without scaling headcount

Development Teams

  • Focus on strategic features
  • Reduce context switching
  • Deploy with confidence—quality gates and browser test validation
  • Innovation over iteration

End Users

  • Product gets better every day
  • Continuous autonomous improvements
  • Collective improvements across all users
  • Software that adapts to needs

07

Use Cases

Early Stage Startups

Ship your MVP faster. Let SIMPSEN handle the polish while you validate product-market fit.

SaaS Platforms

Maintain feature velocity while automatically handling the long tail of user requests and refinements.

Product Pivots

Rapidly iterate on new directions with your CEO agent autonomously building and testing each batch of features.

Enterprise Tools

Scale product improvement across diverse user segments without proportionally scaling development teams.

08

Technical Foundation

SIMPSEN's architecture is built on concrete, production-tested components:

  • Evaluator: Claude-powered trigger analysis that generates structured PRDs with dependency-ordered user stories and machine-verifiable acceptance criteria
  • Sandbox Execution: Vercel Sandbox provides isolated environments with full repo access, authenticated git, and pnpm support
  • Agent Loop: Claude Agent SDK with up to 50 tool-use turns per story, 10 iterations per run, 3 attempts per story with circuit breakers
  • Quality Gates: Automated typecheck and test suite verification after every story implementation
  • Testing Pipeline: Stagehand + Browserbase for autonomous browser testing with multiple user personas per batch
  • Database: PostgreSQL via Drizzle ORM with 4 engine tables (trigger_events, agent_runs, agent_logs, workflow_configs)
  • API: tRPC procedures for trigger, status, and log retrieval, plus webhook endpoint for external integrations

Every run is logged with structured JSON from start to finish. Every change lands on its own branch and is reversible via git revert. The batch history provides a persistent, queryable record of what SIMPSEN has planned, built, tested, and shipped—ensuring that autonomous doesn't mean unaccountable.

09

The Future of Software Development

Software has always been about understanding users and building what they need. SIMPSEN takes this to its logical conclusion: a CEO agent that strategizes, plans, and orchestrates autonomous dev agents to build and ship tested code—continuously.

Not faster iteration—autonomous product building. You define the vision, roadmap, and personality. SIMPSEN handles the rest: plan, build, merge, test, review, repeat—without human bottlenecks.

Define the vision. The CEO agent builds the product.

10

Get Started

SIMPSEN is currently in private beta. We're working with select early-stage startups and SaaS platforms to prove the model.

If you're building software and want it to improve itself, let's talk.