§ Behind the scenes · How it's made

One human.
Eleven agents.
Two sentinels who never sleep.

TripOS was designed, built, deployed, and is monitored by a team of AI agents working for one founder. There's no engineering team. There's no on-call rotation. There's no Jira. The agents do the work — the human ships the vision.

✈Meet the team See the loop ↓

Krish Mehta · operator · the only human in the loop

§ Live demo · Skip the prose, watch them work

Tap the control plane.
Watch SEV-2 close itself.

A synthetic incident drill — what happens when a real one fires at 3am. Each tier owns its lane. The log on the right is what the agents say to each other. ~7 seconds end to end, no human in the loop.

Incident response · interactive demo

01 · Control plane

02 · First respondersEach owns a remediation lane

WARDEN

First responder · auth & security

standby

ATLAS

Reroute · architecture

standby

FORGE

Rollback · backend

standby

PILOT

Rotate · devops

standby

03 · Quality gateHolds the all-clear

CHECK

Verify · QA probes

standby

Probe results decide whether we're green or roll forward

04 · User commsCloses the loop with anyone affected

CONCIERGE

Remediate · runtime AI

standby

Apology + credit, drafted and sent without a human ping

Also on fleet · not pagedIdle by design

PIXEL

VAULT

DRAFT

VERSE

REACH

Agent feed · idle

PIXEL · committed to feature/live-mode-pill
CHECK · type-check passed · 3.1s · 0 errors
FORGE · /api/trips/[id]/replan · +12 / −4
PILOT · preview deploy READY
REACH · sitemap regenerated · 36 routes

Tap CONTROL above to fire a SEV-2 drill

§ The thesis · Why this exists

SaaS used to need
ten people to ship.
It needs one now.

The hard part of building a software company was never the typing. It was coordinating ten specialists — frontend, backend, devops, design, copy, QA, security — to all move in the same direction without dropping context.

That coordination problem is what AI agents solve. Give each one a charter, a shared codebase, and a human who can taste-test the output, and you have an engineering org. TripOS is the proof. Eleven agents put it on the internet. Two more keep it from breaking. Krish points the ship.

This is what the next decade of SaaS looks like. We figured we'd build one and show you.

§ The team · 11 builders

Meet the engineering
organisation.

Each agent owns a charter. They share the codebase, share context, and pass work between each other through pull requests and shared memory. They argue when they disagree. The human breaks ties.

ATLAS

System architect

opus

Owns the shape of the thing. Schemas, API contracts, data flow.

Runs on: Opus 4.7
Context: 1M tokens
Input rate: $15.00 / M
Output rate: $75.00 / M

Holds the whole codebase in head when planning cross-cutting work.

Tools

ReadGrepGlobTaskCreateSkill

Hands off to

PIXELFORGEVAULTWARDEN

Shipped

55 data models
127 API routes
Trip generation pipeline

PIXEL

Frontend craftsman

sonnet

Components, layouts, animations. Lives in TSX.

Runs on: Sonnet 4.6
Context: 200k tokens
Input rate: $3.00 / M
Output rate: $15.00 / M

Fast iteration loops. UI work needs many small passes, not deep planning.

Tools

ReadEditWriteGlobBash

Hands off to

DRAFTCHECK

Shipped

104 React components
Live Mode mobile-first
Custom primitives

FORGE

Backend engineer

opus

API routes, server actions, business logic. Where the verbs live.

Runs on: Opus 4.7
Context: 1M tokens
Input rate: $15.00 / M
Output rate: $75.00 / M

Business logic spans dozens of files; reasoning across them needs the bigger model.

Tools

ReadEditWriteBashGrep

Hands off to

VAULTCHECKWARDEN

Shipped

Replan engine
Geofence checkin
Booking sync

VAULT

Database keeper

sonnet

Schema, migrations, query plans, indexes that matter.

Runs on: Sonnet 4.6
Context: 200k tokens
Input rate: $3.00 / M
Output rate: $15.00 / M

Schema work is local and pattern-driven. Sonnet is the right cost/precision trade.

Tools

ReadEditBash

Hands off to

CHECKPILOT

Shipped

Zero-downtime migrations
Composite indexes
Soft deletes

WARDEN

Security & auth

opus

Auth, CSP, rate limits, secret rotation, RBAC. Boring on purpose.

Runs on: Opus 4.7
Context: 1M tokens
Input rate: $15.00 / M
Output rate: $75.00 / M

Security mistakes are expensive. Reasoning capacity > token cost here.

Tools

ReadGrepEditBashWebFetch

Hands off to

CHECKATLAS

Shipped

Owner-gated brand
CSP allowlist
IP rate limits

PILOT

DevOps

haiku

Deploys, env vars, edge functions, preview links.

Runs on: Haiku 4.5
Context: 200k tokens
Input rate: $0.25 / M
Output rate: $1.25 / M

Deploy ops are scriptable and repetitive. Haiku is fast and cheap for this.

Tools

BashReadWebFetch

Hands off to

SENTINEL-1SENTINEL-2

Shipped

Preview-per-PR
Env-pull on every clone
Error tracking wired

DRAFT

Design system

sonnet

Departure Board tokens, type ramp, brand consistency police.

Runs on: Sonnet 4.6
Context: 200k tokens
Input rate: $3.00 / M
Output rate: $15.00 / M

Visual judgment + token-level edits. Sonnet hits both well.

Tools

ReadEditWriteGlob

Hands off to

PIXELVERSE

Shipped

Sand/ink/cobalt/coral palette
Display + mono pairing
80 brand assets

CHECK

Quality control

sonnet

TypeScript correctness, integration verification, regression hunting.

Runs on: Sonnet 4.6
Context: 200k tokens
Input rate: $3.00 / M
Output rate: $15.00 / M

High-volume review pass over diffs. Sonnet absorbs the throughput.

Tools

BashReadGrep

Hands off to

PILOTATLAS

Shipped

Zero TS errors at HEAD
Smoke flows on critical paths

VERSE

Copywriter

haiku

Microcopy, error messages, marketing pages. Words that don't waste seconds.

Runs on: Haiku 4.5
Context: 200k tokens
Input rate: $0.25 / M
Output rate: $1.25 / M

Short generative bursts on bounded scope. Haiku ships words at speed.

Tools

ReadEditWrite

Hands off to

DRAFTPIXEL

Shipped

/agencies B2B page
Onboarding copy
Empty-state voice

REACH

SEO & growth

haiku

Meta tags, sitemap, structured data, analytics events.

Runs on: Haiku 4.5
Context: 200k tokens
Input rate: $0.25 / M
Output rate: $1.25 / M

Structured-output, repetitive work. Haiku is the cost-right answer.

Tools

ReadEditWebFetch

Hands off to

DRAFTPIXEL

Shipped

Per-page metadata
Sitemap generation
Funnel events

CONCIERGE

Runtime AI

sonnet

The agent that ships *to users*. Replan, alerts, geofence reasoning.

Runs on: Sonnet 4.6
Context: 200k tokens
Input rate: $3.00 / M
Output rate: $15.00 / M

User-facing latency budget. Sonnet hits sub-second responses on small inputs.

Tools

WebFetchSkill

Hands off to

End user

Shipped

Live Mode reroutes
Weather-aware swaps
Closure detection

§ Stack & cost · Which model runs each charter

Three model tiers.
One bill at the end.

Each agent runs on the model tier that fits its charter — cheap and fast where the work is repetitive, deeper reasoning where the work spans the repo. Public per-token pricing, no markup, no per-seat fees.

Opus 4.7

opus

Context: 1M tokens
Input: $15.00 / M tok
Output: $75.00 / M tok

3 agents on this tier

ATLASFORGEWARDEN

Sonnet 4.6

sonnet

Context: 200k tokens
Input: $3.00 / M tok
Output: $15.00 / M tok

5 agents on this tier

PIXELVAULTDRAFTCHECKCONCIERGE

Haiku 4.5

haiku

Context: 200k tokens
Input: $0.25 / M tok
Output: $1.25 / M tok

5 agents on this tier

PILOTVERSEREACHSENTINEL-1SENTINEL-2

§ How the bill adds up

Pay per token,
not per chair.

Traditional engineering teams cost in salaries — fixed monthly burn whether you ship or not. The agent stack costs in tokens — you pay for the actual work that lands.

Big-context Opus invocations are the priciest line item, used sparingly for architecture passes and security reviews. Sonnet handles the bulk of build work. Haiku runs the high-frequency, low-stakes loops — sentinels watching production, copy generation, deploy ops.

A typical platform-scale day for TripOS — schema work, a feature, a review pass, two preview deploys, sentinels watching — lands in the tens of dollars, not the hundreds. Idle days cost cents.

§ The loop · How they work together

Vision in.
Production out.

Krish posts the intent. The orchestrator breaks it into work. Builders ship. CHECK reviews. PILOT deploys. Sentinels watch. If anything looks off, the loop restarts on its own — no human ping required.

Workflow · vision → productionContinuous loop

01Krish
VISION
Posts intent
“Live Mode should mirror every platform feature.”
02ATLAS
PLAN
Breaks it down
Audits codebase, files tasks, picks owners.
03PIXEL · FORGE · VAULT
BUILD
Ships code
Components, APIs, schema. Commits land on a branch.
04CHECK · WARDEN
REVIEW
Catches problems
Type errors, regressions, security smells. Requests changes.
05PILOT
SHIP
Deploys
Preview → production. Migration runs. Cache warms.
06SENTINEL-1 · SENTINEL-2
WATCH
Holds the line
Errors, vitals, anomalies. Files new tasks. Loop restarts.

WATCH files new tasks back into PLAN — the loop closes itself

§ Interlinks · Who hands work to whom

The org graph,
in one read.

Every agent has a charter and a list of agents they routinely hand work to. The control plane records each hand-off; this is just the roster, flattened.

Agent

Runs on

Hands work to

ATLAS
System architect
Opus 4.7
1M tokens
PIXELFORGEVAULTWARDEN
PIXEL
Frontend craftsman
Sonnet 4.6
200k tokens
DRAFTCHECK
FORGE
Backend engineer
Opus 4.7
1M tokens
VAULTCHECKWARDEN
VAULT
Database keeper
Sonnet 4.6
200k tokens
CHECKPILOT
WARDEN
Security & auth
Opus 4.7
1M tokens
CHECKATLAS
PILOT
DevOps
Haiku 4.5
200k tokens
SENTINEL-1SENTINEL-2
DRAFT
Design system
Sonnet 4.6
200k tokens
PIXELVERSE
CHECK
Quality control
Sonnet 4.6
200k tokens
PILOTATLAS
VERSE
Copywriter
Haiku 4.5
200k tokens
DRAFTPIXEL
REACH
SEO & growth
Haiku 4.5
200k tokens
DRAFTPIXEL
CONCIERGE
Runtime AI
Sonnet 4.6
200k tokens
End user
SENTINEL-1
Error watch
Haiku 4.5
200k tokens
ATLASFORGE
SENTINEL-2
Uptime & vitals
Haiku 4.5
200k tokens
ATLASPILOT

§ Anatomy of a story · From your line to merged code

One story.
Six hand-offs.

A real story from this codebase — the Live Mode mirror epic — moving from a one-line vision to merged code. Every row is an event in the control plane.

Story tree · vision → mergedLive Mode mirror

KRISHVISION
EPIC · Live Mode mirror
“Everything that is on the platform should be on Live panel mode — people are travelling and don't have laptops.”
ATLASPLAN
STORY · Audit during-trip surfaces vs. existing Live Mode
Catalogued 35+ unwired components. Filed 4 sub-stories grouped by domain (Safety, Money, Local, Right-now).
audit doc · 4 child stories filed
PIXELBUILD
SUB-STORY · Extract NeighborhoodsPanel + NittyGrittyPanel
Inline panels lived in trip detail page only — moved to shared components so Live Mode can render them too.
src/components/neighborhoods-panel.tsx · src/components/nitty-gritty-panel.tsx
PIXELBUILD
SUB-STORY · LiveCollapse helper for mobile-first disclosures
Heavy panels (advisories, scam radar, calculators) collapsed by default so the mobile scroll stays scannable.
src/components/live-collapse.tsx
FORGEBUILD
SUB-STORY · Wire 28 components into the new sections
Safety / Documents / Money / Local knowledge / Right now / Plan ahead / Social / Capture / Achievements.
src/app/app/trips/[id]/live/page.tsx · +473 / −239
CHECKREVIEW
Type-check · zero errors
Every type was satisfied. No regressions in unrelated trip-detail flows.
WARDENREVIEW
CSP allowlist sweep
Caught a new third-party host needing connect-src; patched the security config before merge.
PILOTSHIP
Preview deploy on every commit
Krish opened the preview URL on his phone, eyeballed each section live.
preview-…<branch>.app
MERGED
4 commits squashed → main · production rolled forward
79b9f89 · 5922c24 · 77121ae · 4c57bf2

§ Control plane · The agent-native ticketing layer

There's a ticket system.
Just not for humans.

Underneath the team is a coordination layer — a control plane built for agents instead of for project managers. Every task, status update, delegation, comment, and review lives there. The agents read it; the agents write it. Krish only opens it when he wants to know what's in flight.

Agents pick up assignments, post progress as they work, hand off to the next specialist when their charter is done, and file new tickets when they spot something the human hasn't. Recurring jobs run as routines — Sentinels checking error rates every five minutes, REACH auditing meta tags weekly, VAULT verifying migrations after each deploy.

No standups. No sprints. No backlog grooming meetings. The control plane is the org chart, the kanban board, and the company chat — all consumed and produced by software.

§ Always on · 2 sentinels

While Krish sleeps,
they work.

The build team ships features. The sentinels keep them shipped. They watch error rates, deploy diffs, function panics, and Web Vitals — and open fixes before any human notices. The 3am Slack alert is a thing of the past.

SENTINEL-1

Error watch

haiku

Watches the exception stream, log streams, function panics. Opens fixes before pages.

Runs on: Haiku 4.5
Context: 200k tokens
Input rate: $0.25 / M
Output rate: $1.25 / M

Pattern-match on error shapes; cheap to run continuously.

Tools

WebFetchBashTaskCreate

Hands off to

ATLASFORGE

Shipped

Auto-triage runbooks
First-fix-attempt PRs
Owner notification gating

SENTINEL-2

Uptime & vitals

haiku

Performance scores, vitals, deploy health, cold-start anomalies.

Runs on: Haiku 4.5
Context: 200k tokens
Input rate: $0.25 / M
Output rate: $1.25 / M

Compares numbers against baselines; doesn't need a thinking model.

Tools

WebFetchBashTaskCreate

Hands off to

ATLASPILOT

Shipped

Vitals regression alerts
Deploy diff reports
Cold-start budget

What they watch

Exception stream
Function panics
Vitals regressions
Deploy diff anomalies

What triggers them

New error fingerprint
p95 latency spike
Failed cron run
5xx burst above baseline

What they're allowed to do

Open a triage ticket
Draft a fix PR
Roll back a bad deploy
Notify Krish if all else fails

§ Receipts · What they shipped

The first two months,
in numbers.

Components shipped

104

React + TS, fully typed.

API routes

127

Server actions + REST.

Database models

Relational, indexed, normalised.

Live Mode panels

30+

Every during-trip surface.

Pitch decks

VC, B2B, founder, demo.

Brand assets

PNGs, owner-gated.

Commits to main

171

Each reviewed by the human.

Engineers on payroll

Yes, that says zero.

Numbers above pulled from the repo at the time of last deploy

The agents are reading this page right now,
checking it for typos.

Built by agents · supervised by Krish · 2026

One human.Eleven agents.Two sentinels who never sleep.

Tap the control plane.Watch SEV-2 close itself.

SaaS used to needten people to ship.It needs one now.

Meet the engineeringorganisation.

Three model tiers.One bill at the end.

Pay per token,not per chair.

Vision in.Production out.

The org graph,in one read.

One story.Six hand-offs.

There's a ticket system.Just not for humans.

While Krish sleeps,they work.

The first two months,in numbers.

One human.
Eleven agents.
Two sentinels who never sleep.

Tap the control plane.
Watch SEV-2 close itself.

SaaS used to need
ten people to ship.
It needs one now.

Meet the engineering
organisation.

Three model tiers.
One bill at the end.

Pay per token,
not per chair.

Vision in.
Production out.

The org graph,
in one read.

One story.
Six hand-offs.

There's a ticket system.
Just not for humans.

While Krish sleeps,
they work.

The first two months,
in numbers.