Phos4

Context

Context

Mozart Data had a vision for a browser-based SQL IDE and a team of eleven engineers stuck in the wrong gear. They were building entirely by hand with no agentic tooling, and the frontend couldn't hold the weight of what they were trying to ship.

I came in to retrain the team on AI-assisted development. Within weeks, it was clear that coaching alone wouldn't close the gap. The codebase needed to be rebuilt from scratch. So the engagement became two things at once: rearchitect the frontend and rewire how the team works.

I designed the complete UI/UX, built the design system and theming engine, integrated the Monaco editor, and shipped the first agentic AI tools inside the product, all while coaching eleven engineers into working at a pace they didn't know they could sustain. Phos4 launched as a browser-based SQL IDE that understands the structure of what you're writing, not just the syntax.

Mozart Data had a vision for a browser-based SQL IDE and a team of eleven engineers stuck in the wrong gear. They were building entirely by hand with no agentic tooling, and the frontend couldn't hold the weight of what they were trying to ship.

I came in to retrain the team on AI-assisted development. Within weeks, it was clear that coaching alone wouldn't close the gap. The codebase needed to be rebuilt from scratch. So the engagement became two things at once: rearchitect the frontend and rewire how the team works.

I designed the complete UI/UX, built the design system and theming engine, integrated the Monaco editor, and shipped the first agentic AI tools inside the product, all while coaching eleven engineers into working at a pace they didn't know they could sustain. Phos4 launched as a browser-based SQL IDE that understands the structure of what you're writing, not just the syntax.

Mozart Data had a vision for a browser-based SQL IDE and a team of eleven engineers stuck in the wrong gear. They were building entirely by hand with no agentic tooling, and the frontend couldn't hold the weight of what they were trying to ship.

I came in to retrain the team on AI-assisted development. Within weeks, it was clear that coaching alone wouldn't close the gap. The codebase needed to be rebuilt from scratch. So the engagement became two things at once: rearchitect the frontend and rewire how the team works.

I designed the complete UI/UX, built the design system and theming engine, integrated the Monaco editor, and shipped the first agentic AI tools inside the product, all while coaching eleven engineers into working at a pace they didn't know they could sustain. Phos4 launched as a browser-based SQL IDE that understands the structure of what you're writing, not just the syntax.

Year

2025

Industry

Developer Tools · Data & Analytics

North Star

Make SQL feel like thinking, not typing.

Unlocking the Team

Unlocking the Team

The hardest problem on this engagement wasn't code. It was velocity.

Mozart's engineers were capable. They were also slow because they had no leverage. Nobody on the team had used agentic development in a disciplined way, and there was no infrastructure to support it. The first phase was trust: showing the team that AI could accelerate their work without sacrificing rigor, then building the harnesses that made the acceleration repeatable.

I stayed embedded with the team, running pair sessions, designing review workflows, building prompt architecture, and setting quality gates for AI-generated code. The goal from the beginning was self-sufficiency. The team needed to sustain the velocity after I left.

Metric

Before

After

--------

--------

-------

Commits per week

~50

300+ (peak: 570)

Lines changed per week

5,000 to 10,000

30,000 to 80,000

Contributors active per week

Sparse

Full team

The stacked commit chart tells the real story. The acceleration wasn't one person going faster. It was the entire team producing at a different level. That happened because the training was about patterns and judgment rather than dependency on any individual tool or person.

Designing for the Eight-Hour Day

Designing for the Eight-Hour Day

Every visual decision on Phos4 came back to one question: what does this feel like at hour six?

The product is built for engineers who still want to work with their hands. The kind of people who fix old electronics instead of replacing them. They use AI, but they also want to fiddle with the query, feel the data, and understand what's happening underneath. That sensibility drove everything.

I workshopped the design direction with the CEO around the idea of warmth. The SQL IDE category is dominated by cold, clinical interfaces, and we wanted something different. We landed on a Gruvbox-inspired palette with amber accents and warm tones that reduce eye strain over long sessions. The palette signals something the market hadn't articulated: a tool designed for the experience of actually living inside it all day.

The theming engine supports ten color palettes across light and dark modes with a dynamic CSS variable system and localStorage caching for instant switches. Each engineer tunes the environment to feel like theirs. That's ergonomics, not preference.

Zen Mode and Focus Mode are first-class interaction paradigms rather than features buried in a settings menu. Focus mode dims inactive panels so your attention stays on the active workspace. Zen mode strips everything except the query and the results. If the tool is fighting you at hour six, the tool is wrong.

An Editor That Comprehends SQL

An Editor That Comprehends SQL

Most SQL editors display your code. Phos4's Monaco integration understands the structure of what you're writing and surfaces that understanding through the interface.

I built the Monaco theming system with 18 themes, semantic token mapping, and CSS variable integration, then went deep on dialect-aware intelligence. The editor supports Snowflake, BigQuery, Databricks, and Redshift with behavior that goes well beyond syntax highlighting.

CTE-aware code folding uses visual decorations that reflect the query's logical structure. A join profiler provides predicate analysis and join type detection. SQL function autocomplete works with per-dialect snippet support. Jinja template support handles dbt workflows. And a baseline diff mode lets you compare query versions side by side.

The editorial decision behind all of this was that the intelligence should be ambient. You shouldn't have to invoke a feature to benefit from it. The editor should quietly understand what you're doing and make the right information available at the right moment.

AI as Crew

AI as Crew

I designed and built Phos4's first agentic AI system, and this is where the consulting and the product work converged. I was teaching the team to use AI as a development methodology while simultaneously building AI into the product they were shipping.

The design principle was that the AI should function like a crew rather than a single assistant. One agent writes SQL, another reviews it, a third pressure-tests the data. The AI doesn't replace the engineer's judgment. It gives them more surface area to apply that judgment.

The architecture integrates multiple LLM providers via OpenRouter with streaming insights delivered over server-sent events. A tool routing framework dispatches requests to specialized handlers for dbt commands, git operations, schema lookups, and query analysis. Conversation history includes context chips so users can steer the AI's attention toward specific tables, schemas, or prior queries. Auto-analysis proactively surfaces insights about query results, and model selection lets users choose their preferred provider.

The same philosophy we used for the team's development workflow carried through to the product: AI-assisted, human-directed.

The Stack

The Stack

Phos4 is a production IDE. The frontend comprises 285 React components across 33 feature categories, built to support ten tightly integrated capabilities without feeling bolted together.

The architecture runs on React 19 and TypeScript over Vite. There are 66 reusable UI primitives built on Radix and shadcn/ui with Tailwind CSS 4, including 13 button variants and 5 input variants, every one documented in Storybook. Nine React Context providers handle cross-cutting concerns like auth, file management, warehouse metadata, focus mode, and user preferences. Ten Redux slices manage domain logic across the editor state, query execution, AG Grid configuration, formatting, profiler, schema browser, and the AI assistant. TanStack Query handles data fetching and AG Grid Enterprise handles data display. Testing runs through Vitest for unit tests, Playwright for end-to-end, and Storybook for visual regression.

I should note what others built. The VP of Frontend Engineering built the DuckDB WASM integration, which handles client-side SQL processing with Apache Arrow and Parquet and eliminates server round-trips. Other engineers built backend services, the profiler pipeline, and warehouse connection infrastructure.

I built the design, the frontend architecture, and the AI integration layer. The team built on top of it.

The outcome I keep coming back to is the velocity chart. An entire team went from grinding at 50 commits a week to sustaining 300 or more. The architecture gave them something solid to build on, and the AI training gave them the speed to actually use it.

A production SQL IDE. Architecture, interaction design, AI integration, and team transformation, shipped and live.

Let’s talk.

We start by listening. Let's talk like people building something worthwhile.

Fast reply.

Expect a note from us within one business day with a link to book a 20-minute discovery call.

Clear next steps.

48 hours after that call you’ll receive a fixed-fee proposal, sprint-by-sprint roadmap, and launch timeline—everything you need to green-light with confidence.

Let’s talk.

We start by listening. Let's talk like people building something worthwhile.

Fast reply.

Expect a note from us within one business day with a link to book a 20-minute discovery call.

Clear next steps.

48 hours after that call you’ll receive a fixed-fee proposal, sprint-by-sprint roadmap, and launch timeline—everything you need to green-light with confidence.

Let’s talk.

We start by listening. Let's talk like people building something worthwhile.

Fast reply.

Expect a note from us within one business day with a link to book a 20-minute discovery call.

Clear next steps.

48 hours after that call you’ll receive a fixed-fee proposal, sprint-by-sprint roadmap, and launch timeline—everything you need to green-light with confidence.

© 2026 MetaModern, Inc. All rights reserved.

No spam

No data resale

© 2026 MetaModern, Inc. All rights reserved.

No spam

No data resale

© 2026 MetaModern, Inc. All rights reserved.

No spam

No data resale