Overview

Relevant source files

Purpose and Scope

This document provides a high-level introduction to gstack: its purpose, architecture, and design philosophy. It explains how gstack transforms AI agents from generalist assistants into a coordinated team of specialists through a structured skill system, backed by a persistent headless browser infrastructure and a semantic memory layer.

gstack is designed for technical founders, CEOs, and staff engineers who want to ship code at an accelerated scale—leveraging agentic workflows to move faster than a traditional team README.md29-33

What is gstack

gstack is an "open source software factory" README.md23 It provides specialized cognitive modes for different engineering activities, invoked via slash commands. The system consists of three primary layers:

Specialized Skills: 31+ markdown-based skills (e.g., /office-hours, /plan-ceo-review, /review, /ship) that define persona-driven prompts and workflows README.md23-24 These are generated from templates in .tmpl files using a resolution pipeline involving scripts/gen-skill-docs.ts CLAUDE.md86
Browse Infrastructure: A production-grade headless Chromium subsystem that enables the /browse and /qa skills to interact with web applications with sub-second latency ARCHITECTURE.md7-10
gbrain Integration: A semantic search and memory layer that provides cross-session continuity and project-specific context. Planning skills use {{BRAIN_PREFLIGHT}} templates to load context from digests of typed entities like gstack/product or gstack/goal CHANGELOG.md41-45

Core Cognitive Modes

Skill	Specialist Persona	Primary Function
`/office-hours`	YC Partner	Reframes product ideas via 6 forcing questions; brain-aware docs/skills.md7 CHANGELOG.md43
`/plan-ceo-review`	CEO / Founder	Rethinks the problem; finds the "10-star product" docs/skills.md9
`/plan-eng-review`	Eng Manager	Locks architecture, data flow, and edge cases docs/skills.md10
`/review`	Staff Engineer	Finds bugs that pass CI but fail in production docs/skills.md13
`/qa`	QA Lead	Tests apps, finds bugs, and verifies via browser docs/skills.md18
`/ship`	Release Engineer	Automates PR creation, test runs, and doc syncing docs/skills.md22
`/browse`	QA Engineer	Provides the agent "eyes" via a persistent browser docs/skills.md30
`/investigate`	Debugger	Systematic root-cause debugging with an "Iron Law" of no fixes without investigation docs/skills.md14

Sources: README.md23-24 docs/skills.md5-50 CLAUDE.md109-131 CHANGELOG.md41-46

Problem Statement

Standard AI agents often suffer from "blank prompt" syndrome or inconsistent depth. gstack solves several specific bottlenecks:

Literalism: Agents often take requests literally without questioning underlying value. gstack uses /office-hours to push back on premises docs/skills.md7
Context Fragmentation: Agents forget previous decisions. gstack integrates with gbrain to load cached digests of prior plans and product goals CHANGELOG.md41-43
Latency: Standard browser tools often cold-start a new instance. gstack's persistent daemon responds in ~100-200ms ARCHITECTURE.md36
State Loss: Without persistence, agents lose cookies and login sessions. gstack maintains state between calls ARCHITECTURE.md58-62
Fragile Selectors: Traditional CSS/XPath selectors break easily; gstack uses an accessibility-tree-based "ref" system ARCHITECTURE.md112-115

System Architecture

High-Level Component Diagram

This diagram bridges the Natural Language Space (user commands) to the Code Entity Space (specific implementation files and classes).

Sources: ARCHITECTURE.md12-34 CLAUDE.md78-131 CHANGELOG.md41-61 package.json12

Persistent Daemon Architecture

gstack's browser is not a one-off script; it is a persistent daemon. The key insight is that an AI agent needs sub-second latency and persistent state to be effective ARCHITECTURE.md7-10

Command Data Flow

The following diagram illustrates how a command like $B click @e1 moves through the codebase.

Key Architecture Decisions:

State Coordination: The CLI checks .gstack/browse.json for a running server. If none exists, it spawns the server which writes a random port and bearer token to this file ARCHITECTURE.md64-73
Ref System: Instead of CSS, the snapshot system in browse/src/snapshot.ts assigns temporary handles (e.g., @e1) based on the ARIA accessibility tree ARCHITECTURE.md112-121
Security Model: The daemon uses a dual-listener architecture to separate local traffic from remote pairing tunnels ARCHITECTURE.md88-97
Automatic Lifecycle: The server auto-starts on the first command and auto-shuts down after 30 minutes of idleness ARCHITECTURE.md32

Sources: ARCHITECTURE.md52-81 ARCHITECTURE.md112-121 ARCHITECTURE.md88-109

Design Philosophy

1. Specialized Cognitive Modes

gstack assumes that a single prompt cannot handle the nuance of a CEO, an Architect, and a QA Engineer simultaneously. By splitting these into separate skills, the model is forced into a specific depth of analysis docs/skills.md7-19

2. Brain-Aware Planning

Planning skills are no longer stateless. They preflight a typed entity model from gbrain (Wintermute or local PGLite) before asking questions. This allows the agent to detect contradictions with prior plans (e.g., "this contradicts your January CEO plan") CHANGELOG.md41-45

3. Multi-Host Support

gstack is designed to work across a wide variety of AI hosts (Claude, Codex, Gemini, Cursor, OpenClaw, etc.). It uses a template-based generation system (scripts/gen-skill-docs.ts) to adapt core skill logic to the specific constraints of each model CLAUDE.md86-93

4. Verification & E2E Testing

gstack uses a tiered testing system:

Tier 1: Static validation and generator quality checks (bun test) CLAUDE.md128
Tier 2: E2E tests via claude -p to verify real-world behavior (bun run test:e2e) CLAUDE.md129
Tier 3: LLM-as-judge quality evals (bun run test:evals) CLAUDE.md131

5. High-Performance Tooling with Bun

The choice of Bun enables compiled binaries via bun build --compile for sub-millisecond startup and native SQLite access for cookie decryption ARCHITECTURE.md42-45

Sources: docs/skills.md7-19 CLAUDE.md86-131 ARCHITECTURE.md38-51 CHANGELOG.md41-46