Research

Four programs.
One stack, made measurable.

We run our research as four programs of inquiry — generative AI stack, personal AI CEO, sovereign AI system, generative culture. Each ships one report per week with the artefacts attached. Replication is the point.

Build your stack from this research →

Get reports first, free

Programs

4 active

Program (2 reports)

Generative AI Stack

Composition, model routing, agent orchestration — the working layer.

Open the program →

Program (1 report)

Personal AI CEO

The operating model — strategy, governance, talent, decisions.

Open the program →

Program (1 report)

Sovereign AI System

Vaults, IP, portability — your library, never locked.

Open the program →

Program (1 report)

Generative Culture

How operators compound together — cohorts, rituals, shared substrates.

Open the program →

Latest

5 reports

The six pillars of a Personal AI CoE — and why most creator stacks miss four of them

47 of 50 creators run only 2 of 6 pillars (Technology + a partial Library). The missing four — Strategy, Governance, Talent, Ethics — are where compounding actually happens.

22 Apr 2026, 9 min

Read the report →

Claude 4.7 vs Gemini 3 Pro vs GPT-5.1 on long-form writing — 50 tasks, measured

Claude 4.7 wins voice-fit by a 23% margin on first draft. Gemini 3 Pro wins on factual density. GPT-5.1 wins on structural variety. The honest verdict: model choice depends on what you're optimising — and most creators are optimising the wrong thing.

Read →

22 Apr 2026, 12 min

Three agent orchestration patterns we ran for four weeks — costs, outcomes, and which one earned its keep

Verifier-loop wins on irreversible work (publish, send, charge) by 38% on quality at ~3× cost. Pipeline wins on multi-format expansion (one essay → eight outputs) by 22% quality at 1.4× cost. Pure swarm loses to a single Sonnet on every metric except ego.

Read →

22 Apr 2026, 10 min

Five MCP integrations that broke in production over six weeks — and what each failure taught

Five incidents, three root-cause categories — silent schema drift, auth-token rotation gaps, transport-layer back-pressure. A 12-line pre-flight check catches the patterns. Most production MCP failures are operational, not architectural.

Read →

22 Apr 2026, 9 min

Source Pulse synchronous open + close vs async-only — pre-registered design for the June 2026 cohort

We are pre-registering this design before the first cohort runs, so the result lands without hindsight bias. Hypothesis published. Method published. Reproduction kit attached. We will publish the result whether it confirms or falsifies the claim.

Read →

Cadence

One report. Every Tuesday.

Each report follows one structure: hypothesis, method, setup, results, takeaway, next. Raw artefacts attached. Replication welcome. Members of the Lounge get the report on Monday, twenty-four hours before public release.

Get reports first, free

See the community
