
/architect: Reduce Fable tokens by 80%, Fable orchestrates/reviews, Codex builds

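Summary

This article covers the GitHub repo architect-loop and gives a full architecture and usage guide. The Fable skill owns specs, gates, and judgment; the Codex skill does the builder and researcher work. For each work block, one short Fable session judges the previous run and dispatches to parallel builders in isolated worktrees. Builders build only what the spec directs, and Fable itself verifies the gates, checks the diff against the spec's intent, and merges. Research mode starts with a scout map and then designs lanes (drawing on a tactic library); Fable verifies the findings and writes the report. The repo is the only memory (HANDOFF.md + gates/ + lanes/). The repo also contains 12 design rules, a failure-mode table, source-backed rationale for every design choice, and a table of source files. The FAQ explains that Claude Code and the Codex CLI run on existing subscriptions with no extra API key, that run costs are covered by ChatGPT plan quota, and that the safety mechanisms include worktree isolation and rollback to the freeze commit. The origin section credits an X post by @jumperz, and the license is MIT.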

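Why read this

The author demonstrates in practice a repo-centered architecture in which Claude Fable plans and reviews while Codex subagents build. It is designed for AI engineering projects, with repo memory, worktree isolation, and gate supervision to avoid coordination collapse and memory rot. You can lift this loop directly into your own AI startup project and skip the trial and error of working out subagents and review yourself.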

Original text

architect-loop

Claude Fable handles planning and review; GPT-5.5 Codex handles implementation and research. Two Claude Code skills wire that split into a repo-centered loop: specs and gates are written first, Codex works in fresh contexts, and Fable reviews the evidence before anything is integrated. It runs on the subscriptions you already have — no API keys required by default.

Install (30 seconds)

git clone https://github.com/DanMcInerney/architect-loop
cd architect-loop && ./install.sh        # Windows: .\install.ps1
npm i -g @openai/codex@latest            # the builder (Codex CLI >= 0.133)

./install.sh --project installs to the current repo only instead of globally. You need Claude Code on any paid plan and the Codex CLI signed into a ChatGPT plan.

Use (two commands)

/architect                                      # the build loop
/architect-research <what you're considering>   # the research loop

/architect runs one work block: judge the last run, spec the next slice, dispatch builders. /architect-research is for when you're still deciding what to build — its cited report feeds the build loop's PRD.

/architect

One short Fable session per work block — judgment only, it never writes code:

Spec + gates first. Fable specs a one-PR slice, splits it into 1–4 lanes whose file sets are checked for overlap, and commits the acceptance gates to docs/gates/ before any builder starts. Gates are read-only; a builder edit to a gate file fails the slice automatically.
Parallel isolated builders. One fresh codex exec (xhigh) per lane, each in its own git worktree. Builders must argue with the spec before building (silent compliance = defect), build only their declared files, and report raw results — they do not have commit access in the sandbox.
Fable judges and integrates. It runs the gate commands itself (builder claims are hearsay), reads the diff against the spec's intent (passing tests ≠ mergeable work), then commits and merges passing lanes. Judgment happens in a fresh session because the cited evidence favors fresh-context review.
The repo is the only memory. docs/HANDOFF.md (a short table of contents, pruned every session), docs/gates/, docs/lanes/, git history. Not in the repo = didn't happen.
Supervision built in. Liveness checks on dispatched runs, stall triage (diagnose the child process tree, kill the narrowest thing), explicit timeouts on every long command.
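
The overlap check on lane file sets described above can be pictured with a small sketch. This is an illustration only, not the repo's implementation: the function name `find_lane_overlaps`, the lane names, and the file paths are all hypothetical, and the sketch assumes each lane declares its files as a plain set of paths.

```python
from itertools import combinations


def find_lane_overlaps(lanes: dict[str, set[str]]) -> dict[tuple[str, str], set[str]]:
    """Return every pair of lanes that declares at least one file in common.

    `lanes` maps a (hypothetical) lane name to the set of files it may edit.
    An empty result means the lanes can be built in parallel without conflict.
    """
    overlaps: dict[tuple[str, str], set[str]] = {}
    for name_a, name_b in combinations(sorted(lanes), 2):
        shared = lanes[name_a] & lanes[name_b]
        if shared:
            overlaps[(name_a, name_b)] = shared
    return overlaps


# Hypothetical slice split into three lanes; two of them both claim src/models.py.
lanes = {
    "lane-api": {"src/api.py", "src/models.py"},
    "lane-ui": {"src/ui.py"},
    "lane-db": {"src/models.py", "src/db.py"},
}
print(find_lane_overlaps(lanes))
```

In a loop like this one, a non-empty result would presumably send the split back for re-planning before any builder is dispatched, since two worktrees editing the same file would have to be reconciled at merge time.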

/architect-research

Scout-first, like the production deep-research systems — no fixed lane taxonomy:

A cheap Codex scout maps the topic (~10 searches): canonical terminology, the load-bearing systems and papers, the named people, the topic's natural fault lines. Skipped for comparisons and fact-finds.
Fable designs 3–6 topic-specific lanes from the scout's map, drawing per-source-class tactics from a library (academic citation snowballing, dependents-not-stars repo evidence, emerging-vs-hype gating, production pattern mining, expert tracking) — checked for overlap and gaps before dispatch.
Parallel Codex researchers run under hard budgets: search caps, ≤5 subjects per lane, saturation stop, strict findings discipline (URL + date + quote + confidence tag; NOT FOUND beats inference; no recommendations). Expert opinion runs as a second wave, roster-seeded by the first.
Fable verifies and writes. ≥2 independent sources per load-bearing claim, adversarial falsification searches, citations only from URLs actually fetched — then one author writes one decision-oriented report. Gathering parallelizes; synthesis never does.
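
One way to picture the "≥2 independent sources per load-bearing claim" rule is a check over the researchers' findings. The sketch below is an assumption-laden illustration, not the skill's actual logic: the record fields (`claim`, `url`, `date`, `quote`, `confidence`) mirror the findings discipline listed above but their names are hypothetical, and "independent" is approximated crudely as "different hostname", which a real verification step would need to judge more carefully.

```python
from urllib.parse import urlparse


def undersupported_claims(findings: list[dict], min_sources: int = 2) -> list[str]:
    """Return claims backed by fewer than `min_sources` distinct hostnames.

    Assumption: distinct hostnames stand in for independent sources.
    """
    hosts_by_claim: dict[str, set[str]] = {}
    for finding in findings:
        host = urlparse(finding["url"]).hostname or ""
        # Treat www.example.com and example.com as the same source.
        hosts_by_claim.setdefault(finding["claim"], set()).add(host.removeprefix("www."))
    return sorted(claim for claim, hosts in hosts_by_claim.items() if len(hosts) < min_sources)


# Hypothetical findings: claim A has two hosts, claim B has one host cited twice.
findings = [
    {"claim": "A", "url": "https://example.com/post", "date": "2025-01-01", "quote": "...", "confidence": "high"},
    {"claim": "A", "url": "https://example.org/paper", "date": "2025-01-02", "quote": "...", "confidence": "medium"},
    {"claim": "B", "url": "https://www.example.com/a", "date": "2025-01-03", "quote": "...", "confidence": "high"},
    {"claim": "B", "url": "https://example.com/b", "date": "2025-01-04", "quote": "...", "confidence": "low"},
]
print(undersupported_claims(findings))
```

Claims returned by a check like this would be candidates for another search pass or for being reported as NOT FOUND rather than inferred.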

Why this shape

Each design choice is source-backed (full citations in DESIGN.md):

Weak planners hurt more than weak executors — so the architect model does the design, and builders get explicit specs.
Manager + worktree-isolated workers is a well-supported topology for shared-artifact software work; naive shared-file coordination collapses throughput.
Frozen external gates beat trusting the agent — but agents game visible tests and their passing PRs are frequently unmergeable, so the architect also reads the diff.
Memory files rot — so the handoff stays a short map, and detail lives in linked gate/lane files.
The surveyed production deep-research systems use planner-designed decomposition rather than fixed lanes — so research lanes are designed per topic, after a scout pass.

What's in the box

File	What it is
DESIGN.md	The design document — 12 enforced rules, failure-mode table, cited sources
skills/architect/SKILL.md	The architect role: hard rules + procedure
skills/architect/dispatch.md	Verified `codex exec` commands, builder block, worktree fan-out, stall triage
skills/architect/research.md	Slice-scale inline fact-check fan-out
skills/architect/HANDOFF.template.md	The repo-memory file
skills/architect-research/SKILL.md	Research orchestration: scout → design → fan out → verify → write
skills/architect-research/lanes.md	Scout block + source-class tactics library with verified endpoints
tests/validate_skills.py	Repo sanity checks (frontmatter limits, links, fences)

FAQ

Do I need API keys? No. Claude Code runs on your Claude plan; Codex CLI on your ChatGPT plan.

What does a run cost? Builder/researcher runs draw on your ChatGPT plan's 5-hour and weekly quotas; a multi-hour run is a meaningful fraction of a weekly window. Fable's architect sessions are minutes, not hours.

What if a builder wrecks things? Nothing reaches a branch until the architect's tamper, boundary, and gate checks pass — worktrees are discarded and re-dispatched from the freeze commit.
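
The tamper and boundary checks can be sketched as a pure function over the list of files a lane changed. This is a minimal illustration under stated assumptions, not the architect skill's code: `check_lane_diff` and the example paths are hypothetical, and the changed-file list is assumed to come from something like `git diff --name-only` against the freeze commit.

```python
from pathlib import PurePosixPath


def check_lane_diff(changed: list[str], declared: set[str], gates_dir: str = "docs/gates") -> dict[str, list[str]]:
    """Classify a lane's changed files into gate tampering and boundary violations.

    tampered:      any changed path under the read-only gates directory
    out_of_bounds: any other changed path the lane did not declare up front
    """
    tampered = sorted(p for p in changed if PurePosixPath(p).is_relative_to(gates_dir))
    out_of_bounds = sorted(p for p in changed if p not in declared and p not in tampered)
    return {"tampered": tampered, "out_of_bounds": out_of_bounds}


# Hypothetical lane that declared one file but touched a gate file and the README as well.
result = check_lane_diff(
    changed=["src/api.py", "docs/gates/slice-1.md", "README.md"],
    declared={"src/api.py"},
)
print(result)
```

If either list is non-empty, the lane fails before the gate commands are even run, which matches the order the answer above gives: tamper, boundary, then gate checks.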

Can I watch a run? Yes — every dispatch prints the builder block, so you can paste it into an interactive codex session with /goal instead.

Why two skills? Research-grade fan-out costs ~15× chat-level tokens — it should be a deliberate act, not a side-effect of the build loop.

Origin

The original idea came from this X post by @jumperz about using Fable with Codex subagents. I built architect-loop because I couldn't find an easy way to run that pattern, and because it seemed useful to add a few extra operational best practices on top of what Fable can already do when calling Codex subagents.

License

MIT

Hacker News · 102 points · 39 comments
