The six phases
Phase 1 — Data Review
Audit, hygiene, exclusion filters, alignment, EDA, data versioning, DataLoader implementation with per-modality augmentation. Outputs:
reports/data_quality_report.md, drift baseline, src/data/.Phase 2 — Feature Engineering & Selection
Classical ML: derived features (lags, rolling stats, interactions), statistical filtering, VIF pruning, domain validation. Deep Learning: input representation design, transfer learning assessment, augmentation strategy.
Phase 3 — Model Design
Architecture selection (with families per data type), loss function design (custom losses, regularisation, auxiliary objectives), training strategy (optimiser, LR schedule, mixed precision, gradient clipping, checkpointing).
Phase 4 — Training & Iteration
Baseline training, DL diagnostics (gradient flow, LR finder, activation stats), autonomous iteration loop, cross-validation, ensemble exploration. The autonomous experiment loop runs here — Model Builder proposes, Oracle validates, orchestrator decides whether to continue.
Phase 5 — Analysis & Validation
Explainability (SHAP, GradCAM, attention viz), domain consistency, error analysis (per-class, failure cases, bias detection), ablation studies, statistical significance, reproducibility verification.
The three workflow modes
The orchestrator selects the right mode from the plan’sWorkflow configuration section:
- classical_ml
- deep_learning
- research
All six phases run with emphasis on feature engineering, statistical selection, and model comparison across algorithm families. Default for tabular data.
Gates
Between each phase sits a gate — an explicit validation checkpoint.Gate types
Automated gates
Run validation programmatically: artifact presence checks, oracle threshold checks, test suite passes, schema conformance. Auto-PROCEED when all checks pass.
Blocking gates
Require explicit human approval. Default at Gate 2 (feature/representation review) and Gate 5 (model + validation report review). The session pauses; the human runs
zo gates to inspect, then approves or rejects.Gate decisions
When a gate is reached, the orchestrator can take one of three actions:| Decision | Behavior |
|---|---|
| PROCEED | Move to the next phase. Phase snapshot written to {memory_root}/snapshots/. |
| ITERATE | Stay in the current phase. In Phase 4 with the autonomous loop, mints a child experiment and re-runs. |
| HUMAN_STOP | Pause for human input. Used when blocking gates fire or when the experiment loop hits a DEAD_END / BUDGET_EXHAUSTED verdict. |
Gate modes
The plan or CLI sets the gate mode for the whole run:- supervised (default)
- auto
- full-auto
Every gate — automated and blocking — pauses for human review. Maximum oversight. Best for the first run of a new project.
zo gates set <mode> -p <project>.
The autonomous experiment loop (Phase 4 only)
Inauto and full-auto modes, Phase 4 runs as a closed loop:
## Experiment Loop block):
- TARGET_HIT — best metric ≥
stop_on_tierthreshold - BUDGET_EXHAUSTED —
max_iterationsreached - PLATEAU — last
plateau_runsimprovements all withinplateau_epsilon - DEAD_END — last N hypotheses are all Jaccard-similar to an earlier experiment (Model Builder stuck rephrasing) — escalates to human
Phase artifact contracts
Each phase has required artifacts that the gate checks for. Missing any artifact = automatic gate failure with a clear message about what’s missing. Phase 6’s full set:reports/data_quality_report.md(Phase 1)reports/training_report.md(Phase 4)reports/analysis_report.md(Phase 5)reports/model_card.md(Phase 6)reports/validation_report.md(Phase 6)models/<project>_cnn.pt(PyTorch slim weights)models/<project>_cnn.onnx(ONNX export)- Tests: oracle threshold + per-class floor + summary tier
Phase snapshots
At every gate PROCEED — automated or human — the orchestrator writes aPhaseSnapshot to {memory_root}/snapshots/{phase_id}_{ISO-timestamp}.md. Markdown body + YAML frontmatter (same pattern as STATE.md). Captures: phase identity, gate decision, decisions logged during the phase, issues encountered, recent comms events.
Long-running projects accumulate thousands of comms events per phase. The snapshot is a single scannable file per phase — for humans, for next-phase agents, for future dashboards.
Next
Memory & continuity
Where phase snapshots, decisions, and priors live.
zo gates
The CLI for inspecting and toggling gate mode.