Operator Flow
This is the public runbook that connects host readiness, live state capture, incident export, replay, and artifact interpretation into one repeatable operator sequence.
1. Validate the target before execution
Start with the cheapest checks first:
validate --spec=...to catch structural spec errors.describe --spec=...ordot --spec=...when you need a human review of the intended topology.doctoron the destination host before enabling aggressive tuning profiles.
Treat doctor output as a contract check, not a benchmark. It exists to separate unsupported host posture from runtime logic failures.
2. Capture live state when behavior diverges
Use snapshot as the direct state-capture verb.
- SingleRing surfaces should export the ring state and gating view.
- EdgePlane surfaces additionally expose adapter-level transport health in the snapshot payload.
- ActionGraph and DataGuard still share the same snapshot and heartbeat machinery when running with agent reporting bootstrap.
Capture the snapshot before restarting a service if the state itself might explain the failure.
3. Export an incident bundle early
incident-bundle is the canonical triage handoff. Export it as soon as you know the event is no longer a transient operator typo.
The bundle is what turns a local runtime problem into a reproducible review unit.
4. Read artifacts in the right order
Start with the artifacts that explain runtime intent before the artifacts that explain downstream consequences:
spec_validation.jsonpolicy_audit.jsonenvironment.jsonsingle_ring_snapshot.jsonorlane_graph_state.json- product-specific artifacts such as DataGuard reports or replay audits
Typical interpretations:
- spec-validation failure means you are not debugging runtime behavior yet,
- policy-audit mismatch means the environment or approval posture is wrong,
- environment mismatch means the host posture may differ from the assumed tuning contract,
- snapshot anomalies point to lag, gating, adapter failures, or stalled execution state.
5. Replay from the bundle, not from memory
Replay should consume the captured bundle rather than an operator's reconstructed version of the incident.
Recommended posture:
- begin with dry-run replay,
- inspect the replay audit,
- promote to execute only when approval and environment posture are both correct.
That keeps replay audit trails meaningful and avoids turning replay into an uncontrolled second incident.