Step 17 of v4.1: escalate-handler invoked. The live LLM budget ($60-120 for
4 plan runs of /trekplan --profile {economy,premium} on
examples/01-add-verbose-flag/brief.md) was not authorized for the
v4.1-execute-4b session.
Per Step 17 escalate-fallback (and NEXT-SESSION-PROMPT.local.md
fallback-strategy): document economy-Plan as parked, use balanced as
low-threshold profile, defer empirical calibration to v4.2.
Files:
tests/synthetic/profile-plan-run-economy-1.md — 30 steps, parked-synthetic
tests/synthetic/profile-plan-run-economy-2.md — 30 steps, parked-synthetic
tests/synthetic/profile-plan-run-premium-1.md — 40 steps, parked-synthetic
tests/synthetic/profile-plan-run-premium-2.md — 40 steps, parked-synthetic
tests/synthetic/profile-jaccard-calibration.md — threshold 0.55 pinned per
research/02 conservative starting value
Replacement procedure documented in calibration.md "How to replace"
section. Trigger conditions for empirical re-run:
1. Cross-tier smoke-test (Step 18) flips red on a real run
2. v4.2 LLM-budget approval
3. New profile tier added
| type | plan_version | created | task | slug | run_id | profile_used | status | steps |
|---|---|---|---|---|---|---|---|---|
| trekplan-synthetic | 1.7 | 2026-05-09 | Add --verbose flag to CLI | verbose-flag | premium-1 | premium | parked-synthetic | |
Synthetic plan run premium-1 — Add --verbose flag to CLI (PARKED)
This fixture is a SYNTHETIC PLACEHOLDER for the empirical Jaccard calibration,
which requires a live LLM budget ($60-120 for 4 plan runs). It is marked
status: parked-synthetic per the Step 17 escalate-handler.
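A minimal sketch of how a test harness might detect a parked fixture from its front matter. It assumes the fixture begins with a `---`-delimited block of flat `key: value` lines; `readFrontMatter` and `isParkedSynthetic` are hypothetical names, not helpers from this repository.

```javascript
// Hypothetical helper: read flat `key: value` pairs from a leading
// `---` ... `---` front-matter block. Nested entries (indented lines or
// list items, such as the entries under `steps`) are skipped, not parsed.
function readFrontMatter(markdown) {
  const match = /^---\r?\n([\s\S]*?)\r?\n---/.exec(markdown);
  if (!match) return {};
  const fields = {};
  for (const line of match[1].split(/\r?\n/)) {
    if (/^\s/.test(line) || line.startsWith("-")) continue;
    const colon = line.indexOf(":");
    if (colon === -1) continue;
    fields[line.slice(0, colon).trim()] = line.slice(colon + 1).trim();
  }
  return fields;
}

// Hypothetical check: true when the fixture declares itself parked.
function isParkedSynthetic(markdown) {
  return readFrontMatter(markdown).status === "parked-synthetic";
}
```

A harness could use such a check to report parked fixtures separately from empirical ones, so that a green result on synthetic data is not mistaken for a calibrated one.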
Why parked
The rationale is the same as for profile-plan-run-economy-1.md: the session
running v4.1-execute-4b did not have authorization for live LLM invocation.
This fixture mirrors the existing baseline plan-run-A.md (40 steps, opus
granularity) because the premium profile uses opus for the plan and review
phases per lib/profiles/premium.yaml.
Step-shape rationale
The premium profile uses opus for the plan and review phases (per
lib/profiles/premium.yaml). According to the empirical observation in
research/02, opus plans tend toward finer-grained steps, more explicit
verification entries, and richer edge-case decomposition than sonnet plans.
The 40 entries here capture the level of detail typical of an opus run.
Cross-tier Jaccard pairing
This fixture is paired with profile-plan-run-economy-1.md and
profile-plan-run-economy-2.md in
tests/integration/profile-jaccard-smoke.test.mjs (Step 18). The expected
cross-tier Jaccard for the parked-synthetic run-pair is documented in
profile-jaccard-calibration.md.
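For reference, a minimal sketch of a Jaccard comparison between two plan runs, treating each plan as a set of step titles. The function names, the normalization rule, and the use of 0.55 as a lower bound are assumptions for illustration; the actual smoke test may extract and compare steps differently.

```javascript
// Hypothetical normalization: trim, lowercase, and collapse whitespace so
// trivial formatting differences do not count as distinct steps.
function normalizeStep(title) {
  return title.trim().toLowerCase().replace(/\s+/g, " ");
}

// Jaccard similarity |A ∩ B| / |A ∪ B| over two lists of step titles.
function jaccard(stepsA, stepsB) {
  const a = new Set(stepsA.map(normalizeStep));
  const b = new Set(stepsB.map(normalizeStep));
  if (a.size === 0 && b.size === 0) return 1; // two empty plans count as identical
  let intersection = 0;
  for (const step of a) {
    if (b.has(step)) intersection += 1;
  }
  const union = a.size + b.size - intersection;
  return intersection / union;
}

// 0.55 is the pinned starting value; treating it as a lower bound is an assumption.
const THRESHOLD = 0.55;

function meetsThreshold(stepsA, stepsB, threshold = THRESHOLD) {
  return jaccard(stepsA, stepsB) >= threshold;
}

// Illustrative step titles only, not taken from the real fixtures.
const economySteps = ["Add flag parsing", "Wire logger", "Update docs"];
const premiumSteps = ["Add flag parsing", "Wire logger", "Add edge-case tests", "Update docs"];
console.log(jaccard(economySteps, premiumSteps)); // 0.75 (3 shared steps out of 4 distinct)
```

Comparing sets of exact titles is strict: a reworded step counts as a miss, which is one reason a conservative starting threshold is sensible until empirical runs are available.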