Adds 6 files in tests/synthetic/ exercising the determinism pipeline at the SC7 brief floor (Jaccard >= 0.833). Plan fixture pair: 40 step titles each with 38 shared (Jaccard 0.905). Review fixture pair: 30 finding-IDs each with 28 shared (Jaccard 0.875). Reuses lib/parsers/jaccard.mjs + lib/parsers/finding-id.mjs. The new pair coexists with tests/lib/review-determinism.test.mjs which holds the older SC4 (0.70) floor against tests/fixtures/ultrareview/. The lower floor protects pipeline regressions; the higher floor anchors the speedup brief's determinism aspiration. [skip-docs]
3 KiB
3 KiB
| type | plan_version | created | task | slug | run_id | steps | ||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ultraplan-synthetic | 1.7 | 2026-05-04 | Add --verbose flag to CLI | verbose-flag | A |
|
Synthetic plan run A — Add --verbose flag to CLI
This fixture represents one synthesized run of /ultraplan-local against a
hand-calibrated brief. It is paired with plan-run-B.md for the
plan-determinism.test.mjs Jaccard floor (≥ 0.833).
How this fixture is used
tests/synthetic/plan-determinism.test.mjs reads the steps array from this
file's frontmatter and computes jaccardSimilarity(stepsA, stepsB). The test
asserts the similarity is at or above the SC7 brief threshold (0.833).
This is a SYNTHETIC fixture — it is NOT the output of a real LLM run. The purpose is to exercise the determinism pipeline (parser + jaccard) on a known input pair so regressions in the pipeline are caught even when LLM determinism cannot be cheaply re-measured.
Fixture math
- A has 40 unique step titles
- B has 40 unique step titles
- Intersection (shared titles): 38
- Union: 42
- Jaccard: 38/42 ≈ 0.9047 (well above 0.833 floor)