docs(ai-psychosis): README + CLAUDE.md cover v1.1.0; ROADMAP.md tracks v1.2

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-01 18:09:58 +02:00 · 2026-05-01 18:09:58 +02:00 · 4d338d973e
commit 4d338d973e
parent 0392f1062e
2 changed files with 78 additions and 4 deletions
--- a/plugins/ai-psychosis/README.md
+++ b/plugins/ai-psychosis/README.md
@ -118,6 +118,76 @@ commented on, and omitted entirely when conditions are not met.
 **Enable:** Set `layer4: true` in `.claude/ai-psychosis.local.md`
 and restart Claude Code. Layer 4 is opt-in (off by default).

+## What's new in v1.1.0
+
+v1.1.0 sharpens the pattern detection and grounds Layer 1 in
+[Anthropic's CC0 Constitution](https://www.anthropic.com/constitution).
+
+### 12 pushback patterns
+
+Detects "you're wrong, my way is right" signals — escalation against
+feedback rather than the user receiving it. Examples:
+
+- `\b(you'?re|you are) wrong\b`
+- `\bdo it my way\b`
+- `\b(stop|quit) (arguing|pushing back)\b`
+
+The goal is to flag reinforcement-by-pushback: the user repeatedly
+overrides Claude's pushback to entrench their original position.
+
+### 4 domain-context patterns
+
+Flags relational/identity framing that, combined with elevated
+pushback or validation-seeking, signals narrative crystallization
+risk:
+
+- `\b(my|our) relationship\b`
+- `\b(my|our) (purpose|mission|destiny)\b`
+
+Domain context alone is not a flag — it is a *modifier* on other
+flags.
+
+### Valence-aware composition (silent counting)
+
+Pushback within the same prompt as a healthy correction ("you were
+wrong, here's why — but we should still try X") is counted with
+neutral valence. The composition is computed in-memory; nothing
+written to disk distinguishes positive from negative pushback. This
+prevents misinterpretation of healthy disagreement as escalation.
+
+### /interaction-report extensions
+
+`/interaction-report` now includes pushback frequency and domain
+framing distribution. A companion script `report-reader.mjs`
+reads JSONL records and gracefully handles legacy v1.0.0 records
+(missing `pushback` / `domain_context` fields) without producing
+NaN values in aggregates.
+
+### SKILL.md grounded in CC0 Constitution
+
+Layer 1's behavioral instructions now cite Anthropic's
+[CC0-licensed Constitution](https://www.anthropic.com/constitution)
+as primary source, plus a 5-publication research framework
+(Anthropic, MIT CSAIL, Nature, arXiv, clinical case reports).
+
+### Honesty notes
+
+- **English-only v1.1.0** — Norwegian and other multilingual
+  patterns are deferred to v1.2 (see `ROADMAP.md`). For Norwegian
+  prompts, Layer 2 currently silently misses the new pattern
+  classes; Layer 1 is unaffected.
+- **First-mover honesty** — domain-precision is "good enough" for
+  v1.1.0 ship, not exhaustive. Precision-tuning planned for v1.2.
+
+### Pattern count
+
+| Category | v1.0.0 | v1.1.0 |
+|----------|--------|--------|
+| Negative-valence | 25 | 25 |
+| Pushback | — | 12 |
+| Domain context | — | 4 |
+| **Total** | **25** | **41** |
+
 ## Architecture

 ```