docs(ai-psychosis): README + CLAUDE.md cover v1.1.0; ROADMAP.md tracks v1.2

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
Kjell Tore Guttormsen 2026-05-01 18:09:58 +02:00
commit 4d338d973e
2 changed files with 78 additions and 4 deletions

View file

@ -118,6 +118,76 @@ commented on, and omitted entirely when conditions are not met.
**Enable:** Set `layer4: true` in `.claude/ai-psychosis.local.md`
and restart Claude Code. Layer 4 is opt-in (off by default).
## What's new in v1.1.0
v1.1.0 sharpens the pattern detection and grounds Layer 1 in
[Anthropic's CC0 Constitution](https://www.anthropic.com/constitution).
### 12 pushback patterns
Detects "you're wrong, my way is right" signals — escalation against
feedback rather than the user receiving it. Examples:
- `\b(you'?re|you are) wrong\b`
- `\bdo it my way\b`
- `\b(stop|quit) (arguing|pushing back)\b`
The goal is to flag reinforcement-by-pushback: the user repeatedly
overrides Claude's pushback to entrench their original position.
### 4 domain-context patterns
Flags relational/identity framing that, combined with elevated
pushback or validation-seeking, signals narrative crystallization
risk:
- `\b(my|our) relationship\b`
- `\b(my|our) (purpose|mission|destiny)\b`
Domain context alone is not a flag — it is a *modifier* on other
flags.
### Valence-aware composition (silent counting)
Pushback within the same prompt as a healthy correction ("you were
wrong, here's why — but we should still try X") is counted with
neutral valence. The composition is computed in-memory; nothing
written to disk distinguishes positive from negative pushback. This
prevents misinterpretation of healthy disagreement as escalation.
### /interaction-report extensions
`/interaction-report` now includes pushback frequency and domain
framing distribution. A companion script `report-reader.mjs`
reads JSONL records and gracefully handles legacy v1.0.0 records
(missing `pushback` / `domain_context` fields) without producing
NaN values in aggregates.
### SKILL.md grounded in CC0 Constitution
Layer 1's behavioral instructions now cite Anthropic's
[CC0-licensed Constitution](https://www.anthropic.com/constitution)
as primary source, plus a 5-publication research framework
(Anthropic, MIT CSAIL, Nature, arXiv, clinical case reports).
### Honesty notes
- **English-only v1.1.0** — Norwegian and other multilingual
patterns are deferred to v1.2 (see `ROADMAP.md`). For Norwegian
prompts, Layer 2 currently silently misses the new pattern
classes; Layer 1 is unaffected.
- **First-mover honesty** — domain-precision is "good enough" for
v1.1.0 ship, not exhaustive. Precision-tuning planned for v1.2.
### Pattern count
| Category | v1.0.0 | v1.1.0 |
|----------|--------|--------|
| Negative-valence | 25 | 25 |
| Pushback | — | 12 |
| Domain context | — | 4 |
| **Total** | **25** | **41** |
## Architecture
```