feat(profiler): extract motor.digraph_simhash keystroke biometric

Per-session 64-bit SimHash of inter-keystroke digraph flight times:
walk single-char input events, accumulate flight time per (c1,c2),
bucket the median, Charikar-SimHash the bucketed pairs. Locality-
sensitive so the same typist is Hamming-close across sessions; pastes
and think-pauses break the chain; silent below the sample-size floor.

New shared decnet/util/simhash.py (simhash64/hamming64/bytes helpers).
Registered as a conditional Tier-A primitive (count 37->38); requires
behave-shell>=0.1.2.
This commit is contained in:
2026-06-16 16:59:57 -04:00
parent 372375194c
commit 66c73ce59d
12 changed files with 283 additions and 5 deletions

View File

@@ -292,6 +292,17 @@ CV_BURSTY_MAX: float = 1.50
# Need this many input events before we'll claim a cadence at all.
MIN_INPUTS_FOR_CADENCE: int = 5
# ── motor.digraph_simhash (keystroke-rhythm biometric) ──────────────────────
# A digraph is two consecutive single-char keystrokes; its flight time is the
# inter-event gap. We need enough distinct pairs AND enough samples before the
# SimHash is stable enough to fingerprint a typist — below this it's noise.
MIN_DIGRAPHS_FOR_SIMHASH: int = 8
MIN_DIGRAPH_SAMPLES: int = 20
# Median flight time per digraph is quantized into these log-spaced buckets
# (upper edges, seconds). Coarse on purpose: the same typist must land in the
# same bucket despite jitter, while a clearly faster/slower typist separates.
DIGRAPH_FLIGHT_BUCKETS_S: tuple[float, ...] = (0.03, 0.06, 0.12, 0.25, 0.5, 1.0)
# ── motor.motor_stability (Step B.2) ────────────────────────────────────────
# Tremor proxy: fraction of within-burst IATs below TREMOR_FAST_FLOOR_S
# (30 ms — physiologically implausible double-press floor; humans can't