feat(profiler): extract motor.digraph_simhash keystroke biometric

Per-session 64-bit SimHash of inter-keystroke digraph flight times:
walk single-char input events, accumulate flight time per (c1,c2),
bucket the median, Charikar-SimHash the bucketed pairs. Locality-
sensitive so the same typist is Hamming-close across sessions; pastes
and think-pauses break the chain; silent below the sample-size floor.

New shared decnet/util/simhash.py (simhash64/hamming64/bytes helpers).
Registered as a conditional Tier-A primitive (count 37->38); requires
behave-shell>=0.1.2.
This commit is contained in:
2026-06-16 16:59:57 -04:00
parent 372375194c
commit 66c73ce59d
12 changed files with 283 additions and 5 deletions

View File

@@ -91,10 +91,14 @@ def test_no_extractor_set_drifts_from_registry() -> None:
)
def test_tier_a_count_is_37() -> None:
"""Sanity check: Tier-A count matches the design doc (37 primitives)."""
assert len(_tier_a_primitives()) == 37, (
f"Expected 37 Tier-A primitives per BEHAVE-EXTRACTOR.md; "
def test_tier_a_count_is_38() -> None:
"""Sanity check: Tier-A count matches the design doc.
38 since behave-shell 0.1.2 added ``motor.digraph_simhash`` (the
keystroke-rhythm biometric); was 37.
"""
assert len(_tier_a_primitives()) == 38, (
f"Expected 38 Tier-A primitives per BEHAVE-EXTRACTOR.md; "
f"got {len(_tier_a_primitives())}. Update Phase H if the "
f"spec genuinely changed, or adjust TIER_B_ALLOWLIST."
)