DECNET

Author	SHA1	Message	Date
anti	e4626879f6	perf(pytest): 194s → 4s collection — lazy heavy imports + norecursedirs Four-part fix for the collection bottleneck that was blocking the dev loop: 1. Lazy mitreattack.stix20 import in attack_stix.py — deferred to first _load() call (TYPE_CHECKING guard at top level) 2. Lazy misp_stix_converter import in both MISP export routers — moved from module level into the route handler body 3. Lazy attack_catalog / attack_stix in ttp.py repo mixin — thin wrapper functions so the import chain never fires at module load time 4. tests/api/conftest.py — `from decnet.web.api import app` moved inside the `client()` fixture; `pytest_ignore_collect` broadened to skip all test_schemathesis*.py variants (not just test_schemathesis.py), which were launching a subprocess server at module-import time 5. pyproject.toml — `norecursedirs` for tests/live, tests/stress, tests/service_testing, tests/docker, tests/perf so these directories are never entered; `-m` filter removed from addopts (now redundant); `--dist loadscope` → `--dist load` to unblock workers immediately 6. behave_core / behave_shell rename — BEHAVE packages dropped the `decnet_` prefix; reinstalled editable installs and updated all 14 import sites across profiler, ttp, bus, and correlation modules	2026-05-10 06:41:25 -04:00
anti	0fba6b6113	feat(profiler/behave_shell): emit cognitive.error_resilience.fallback_to_man For each errored command, check whether the next command's first_token_hash is in {man, help, info} (precomputed at module load). At least one match → present, else absent. The --help / -h flag forms aren't first tokens; v0.2 will reconsider once arg-token hashing is justified by corpus.	2026-05-04 00:01:45 -04:00
anti	8183218d29	feat(profiler/behave_shell): emit cognitive.error_resilience.frustration_typing Compares median within-command IAT for commands following an errored command vs commands following a successful one. Relative absolute delta buckets to low / moderate / high. Skips when either group is empty (no errors, or no clean baseline). v0.1; D.8 re-tunes.	2026-05-04 00:00:36 -04:00
anti	b704352783	feat(profiler/behave_shell): emit cognitive.error_resilience.retry_tactic Modal response across Command.errored=True commands: * same first_token_hash on next command → rerun * different first_token_hash → switch * no next command → abort Tiebreak in registry order. The fourth registry value 'modify' requires within-command arg diffing (PII boundary); deferred to v0.2.	2026-05-03 23:58:58 -04:00
anti	f286c84d95	feat(profiler/behave_shell): emit cognitive.tool_vocabulary Absolute distinct first_token_hash count, bucketed against TOOL_VOCAB_NARROW_MAX / TOOL_VOCAB_BROAD_MIN. v0.1; D.8 re-tunes.	2026-05-03 23:56:22 -04:00
anti	6c2e4ada83	feat(profiler/behave_shell): emit cognitive.planning_depth Distribution of inter-command IATs bucketed against IKI_THINK_MAX_S (deep) and INTER_CMD_INSTANT_MAX (reactive); fall-through is shallow. v0.1 thresholds; D.8 re-tunes.	2026-05-03 23:55:16 -04:00
anti	2254651270	feat(profiler/behave_shell): emit cognitive.exploration_style Two-axis classification over the first_token_hash sequence: repetition_rate (drilling) vs backtrack_rate (jumping among prior tools). chaotic/targeted/methodical buckets. v0.1 thresholds; D.8 re-tunes.	2026-05-03 23:54:03 -04:00
anti	f948e10830	feat(profiler/behave_shell): emit cognitive.cognitive_load Composite over three [0, 1]-clipped sub-signals (chunking variance, error rate from D.0's Command.errored, pace variability), mean-aggregated and bucketed against COGNITIVE_LOAD_LOW_MAX / COGNITIVE_LOAD_MEDIUM_MAX. Components missing data drop out of the mean rather than zeroing it. v0.1 thresholds; D.8 re-tunes once D.2-D.7 are stable. Confidence held at 0.60 (composite over soft sub-signals) and halved below the 5-command sample-size floor.	2026-05-03 23:52:29 -04:00
anti	842b7de950	feat(profiler/behave_shell): emit cognitive.inter_command_consistency BEHAVE-EXTRACTOR.md Phase A Step 8. Dispersion / bimodality of inter-command pauses. HUMAN-bimodal vs LLM-metronomic. * _features/cognitive.py:inter_command_consistency(ctx) emits one Observation in {metronomic, variable, bimodal}. * CV = stdev / mean of ctx.inter_cmd_iats. CV < 0.40 → metronomic (LLM-pure; corpus anchor 0.24); CV ≥ 1.50 → bimodal heuristic (LLM-assisted human; v0.1 placeholder, true bimodal via Hartigan dip is registry-flagged for v0.2); else → variable (human; corpus anchor 0.94). * < 2 IATs or zero mean → skip emission. < 5 commands halves confidence (0.40 vs 0.75) per sample-size honesty. Tests: too-few IATs → no emission, uniform → metronomic, human-like dispersion → variable, extreme bursts+gaps → bimodal, low-sample-count → reduced confidence. Step 8 closes the six-primitive calibration floor for Phase A. Step 9 (calibration grid lockdown) is the gate that pins it.	2026-05-03 07:56:49 -04:00
anti	2f8c107e70	feat(profiler/behave_shell): emit cognitive.feedback_loop_engagement BEHAVE-EXTRACTOR.md Phase A Step 7. The orthogonal axis — does the operator's pause-after-command correlate with bytes of output they just saw? Splits HUMAN/CLAUDE-CL (closed_loop) from LW-sim/CLAUDE-FF (fire_and_forget); cuts ACROSS the LLM/human axis. * _features/cognitive.py:feedback_loop_engagement(ctx) emits one Observation in {closed_loop, fire_and_forget, unknown}. * Pearson correlation between ctx.output_per_cmd[i] and ctx.inter_cmd_iats[i] (paired by construction in Step 4); via statistics.correlation with constant-series fallback to "unknown". * r > FEEDBACK_CORRELATION_MIN (0.30) → closed_loop; otherwise (zero, negative, or undefined) → fire_and_forget. * First primitive that depends on output events: zero output events in the shard or fewer than FEEDBACK_MIN_PAIRS (5) pairs → emit "unknown" at confidence 1.0 (the absence-of-data is itself a high-confidence answer). Zero-command session skips entirely. Tests: no-output → unknown, few-pairs → unknown, strong positive r → closed_loop, constant pace → fire_and_forget/unknown, negative r → fire_and_forget.	2026-05-03 07:55:38 -04:00
anti	3fc6ea5f75	feat(profiler/behave_shell): emit cognitive.command_branch_diversity BEHAVE-EXTRACTOR.md Phase A Step 6. Content-based playbook-vs- adaptive split. Splits CLAUDE-FF (linear_playbook, ~10 distinct tools) from CLAUDE-CL (adaptive_branching, 5-6 tools with curl re-invoked) per the 2026-05-02 empirical anchor. * _features/cognitive.py:command_branch_diversity(ctx) emits one Observation in {linear_playbook, adaptive_branching, unknown}. * unique_first_token_hashes / total_commands ratio. ≥ 0.80 → linear_playbook, otherwise adaptive_branching (the doc instructs bias-to-adaptive in the middle band — that's the discriminative signal we actually want). * < 5 commands → "unknown" at confidence 1.0 (the absence of data is itself a high-confidence answer per the registry's allowed vocabulary). Zero-command session skips emission entirely. Tests cover unique-tokens → linear, repeated-tokens → adaptive, middle band → adaptive (bias), under-floor → unknown @ 1.0, plus PII regression: raw tokens never appear in the serialised observation.	2026-05-03 07:54:13 -04:00
anti	e52a0e0381	feat(profiler/behave_shell): emit cognitive.inter_command_latency_class BEHAVE-EXTRACTOR.md Phase A Step 5. Classifies the operator's thinking pace between commands. Splits LW-sim / CLAUDE-FF / CLAUDE-CL. * _features/cognitive.py:inter_command_latency_class(ctx) emits one Observation in {instant, typing_speed, deliberate, llm_lightweight, llm_heavyweight, long}, computed as the median of ctx.inter_cmd_iats bucketed against the prototype thresholds (v0.2 split: lightweight 2-8s, heavyweight 8-30s). * Sample-size honesty: < 5 commands halves confidence (0.40 vs 0.80) per BEHAVE-EXTRACTOR.md. * Threshold consts (INTER_CMD_*_MAX, MIN_COMMANDS_FOR_FULL_CONFIDENCE, plus parked Step 6/7/8 thresholds for the next three commits) added to _thresholds.py. Tests cover all six buckets at empirically-anchored IATs (15s ≈ Claude Opus driving recon via tmux send-keys), plus the single-command no-IAT and low-sample-count paths.	2026-05-03 07:52:39 -04:00

12 Commits