DECNET

Author	SHA1	Message	Date
anti	75ff0ede1f	fix(ttp): correct intel_lifter mappings + repoint ThreatFox to threat_type Three bug classes uncovered by the 2026-05-02 ship-time audit: * AbuseIPDB code/name mismatch in v1: cat 10 was treated as DDoS (it's Web Spam — DDoS is cat 4, intentionally unmapped per A.10) and cat 17 as VPN IP (it's Spoofing — VPN IP is cat 13). Both typos mirrored in code AND the design doc Appendix A.10. Code now matches the AbuseIPDB taxonomy exactly; cat 17 retargets to T1566 (email-spoofing as a phishing precursor), and cats 7 (Phishing) and 16 (SQL Injection) pick up T1566 / T1190 emissions that v1 didn't cover. * ThreatFox dispatch keyed on `ioc_type` in v1, but `ioc_type` is the indicator format (url / domain / hash variants) and carries no ATT&CK signal. The canonical taxonomy field per ThreatFox's API is `threat_type` (botnet_cc / payload_delivery / payload / cc_skimming). Repoint dispatch through the new `threatfox_threat_types` payload field; `ioc_type` rides as evidence only. Also adds the missing cc_skimming -> T1056 (Input Capture) mapping and registers T1056 in attack_catalog.py. * GreyNoise bare-malicious lane: a `classification == "malicious"` row with no recognised tag used to emit nothing. Now lights T1071 at a half multiplier, suppressed when a tag already fires T1071 to avoid double-stamping at conflicting confidence levels.	2026-05-02 18:08:48 -04:00
anti	a31ad82880	feat(intel): project per-provider taxonomy into attacker.intel.enriched payload The TTP worker forwards the bus payload verbatim to the IntelLifter as TaggerEvent.payload. The pre-audit publish payload only carried {attacker_uuid, attacker_ip, aggregate_verdict, providers}, so even with the new AttackerIntel taxonomy columns populated the lifter still saw nothing. Lift the relevant fields (categories / tags / threat_types / malware family / score / classification) into the bus event and decode JSON-string list columns back to native lists at the boundary.	2026-05-02 18:08:29 -04:00
anti	999d3494b4	feat(intel): persist per-provider taxonomy on AttackerIntel for TTP dispatch The 2026-05-02 ship-time audit of the R0054-R0058 intel rule pack found that AbuseIPDB / GreyNoise / ThreatFox stored only the aggregate verdict (score / classification / listed-bool) plus the raw response blob. The TTP IntelLifter expects per-provider taxonomy fields (categories, tags, threat_types) that were never populated, so R0054 / R0055 / R0057 emitted zero tags in production despite passing unit tests. Add typed columns: abuseipdb_categories, greynoise_tags, greynoise_name, feodo_malware_family, threatfox_threat_types, threatfox_ioc_types, threatfox_malware_families. Each provider now parses the relevant taxonomy out of the upstream response and writes it through column_updates. JSON-list columns ride as TEXT with default "[]" to keep the SQLite/MySQL backend split honest, deserialised back to native lists by the repo on read.	2026-05-02 18:07:57 -04:00
anti	d1c4a48963	feat(ttp): split bash CMD evidence into structured uid/user/src/pwd/cmd rows The inspector was dumping the whole `CMD uid=0 user=root src=… pwd=… cmd=nmap -p- 192.168.1.0/24` syslog body into a single ``command_text`` blob. ANTI: "I'd like to separate the fields." Done — three layers work together: 1. Collector session aggregator: new `_parse_cmd_msg` splits the bash PROMPT_COMMAND msg into `{uid, user, src, pwd, command}`. The session-ended envelope's per-command dict now carries the structured fields, with `command_text` set to just the cmd= value (preserving embedded whitespace — `nmap -p- 1.2.3.0/24` etc.). 2. Rule engine: per-source_kind auxiliary evidence list (`_AUX_EVIDENCE_FIELDS`). For `command` events the engine automatically promotes uid/user/src/pwd into the persisted `evidence` dict on top of the rule's explicit `evidence_fields`. Engine-controlled, not per-rule — adding a new aux field is one line here, not a 30-rule YAML sweep, and rule authors can't accidentally drop it. 3. TTPInspector frontend: evidence renders as a structured `kvs` grid (UID / USER / SRC / PWD / CMD rows) instead of pretty-printed JSON. Primary-order list keeps shell fields at the top; everything else falls below alphabetically so unfamiliar evidence shapes still surface predictably. Tests: - session_aggregator pins the structured-fields emit (uid/user/src/ pwd/command_text without "CMD" prefix, embedded whitespace preserved). - rule_engine_tagger pins the aux-field auto-promotion + the no-`None`-leakage path when payload doesn't carry an aux key.	2026-05-02 03:20:53 -04:00
anti	84699f89da	feat(ttp): show canonical ATT&CK technique names in the TTPs UI "T1595" alone is opaque; "T1595 — Active Scanning" tells you the story at a glance. The names come from a backend-side static catalogue pinned to the same ATT&CK release as the rule engine (_ATTACK_RELEASE = "v15.1") — names are the canonical MITRE labels, not author-supplied strings on rules, so a rule author can't typo a name and the entire fleet sees the typo. - New `decnet/ttp/attack_catalog.py` with `TECHNIQUE_NAMES` covering every technique_id + sub_technique_id emitted by `rules/ttp/` (R0001..R0058 → 69 IDs in the v0 pack). - `IdentityTechniqueRow` / `TechniqueRollupRow` / `CampaignTechniqueRow` / `TTPTagDetailRow` gain optional `technique_name` / `sub_technique_name` fields. Repo + router populate them from the catalogue at row-construction time. None when an ID isn't in the catalogue — UI falls back to the bare ID. - Coverage test (`tests/ttp/test_attack_catalog.py`) walks every YAML rule and asserts every emitted ID has a catalogue entry, so a future rule author who forgets to update the catalogue gets a loud failure rather than a silent UI fallback. Frontend: - `TTPsObservedSection` shows "T1595.002 — Active Scanning: Vulnerability Scanning" instead of just the ID, with overflow ellipsis + tooltip for narrow viewports. Inspector header / TECHNIQUE row also surface the names.	2026-05-02 03:10:07 -04:00
anti	42e9492118	feat(ttp): inspector drawer surfaces evidence + rule_id behind each technique The TTPsObservedSection rollup tells the operator "we saw T1059" but not why. Click any technique row → side drawer opens listing every ttp_tag row in scope with the persisted evidence JSON, firing rule_id / rule_version, source_kind / source_id, confidence, and created_at. Mirrors the CredentialReuseInspector / BountyInspector pattern (drawer-backdrop + bd-head/bd-body + kvs grid). Backend: - New `GET /api/v1/ttp/tags/by-{scope}/{uuid}/{technique_id}` (`scope ∈ {identity, attacker, session}`, optional `?sub_technique_id=`, `?limit=` capped to 1000). Returns raw TTPTag rows newest-first. - New `TTPTagDetailRow` Pydantic model + re-export. - New repo method `list_tags_by_scope_and_technique` on TTPMixin (+ abstract on BaseRepository) — single query branched on scope; identity scope projects through `Attacker.identity_id` the same way `list_techniques_by_identity` does. - Tests: evidence round-trips, sub_technique filter, JWT-required, empty scope, unknown scope rejected. Frontend: - New `TTPInspector.tsx` + `TTPInspector.css` (violet accent, slide animation, focus-trapped panel matching the existing inspector family). - `TTPsObservedSection`'s TechniqueBar is now click+keyboard activatable; clicking opens the inspector for that (technique, sub_technique) tuple. mypy clean. 532 passed in the targeted sweep.	2026-05-02 02:55:05 -04:00
anti	c4e29e3bf9	fix(ttp): resolve attacker_uuid from attacker_ip on bus-event consume The collector's `attacker.session.ended` envelope carries `attacker_uuid: null` and `attacker_ip: <ip>` because the collector doesn't talk to the DB. The TTP worker passed that null straight through, and `TTPTag.__init__` raised the documented invariant: ValueError: ttp_tag requires at least one of attacker_uuid / identity_uuid; both NULL is not a valid anchor. The worker now resolves `attacker_uuid` from `attacker_ip` via `BaseRepository.get_attacker_uuid_by_ip` before fanning out the event. When the IP isn't in the DB yet (profiler hasn't ingested the row), the event is dropped with one log line — better than exploding mid-tag. - New `get_attacker_uuid_by_ip(ip) -> str \| None` on the repo (BaseRepository abstract + AttackersCoreMixin impl). - `_resolve_attacker_uuid` helper in `decnet/ttp/worker.py` runs before `_build_events`. Short-circuits when the payload already has either anchor; drops the event when neither anchor is resolvable. - Tests pin: short-circuit on existing uuid/identity, repo lookup, drop on unknown IP, drop on "Unknown" sentinel, drop on no-anchor payload, drop on repo failure.	2026-05-02 02:44:30 -04:00
anti	f9901befc4	docs(ttp): catalogue producer wiring for every TTP-watched topic Add a "Producer wiring" subsection under TTP_TAGGING.md §"Bus topics" mapping every topic the TTP worker subscribes to onto the file:line that publishes it. Calls out the gap (`email.received` has no producer today) and the new `attacker.session.ended` payload shape from the collector aggregator. Also lists the four producer regression tests added in this series so a future contributor sees the safety net before staring at the silent rule engine. DEBT.md gets the `attacker.email.received` follow-up entry — wire the producer when SMTP-receive persistence lands, since today the honeypot relay path doesn't store received emails anywhere a publisher could read from.	2026-05-02 02:39:23 -04:00
anti	b5ce236cab	test(bus): pin scope-(2) producer wiring for reuse / clusterer / intel Three producer-side regression guards. Each drives the worker's run loop with a fake bus + stubbed repo and asserts the documented topic fires when the producer has data: - reuse correlator → credential.reuse.detected (one finding row) - clusterer → identity.formed + identity.merged (one ClusterResult) - intel worker → attacker.intel.enriched (one unenriched attacker + a fake provider returning a "malicious" verdict) These complement commit 1's attacker.session.ended producer test — together the four cover every TTP-relevant publisher in the tree (modulo email.received, which has no producer yet; tracked in DEBT.md).	2026-05-02 02:38:24 -04:00
anti	b043c96d29	feat(collector): publish attacker.session.ended on session_recorded events The TTP worker subscribes to attacker.session.ended but no upstream component published it — the rule pack (R0001–R0030) therefore never fired on live SSH traffic even after the consume-side wiring landed in E.3.18a/b/c. The collector now hosts a per-attacker_ip command index (_SessionAggregator) that watches the same parsed-event stream as _publish_log. Shell `command` events are appended to a per-IP list; on `session_recorded` the aggregator slices the list to commands inside the [ended_at - duration_s, ended_at] window and publishes attacker.session.ended with the session metadata + commands list. The TTP worker's _build_events fan-out (E.3.18b) turns each command into a source_kind="command" TaggerEvent that the RuleEngineTagger (E.3.18c) matches against R0001–R0030. Memory bound: per-IP entries TTL-evict at DECNET_COLLECTOR_SESSION_AGG_TTL_SEC (default 3600 s). Publish failures are swallowed in the aggregator — a misbehaving bus cannot stall the per-container stream threads.	2026-05-02 02:35:08 -04:00
anti	d9d2a80573	fix(collector): unwrap double-wrapped RFC5424 around bash PROMPT_COMMAND Honeypot SSH containers run `PROMPT_COMMAND` that calls `logger --rfc5424 --msgid command -t bash "CMD …"`. The Docker-stdout reader prepends an outer RFC5424 envelope (HOSTNAME=<decky>, APP-NAME=1, MSGID=NIL) around that inner syslog line. Both the collector parser (`parse_rfc5424`) and the correlation parser (`parse_line`) saw the outer NIL MSGID and emitted `event_type="-"` for every shell command — which: - kept `Attacker.commands` rows missing `command_text` - left R0001–R0030 (the pattern rule pack that matches shell commands) with no haystack - made `decnet.collector.log` show `event written … type=-` for the very lines that should be `type=command` Both parsers now detect the inner-RFC5424 shape (`<TS> <HOST> <APP> <PROCID> <MSGID> <rest>`) when the outer MSGID is NIL and the SD-arm is also NIL, and re-extract HOSTNAME / APP-NAME / MSGID / remainder from the body. The collector parser also recovers the post-SD msg tail when the SD block isn't `relay@55555` (the bash CMD line carries a `[timeQuality …]` block) so the kv-fallback can find `src_ip`. Mirroring tests in tests/collector and tests/correlation pin both the unwrap and the regression guard for non-double-wrapped lines.	2026-05-02 02:32:21 -04:00
anti	e08bfc4a73	fix(ttp): /api/v1/ttp/rules returns the live rule catalogue The endpoint was a contract-phase stub returning `[]` even though the RuleStore loaded all 58 YAML rules at worker startup. UI saw an empty table; operators couldn't tell whether anything was wired up. - `api_list_rules` now calls `get_rule_store().load_compiled()` and serializes each CompiledRule + its operational state into a RuleCatalogueRow. Sorted by rule_id for stable golden snapshots. - Add `description: str` to RuleSchema (pydantic) and CompiledRule (NamedTuple, defaulted) + propagate through `_compile_one` so the catalogue surfaces the human-readable YAML description, not just the slug-style `name`. - Update `tests/ttp/test_rule_engine.py` _fields assertion for the new column; new `tests/api/ttp/test_rules_catalogue.py` pins the catalogue contents (R0001/R0014 presence, row shape, sort order). Worker behaviour is unchanged: it was already loading rules correctly. This is purely a read-side wiring fix on the operator API.	2026-05-02 01:54:06 -04:00
anti	7ab0df3680	chore(cleaning): deleted swp vimfile	2026-05-02 01:39:17 -04:00
anti	ca1e04033c	docs(ttp): E.5 verification log appended to TTP_TAGGING.md Closes the CDD design phase. Records: - §E.1 contract inventory (every file exists, compileall clean). - Targeted pytest pass: 604 passed, 1 skipped, 10 xfailed (all xfails are `xfail(strict=True)` with reason= pointing to the impl step that flips them; carry-overs, not flakes). - Strict mypy over decnet/ttp + decnet/cli/ttp.py + decnet/web/router/ttp + decnet/web/db/sqlmodel_repo/ttp.py: clean. - Stranger-readability spot check on tests/ttp/: no doc bugs. Notes the three pre-E.4 wiring fixes (E.3.18a/b/c) and the E.4 backfill CLI / DEBT entries that landed in this series.	2026-05-02 01:37:45 -04:00
anti	7d1f048764	docs(ttp): E.4.b/E.4.c DEBT entries — provider review + Sigma deferral Quarterly TTP provider mapping review for AbuseIPDB / GreyNoise / abuse.ch (Feodo Tracker, ThreatFox) catalogue drift against `rules/ttp/R0054..R0058`, and the post-v1 trigger for the Sigma rule adapter. Both items reference TTP_TAGGING.md sections so the rationale stays linked to the design doc.	2026-05-02 01:35:49 -04:00
anti	301d3feee9	feat(ttp): E.4.a extract decnet/cli/ttp.py with worker run + backfill CLI The TTP worker entry moved out of decnet/cli/workers.py into its own module so the TTP CLI surface (worker + admin verbs) is colocated, mirroring decnet/cli/canary.py / webhook.py / swarm.py. - New `decnet/cli/ttp.py` with `decnet ttp` (worker, ExecStart-stable for decnet-ttp.service) and `decnet ttp-backfill --since-days N`. - `decnet ttp-backfill` walks Attacker.commands and CanaryTrigger history, dispatches each row through the live CompositeTagger, persists tags via repo.insert_tags (idempotent INSERT OR IGNORE). --dry-run / --source command\|canary\|all / --batch-size supported. - Backfill deliberately bypasses bus publish — historical replay must not re-trigger SIEM/webhook fan-out per TTP_TAGGING.md §"Bus topics" loop-prevention invariant. - Added `iter_attacker_commands_since` / `iter_canary_triggers_since` read-only iterators on TTPMixin + abstract bindings on BaseRepository. - Master-only via gating; both `ttp` and `ttp-backfill` listed in MASTER_ONLY_COMMANDS.	2026-05-02 01:35:17 -04:00
anti	e84b522fd3	feat(ttp): E.3.18c wire RuleEngine via RuleEngineTagger The canonical rule-based engine from §"Tagging engines, layered §1" of TTP_TAGGING.md was fully implemented but never instantiated as a composite child — pure pattern rules (R0014/R0017/R0023/... 23 rules total) had no tagger to dispatch them. - Add `RuleEngineTagger(Tagger)` adapter in rule_engine.py wrapping `RuleEngine.evaluate()`. `HANDLES = {command, http_request, auth_attempt, payload}` — the source kinds whose rules typically live outside any per-source lifter. - Adapter's `watch_store()` filters via `_is_engine_owned` so the engine's dispatch index excludes lifter-claimed rules (`match.kind: lifter:*`) and stays disjoint from per-lifter ownership. - Prepend `RuleEngineTagger` to the `CompositeTagger` lifter list so generic pattern rules dispatch before per-source cross-event logic. - Composes with E.3.18a (worker hydrates `watch_store`) and E.3.18b (worker fans session payloads into per-`command` events) — together these three commits make R0001–R0030 actually fire at runtime.	2026-05-02 01:29:58 -04:00
anti	65435f1427	feat(ttp): E.3.18b worker fans session-ended payloads into per-command events R0001–R0030 declare `applies_to: [command]` and match per command, not per session. The worker now translates one `attacker.session.ended` payload carrying a `commands: list` into: - one source_kind="session" event (behavioral / cross-event lifters) - one source_kind="command" event per command (RuleEngineTagger) Both string and dict command shapes are accepted; dicts contribute their `id` / `uuid` / `command_id` as the per-command source_id so the deterministic `compute_tag_uuid` keeps replays idempotent. Tags from session + per-command dispatch are aggregated into a single `ttp.tagged` envelope per upstream session.	2026-05-02 01:27:37 -04:00
anti	44ade3eb63	fix(ttp): E.3.18a worker hydrates per-lifter rule indexes via watch_store Each per-source lifter holds its own RuleIndex and exposes an `async watch_store()` that loads the corpus and drains store change events forever. Until this commit nothing called `watch_store()` in production — every dispatch index stayed empty and no rule fired. - Add `WatchableTagger` runtime-checkable Protocol in `decnet.ttp.base`. - `CompositeTagger.iter_watchables()` yields lifters that satisfy it. - `run_ttp_worker_loop` fans out one task per watchable, cancelled and awaited alongside pump/heartbeat/control in the existing finally. - Watch failures log and exit the watch task without taking the worker down — mirrors the pump-task tolerance contract.	2026-05-02 01:25:15 -04:00
anti	9a31d0e50c	feat(ttp): E.3.17 worker registration + scoped schemathesis suite Wires decnet-ttp as a first-class worker: * `decnet ttp` CLI command (master-only via MASTER_ONLY_COMMANDS) * deploy/decnet-ttp.service.j2 systemd unit (After= identity / intel / reuse-correlator workers; ProtectHome=read-only since FilesystemRuleStore only reads ./rules/ttp/) * deploy/decnet.target Wants= chain extended with decnet-ttp.service * `ttp` was already in web/worker_registry.KNOWN_WORKERS tests/api/test_schemathesis_ttp.py: TTP-routes-only schemathesis suite, filtered via the OpenAPI tags=["TTP Tagging"] annotation shared by the eight TTP routes. Reuses the live uvicorn subprocess the wider test_schemathesis spawns; max_examples=400 keeps the focused gate fast for E.3.13–E.3.16 iteration. wiki-checkout/Service-Bus.md committed in its own repo: ttp.tagged and ttp.rule.fired.<id> flipped from "reserved (TTP worker)" to "decnet.ttp.worker" now that the worker publishes them.	2026-05-01 21:26:46 -04:00
anti	07a609973b	feat(ttp): E.3.16 frontend TTP UI TTPsObservedSection.tsx: shared analyst-facing rollup. scope= identity drives /ttp/by-identity/{uuid} (primary, with Navigator export download); scope=attacker drives /ttp/by-attacker/{uuid} (per-IP slice). Tactic → technique tree in fixed UKC-aligned order, counts and confidence-weighted bars. Literal "NO TECHNIQUES OBSERVED YET" empty state per TTP_TAGGING.md §"UI surface — Empty state": no spinner, no fallback list. RuleStateControls.tsx: admin-only rule operational state panel backed by POST/DELETE /ttp/rules/{rule_id}/state. Server-gated by require_admin AND client-gated on /config?.role so a non-admin never sees the controls (per feedback_serverside_ui.md the client gate is UX, not security — the server returns 403 either way). Wired into Config.tsx as a new "TTP RULES" admin tab. Wired TTPsObservedSection into IdentityDetail (above fingerprints) and AttackerDetail (above TIMELINE). DeckyFleet/PersonaGeneration vocabulary throughout (logs-section / section-header / btn / matrix-text / dim-chip). tsc --noEmit and vite build clean. The dev-server browser smoke is deferred per the "can't reliably exercise UI from this harness" reality — typecheck + build is the correctness gate, not feature verification.	2026-05-01 21:05:28 -04:00
anti	403d83faba	feat(ttp): E.3.15 UKC bridge — production phase-handoff edge fires Add BaseRepository.list_ttp_decky_phases(identity_uuid) returning per-decky tag observations as (decky_id, tactic, created_at_ts) rows ordered by creation time. Rewrite from_identity_row() to project tactic → UKCPhase via tactic_to_ukc_phase and populate the four phase-handoff maps (first/last_phase_per_decky, first/last_seen_per_decky) so combined_campaign_weight finally lights up on real DB rows — not just synthetic fixtures. ConnectedComponentsCampaignClusterer.tick() pulls each active identity's per-decky phase observations before projecting features. Repo failures are non-fatal: a partial repo falls back to the empty phase-handoff signal (legacy behavior) so the worker stays up. tests/clustering/test_ttp_phase_handoff.py pins the production-row pair clearing CAMPAIGN_EDGE_THRESHOLD on a C2 → DISCOVERY hand-off — the trip-wire that says the whole project paid off. commands_by_phase_on_decky itself stays empty on the production path: it is consumed only by the synthetic-fixture similarity surface, and the phase-handoff edge does not use it. Synthetic fixtures still populate it directly via from_synthetic_identity.	2026-05-01 21:01:58 -04:00
anti	101127247e	feat(ttp): E.3.14 worker bootstrap (insert + ttp.tagged publish) Inner loop drains a per-process asyncio.Queue populated by one pump task per topic in _TOPICS, dispatches each event through CompositeTagger, persists via repo.insert_tags(), and publishes ttp.tagged + per-technique ttp.rule.fired.<id> only when the insert returned a non-zero rowcount. CompositeTagger seeded with all six lifters (Behavioral, Intel, CanaryFingerprint, Email, Identity, Credential). Loop-prevention invariant from TTP_TAGGING.md §"Bus topics" enforced: N replays of the same upstream event publish exactly one ttp.tagged event. test_worker_bus covers both the direct invocation path and the idempotency replay path. Intel catch-up via attacker.session.ended is intentionally deferred to E.3.14b — needs a session→intel join the repo doesn't expose yet.	2026-05-01 20:57:57 -04:00
anti	322fd44d72	feat(ttp): E.3.13 IdentityLifter + CredentialLifter (R0001-R0006) IdentityLifter owns lifter:identity_* — currently R0003 (password spraying). CredentialLifter owns lifter:credential_* — R0001 generic auth brute, R0002 password guessing, R0004 credential reuse, R0005 valid-account use, R0006 default credentials. YAMLs R0001/R0002/R0003/R0005/R0006 had their match.kind normalised to fit the lifter prefix scheme — the design doc's promised "YAMLs normalised in a separate refactor commit" lands here. Identity-rollup tags null out attacker_uuid on emit so the worked- example invariant holds (the tag belongs to the Identity, never to one member IP). Tests: test_identity_lifter.py + test_credential_lifter.py cover each predicate's positive/negative path, state modulation (disabled/clipped/expired), source-kind gating, and idempotent replay. test_lifter_absence and test_lifters updated for the new ctor signature.	2026-05-01 20:52:56 -04:00
anti	62ad76615e	docs(ttp): mark E.3.9-E.3.12 lifters done Records the RuleIndex extraction prerequisite, the lifter:<owner>_ prefix routing convention, per-provider technique fan-out logic for intel rules, the canary identity-merge guard rail, and the email PII allowlist + R0042 simhash requirement.	2026-05-01 20:31:47 -04:00
anti	7a89fbb357	feat(ttp): E.3.12 EmailLifter (R0041-R0048) SMTP message-level technique tagger per Appendix A.6: open relay abuse (rcpt_count + foreign From), mass phishing (rcpt_count + body simhash), phishing-kit X-Mailer, IDN/punycode URL, sender masquerade composite (From/Return-Path/DKIM/SPF), malicious attachment (macro/.lnk/.iso/.img/ hash match), BEC subject+body composite, encoded payload in body. PII discipline (TTP_TAGGING.md §'Hard parts §6') is enforced at the lifter layer via _filter_evidence(): emitted TTPTag.evidence is restricted to the EmailEvidence-allowed allowlist (body_sha256, matched_headers — names only, rcpt_domain_set — domains only, attachment_sha256s, rcpt_count) plus PII-safe match discriminators (matched_kit, matched_trigger, matched_url_host, etc). Raw addresses, raw body bytes, full URLs, and decoded base64 previews NEVER appear in evidence — defense-in-depth over the YAML evidence_fields hint. Tests: tests/ttp/test_email_lifter.py per-rule positive + negative + PII allowlist guard + state modulation. tests/ttp/rule_precision/ test_email_rules.py xfail flipped to real precision (R0041-R0048 H-band ≥95%). Corpus rows updated to acknowledge that R0045 (masquerade) co-fires with R0041 / R0047 when the sender-masquerade signals are present alongside open-relay or BEC patterns — overlap is by design, not a precision bug.	2026-05-01 20:31:03 -04:00
anti	f211d394e6	feat(ttp): E.3.11 CanaryFingerprintLifter (R0049-R0053) Browser-payload derivations per Appendix A.9: navigator.webdriver flag, canvas/audio/WebGL automation hash matches (Puppeteer/Playwright/ Selenium/curl-impersonate), WebRTC IP leak, TZ/language vs source-IP geo mismatch, navigator.platform vs userAgent vs WebGL renderer inconsistency. Evidence shape pinned to CanaryFingerprintEvidence (metric + matched_signature) — raw fingerprint blobs (canvas hashes, full UAs, navigator.platform values) explicitly NOT carried into TTPTag.evidence per TTP_TAGGING.md §'Hard parts §7' (enrichment vs tag boundary). The identity-merge guard rail is preserved: composite fp.id matches across IPs are NOT a TTP, so no rule fires on the bare hash. Tests: tests/ttp/test_canary_fingerprint_lifter.py per-rule positive + negative + evidence-shape guard + state modulation. tests/ttp/rule_precision/test_canary_rules.py xfail flipped to real precision (R0049/R0050/R0051/R0053 H-band ≥95%; R0052 M-band ≥80%).	2026-05-01 20:25:57 -04:00
anti	7865e71aa9	feat(ttp): E.3.10 IntelLifter (R0054-R0058) Per-provider verdict translator for AbuseIPDB, GreyNoise, Feodo Tracker, and ThreatFox per Appendix A.10. Each rule's predicate inspects payload fields produced by the enrich worker (no DB I/O, no decnet.intel.* imports — E.2.7 decoupling guard preserved). AbuseIPDB confidence is scaled by abuse_confidence_score / 100; categories drive per-technique fan-out. R0058 aggregate-bump is a no-op in v0 (cross-tag bump deferred to E.3.14 worker bootstrap). Per-provider null tolerance is the steady state — a missing provider column produces zero tags from that rule, never an error. Tests: - tests/ttp/test_intel_lifter.py — per-provider positive + negative + state modulation + decoupling source-import guard. - tests/ttp/rule_precision/test_intel_rules.py — xfail flipped, real precision driven over seed_intel.jsonl (R0054-R0057 H-band ≥95%; R0058 skipped as bump-only). - tests/ttp/test_lifter_absence.py — IntelLifter all-populated test flipped from xfail-strict to real assertion with realistic payload. - tests/ttp/test_lifters.py — partial-null xfail flipped to real assertion.	2026-05-01 20:23:42 -04:00
anti	eff3e4bce7	feat(ttp): E.3.9 BehavioralLifter (R0031-R0040) Reads pre-shaped session aggregates from TaggerEvent.payload and emits techniques per Appendix A behavior tables. Per-rule predicates dispatch on match.kind (lifter:behavioral_<name>); the lifter holds its own RuleIndex watching the same RuleStore as the engine, so disable / clip / TTL state reaches lifter-bound rules through the same atomic-swap path. R0032/R0036/R0037/R0040 YAMLs had over-escaped regex strings (\\ instead of \\) — fixed in place. Factory wired so default get_tagger() returns CompositeTagger with BehavioralLifter shipped; remaining three lifters (E.3.10-E.3.12) land in subsequent commits. E.2.6 contract preserved via TolerantTagger: empty payload steady-state yields [] with zero ERROR records. Disabled / clipped / expired state verified.	2026-05-01 20:17:59 -04:00
anti	321ea7a2a6	refactor(ttp): normalise lifter:<owner>_<name> match.kind prefix E.3.9.1 prerequisite. Rules R0031-R0040 now use lifter:behavioral_, R0041 (open_relay) uses lifter:email_open_relay; the rest of the email, canary, and intel cohorts already conformed. Each lifter at E.3.9-E.3.12 will claim its rules via str.startswith('lifter:<owner>_'), keeping the ownership routing explicit and trivially extensible. R0001-R0006 / R0030 lifter: rules are E.3.13 (Identity/Credential) territory and stay as-is.	2026-05-01 20:10:33 -04:00
anti	e7531ee756	refactor(ttp): extract RuleIndex from RuleEngine E.3.9.0 prerequisite for the per-source lifters (E.3.9-E.3.13). The dispatch index, install/evict/apply_change atomic-swap protocol, and state-modulation helpers (is_active / apply_ceiling) move out of rule_engine.py into _rule_index.py and _state.py. RuleEngine wraps a RuleIndex; back-compat shims preserve _by_kind / _by_rule / _install attribute access for tests poking at the dispatch internals. Lifters in E.3.9-E.3.12 will each hold their own RuleIndex, watching the same RuleStore via subscribe_changes() fan-out. Hot-reload semantics (disable / clip / TTL via set_state API) now reach lifter-bound rules through the same atomic-swap path the engine uses, not a future composite-rebuild compromise.	2026-05-01 20:09:18 -04:00
anti	b819dfefa3	feat(ttp): E.3.8 R0054-R0058 intel cohort + mark step done 5 YAMLs for the intel-verdict cohort per Appendix B / A.10: AbuseIPDB category mapping, GreyNoise classification, Feodo Tracker hit, ThreatFox IOC type, aggregate-malicious bump-only. IntelLifter (E.3.10) consumes by rule_id and tolerates absence silently (null provider column → no tag). R0058 is the meta bump-only rule — emits a single confidence=0.0 sentinel so it validates and surfaces in the catalogue, but the repository's sub-0.3 drop ensures no fresh tag persists if the fanout fires accidentally. test_intel_rules.py pins that zero-confidence invariant. Marks E.3.8 done in development/TTP_TAGGING.md with the cohort- split summary.	2026-05-01 09:22:48 -04:00
anti	dc1867315d	feat(ttp): E.3.8 R0049-R0053 canary fingerprint cohort 5 YAMLs for the canary-fingerprint cohort per Appendix B / A.9: navigator.webdriver flag, automation canvas/audio/WebGL hash match, WebRTC IP leak, TZ/lang vs geo mismatch, platform inconsistency. CanaryFingerprintLifter (E.3.11) consumes by rule_id. test_canary_rules.py: YAML-present + inert-in-v0 + xfail(strict) gated on E.3.11.	2026-05-01 09:21:01 -04:00
anti	1ad15470a1	feat(ttp): E.3.8 R0041-R0048 email cohort 8 YAMLs for the email cohort per Appendix B: open-relay abuse, mass phishing, phishing-kit X-Mailer signatures, IDN/punycode URLs, sender masquerade, malicious attachment, BEC, encoded payload in body. EmailLifter (E.3.12) consumes by rule_id. test_email_rules.py: YAML-present + inert-in-v0 + xfail(strict) precision case gated on E.3.12.	2026-05-01 09:19:56 -04:00
anti	806301e179	feat(ttp): E.3.8 R0031-R0040 behavioral cohort 10 YAMLs for the behavioral / cross-event cohort per Appendix B: beaconing, data destruction, ransom note, web exfil, DB mass-read, credentials-in-files, k8s SA token harvest, Docker host escape, LLMNR poisoning, TFTP router-config retrieval. Every rule is lifter-bound (BehavioralLifter / IdentityLifter) — the v0 RuleEngine cannot count, aggregate, or compose cross-event signals, so these YAMLs declare the technique mappings the lifter will consume by rule_id at E.3.9. Their match specs use a 'kind: lifter:*' shape inert to the regex matcher. test_behavioral_rules.py asserts each YAML compiles, none fire from the v0 engine (FP regression guard against a YAML drifting into a regex), and an xfail(strict=True, reason='impl phase E.3.9') precision case that will flip green when the lifter lands.	2026-05-01 09:18:27 -04:00
anti	b1fe1f9403	feat(ttp): E.3.8 R0001-R0030 command cohort 30 YAMLs for the shell/command rule cohort per Appendix B (rules/ttp/). Splits into engine-active (R0007-R0029, regex on command_text / raw_url / user_agent) and lifter-bound (R0001-R0006, R0030 — the v0 RuleEngine cannot count auth attempts, do identity rollups, or parse fingerprint blobs; the BehavioralLifter / IdentityLifter / CredentialLifter consume them by rule_id at E.3.9 / E.3.13). test_command_rules.py asserts: - every R000N has a YAML that compiles - lifter-bound rules NEVER fire from the v0 engine (regression guard against a YAML drifting into a regex match.spec) - engine-active rules meet their Appendix-C precision target against the seed corpus (≥0.95 high-conf, ≥0.80 medium) Conftest fixes: precision_engine moved to module-scope so module- scope precomputed dispatch fixture (fired_by_label) can request it; _RULES_DIR path bumped from parents[2] to parents[3] so the loader resolves the project root regardless of pytest cwd; make_event synthesizes attacker_uuid so TTPTag's anchor invariant is satisfied. Seed corpus broadened: positive examples for every regex rule plus 6 negative examples across innocuous shell verbs (ls, echo, cd, ps, df, free) so FPs surface in precision rather than passing vacuously.	2026-05-01 09:16:38 -04:00
anti	c635478442	feat(ttp): E.3.8 corpus + harness — labelled holdout fixture Sub-step preceding the rule-pack commits per TTP_TAGGING.md:2967. Adds the per-rule precision suite scaffolding under tests/ttp/rule_precision/: - conftest.py: precision_engine fixture (RuleEngine populated from ./rules/ttp/), corpus_loader (real → seed → empty fallback), precision_for() helper for TP/FP accounting. - _build_corpus.py: extractor for a real prod corpus pull. Mandatory --exclude-ip / DECNET_TTP_CORPUS_EXCLUDE_IPS — operator IPs never end up in the committed exclusion list. Pulls both 'command' and 'unknown_command' event types. - corpus/seed_.jsonl: synthetic seed rows for each cohort so the harness exercises in clean checkouts. - corpus/.jsonl (operator-built) is gitignored. - test_corpus_loads.py: sentinel that every seed file parses.	2026-05-01 09:08:07 -04:00
anti	ed3f340ea8	feat(ttp): E.3.7 RuleEngine — evaluate + atomic-swap watch_store Implements the rule engine body left empty at contract phase: evaluate() dispatches by source_kind through self._by_kind, runs the rule's match spec against event.payload, and emits one TTPTag per emits entry. watch_store() loads the initial corpus from RuleStore.load_compiled, then drains subscribe_changes, applying definition changes via single-statement dict assignment (atomic swap, GIL-atomic to readers) and state changes via NamedTuple._replace on the existing CompiledRule. Why: with the FS + DB stores in place (E.3.5/E.3.6), the engine is the last piece of the rule plane. Lifters (E.3.9–E.3.13) consume the engine; the worker bootstrap (E.3.14) wires watch_store into the asyncio event loop. After this commit a CompositeTagger constructed with a RuleEngine + a populated rules dir will produce real tags. Notes: - CompiledRule.emits extended to 4-tuple (technique_id, sub_technique_id, tactic, confidence). Tactic + confidence ride per-emit so a single rule can carry multiple precision targets (the "one event maps to many techniques" property). Compile helpers in both backends extract them from the YAML emits dict; missing tactic or confidence is a deploy-time error. - v0 match operator is "pattern" (regex). The field defaults per source_kind (command_text / raw_url / subject / verdict / …) and is overridable via match.field. Future ops (contains, equals, in_set) extend _match_event without touching the engine surface. - Confidence model: rules with state="clipped" + confidence_max set cap the per-emit confidence downward; clipped is a soft suppress, not a hard skip. Disabled rules are skipped wholly; expires_at past is re-checked at evaluate as defense-in-depth (the store auto-reverts, but a racing read between expiry and revert must not fire the rule). - _span(name, **attrs) helper in engine + both stores short-circuits on decnet.telemetry._ENABLED — matches the project's @traced / wrap_repository zero-overhead-when-disabled pattern instead of relying solely on the no-op tracer indirection. - Late-bound tracer (telemetry.get_tracer called per-span, not at module load) so test_tracing's monkeypatch reaches the production code path. xfails flipped: tests/ttp/test_rule_engine.py multi-emit fan-out + rule_version-collision-via-engine; tests/ttp/test_multi_mapping.py N×M engine fan-out + idempotent replay; tests/ttp/test_tracing.py ttp.eval span hierarchy + ttp.rule.fire span attributes. Tests: 214 passed, 19 xfailed (gated on E.3.8 lifters / rule pack / worker bootstrap). mypy: clean on prod code; pre-existing test-stub arg-type warnings unchanged.	2026-05-01 08:49:15 -04:00
anti	8a93ee3129	feat(ttp): E.3.6 DatabaseRuleStore — ttp_rule/ttp_rule_state + master sync Implements the DB-backed rule store body left empty at contract phase: load_compiled reads from ttp_rule + ttp_rule_state; get_state / set_state hit ttp_rule_state with the same expires_at auto-revert and bus-event semantics as the FS backend; subscribe_changes returns a per-subscriber queue. State persists across process restarts — the swarm property the FS backend deliberately doesn't have. Also lands two swarm-mode helpers: - sync_from_filesystem(fs_store) — master-side, subscribes to a FilesystemRuleStore and projects each RuleChange onto a ttp_rule upsert/delete. - tail_db(poll_interval) — worker-side, watermark poll over ttp_rule.updated_at; emits RuleChange("definition", ...) for each row that moved. Why: swarm mode needs rule definitions and operator state to propagate across hosts. The filesystem backend (E.3.5) was the single-host-dev variant; this one survives restart and serves N workers from a shared DB. Notes: - DatabaseRuleStore() with no args lazy-inits an in-memory SQLite repo so the conformance fixture works without test plumbing. In production the worker bootstrap (E.3.14) passes an explicit repo. - The conftest.py rule_store fixture became async (pytest_asyncio), per-backend creates/initializes a SQLite repo for the DB run. - Adds a `seed_rule(store, rule_id, yaml)` helper to bridge backend semantics: drop a YAML file (FS) vs insert a ttp_rule row (DB). Used by the parametrized load_compiled conformance test. - Late-bound _tracer() in both backends (was module-level get_tracer binding) so test_tracing's monkeypatch of decnet.telemetry.get_tracer actually affects span output. xfails flipped: tests/ttp/store/test_database.py set_state-writes-to- ttp_rule_state + filesystem-to-DB sync; tests/ttp/store/test_conformance.py DB-side load_compiled / set_state isolation / round-trip / per-rule fan-out / expired-state revert / set_state failure / get_state default (was xfail-only-on-DB); tests/ttp/test_tracing.py set_state span hierarchy. Tests: 208 passed, 25 xfailed (gated on E.3.7 + lifters). mypy: clean on all touched files.	2026-05-01 08:39:46 -04:00
anti	f41995a229	feat(ttp): E.3.5 FilesystemRuleStore — inotify hot-reload + per-rule events Implements the filesystem-backed rule store body left empty at contract phase: YAML parse + Pydantic validation, asyncinotify watch over ./rules/ttp/, in-process state cache with auto-revert on expires_at, and a subscribe_changes() async iterator yielding one RuleChange per per-rule edit. Bus topic builders ttp_rule_reloaded / ttp_rule_state ship alongside. Why: the rule plane needed a store before the engine (E.3.7) could consume RuleChange events and atomically swap compiled rules into its dispatch index. Notes: - Linux-only by construction (asyncinotify wheel gated by sys_platform marker; FilesystemRuleStore.__init__ raises on non-Linux). - Filename allowlist is the FIRST check on every inotify event. - Content-hash dedup so a single write firing IN_CREATE + IN_CLOSE_WRITE produces exactly one RuleChange. - All compile work serializes on a single asyncio.Lock. - Subscribers register their queue eagerly so events fired between subscribe_changes() and the first __anext__() are buffered. xfails flipped: per-save-style + filter-ordering + atomic-swap in test_filesystem.py; load_compiled / set_state isolation / round-trip / per-rule fan-out / expired-state revert / set_state failure semantics in test_conformance.py (FS side; DB side stays xfail until E.3.6); malformed-YAML compile-time check in test_rule_engine.py. Tests: 197 passed, 35 xfailed (gated on E.3.6 / E.3.7 / lifters). mypy + bandit: clean on all touched files. Wiki update for the per-rule reload + state-change topics lands in a matching wiki-checkout/Service-Bus.md edit (separate repo).	2026-05-01 08:31:05 -04:00
anti	89ce893792	feat(ttp): E.3.4 API handlers wired to repo (rollups + Navigator) Five GET rollup endpoints (techniques, by-identity, by-attacker, by-campaign, by-session) and the Navigator export (fleet + per-identity) now call into the TTPMixin methods. Rule catalogue endpoint still returns [] — backed by the RuleStore which lands at E.3.5/E.3.6.	2026-05-01 08:06:53 -04:00
anti	fee697694d	feat(ttp): E.3.3 repository — insert_tags + listing rollups (dual backend) Dialect-split: portable rollup queries on TTPMixin; bulk insert with ON CONFLICT DO NOTHING / INSERT IGNORE in the per-dialect repos. Confidence-floor (< 0.3) drop applied at mixin layer before the dialect hook. BaseRepository now declares the six TTP methods abstract. Tests in tests/web/db/test_ttp_repo.py flipped from pytest.fail stubs to real dual-backend behavioral tests; tests/ttp/test_confidence.py drop-below-floor xfail removed.	2026-05-01 08:04:46 -04:00
anti	226b3adfa2	docs(ttp): mark E.3.1 + E.3.2 done — schema/bus verification	2026-05-01 07:57:38 -04:00
anti	3664ea7008	docs(ttp): mark E.2.9–E.2.14b as done in design doc Each section gets a Status: ✅ done block summarising what's GREEN today vs xfail-gated and noting any divergence from the doc's original wording (E.2.9 lossy observable phases; E.2.13 db_backends fixture landed alongside; E.2.14a Jaeger-skip + tracing-enabled plumbing; E.2.14b NamedTuple AttributeError vs FrozenInstanceError).	2026-05-01 07:47:01 -04:00
anti	0217319423	test(ttp): E.2.14b RuleStore conformance — cross-backend + filesystem-specific + database-specific tests/ttp/store/conftest.py — parametrized rule_store fixture over FilesystemRuleStore (skipped on non-Linux) + DatabaseRuleStore. test_conformance.py — shared assertions (default-state, set_state isolation/round-trip, subscribe_changes per-rule fan-out, expires_at auto-revert, set_state failure semantics) parametrize over both. get_state-default GREEN today on FS (returns RuleState() for empty cache); rest xfail-gated behind E.3.5/E.3.6. test_filesystem.py — inotify mask + canonical kernel values + 9 scratch-filename rejections + 4 valid-filename acceptances + fullmatch anchor + tmp_path construction + CompiledRule frozen property GREEN today; per-save-style + filter-ordering + atomic-swap concurrency xfail-gated. test_database.py — class-level surface (no platform guard, ABC methods concrete, async coroutines) GREEN today; ttp_rule_state write + filesystem→DB sync xfail-gated behind E.3.6.	2026-05-01 07:45:32 -04:00
anti	bf5414c0d1	test(ttp): E.2.14a follow-up — force DECNET_DEVELOPER_TRACING=true, skip when Jaeger unreachable Session-scoped autouse fixture in tests/ttp/conftest.py sets DECNET_DEVELOPER_TRACING=true and forces decnet.telemetry._ENABLED so the no-op tracer doesn't silently swallow emitted spans. The span_exporter fixture also monkeypatches decnet.telemetry.get_tracer so production code under test lands spans in the in-memory exporter. Tracing tests skip when DECNET_OTEL_ENDPOINT (default localhost:4317) isn't reachable so the dev loop stays green without lying about coverage.	2026-05-01 07:42:22 -04:00
anti	f4fe6fe6e4	test(ttp): E.2.14a observability tracing — span hierarchy + no-PII property In-memory span exporter fixture wired to a per-test TracerProvider (OTEL global is locked once set, so each test gets its own). ttp.eval / ttp.lifter.{name} / ttp.rule.fire / ttp.rule.state.change hierarchy + no-PII canary battery xfail-gated behind E.3.5–E.3.13.	2026-05-01 07:40:58 -04:00
anti	4a93e16407	test(ttp): E.2.13 repository tests — TTPMixin idempotency + identity-rollup projection on dual backends Adds tests/web/db/conftest.py with a db_backends fixture parametrizing SQLite (always) + MySQL (gated on DECNET_TEST_MYSQL_URL). Surface assertions (mixin methods present + async) GREEN today; insert_tags idempotency, identity rollup projection, attacker-rollup exclusion of NULL-attacker tags xfail-gated behind E.3.3.	2026-05-01 07:39:16 -04:00
anti	6814949bc0	test(ttp): E.2.12 worker bus integration — _TOPICS equality, loop-prevention, delivery asymmetry Pin _TOPICS frozenset against documented set (single source of truth). Worker→engine invocation, loop-prevention invariant, attacker.enriched/email.received catch-up asymmetry xfail-gated behind E.3.14.	2026-05-01 07:37:58 -04:00
anti	c276b5696e	test(ttp): E.2.11 multi-mapping property — N×M fan-out, idempotent UUID, replay-safety Hypothesis property: N rule_ids × M technique_ids on one event yield N×M distinct tag UUIDs. Worked example pinned: one rule emitting (T1110, None) and (T1078, None) → two distinct UUIDs. Engine-level fan-out + replay xfail-gated behind E.3.7.	2026-05-01 07:36:19 -04:00

1 2 3 4 5 ...

1101 Commits