DECNET

Author	SHA1	Message	Date
anti	a0aeba5abc	refactor(db): extract FleetMixin and promote JSON helpers Moves the 6 fleet-decky methods (incl. cross-source list_running_deckies aggregator) into sqlmodel_repo/fleet.py. _serialize_json_fields and _deserialize_json_fields move to _helpers.py since they're shared across fleet, topology, and canary.	2026-04-28 14:50:01 -04:00
anti	d989cd0461	refactor(db): extract WebhooksMixin Moves the 9 webhook-subscription methods (CRUD + delivery bookkeeping) into sqlmodel_repo/webhooks.py.	2026-04-28 14:47:42 -04:00
anti	167f140b0e	refactor(db): extract BountiesMixin Moves the 5 bounty methods plus the cross-table purge_logs_and_bounties helper into sqlmodel_repo/bounties.py.	2026-04-28 14:46:39 -04:00
anti	c6804d79b6	refactor(db): extract DeckiesMixin Moves the 4 decky-shard CRUD methods into sqlmodel_repo/deckies.py.	2026-04-28 14:45:15 -04:00
anti	eebf9e4c97	refactor(db): extract AuthMixin Moves the 7 user CRUD methods into sqlmodel_repo/auth.py. _ensure_admin_user stays in __init__.py so DECNET_ADMIN_PASSWORD remains addressable at the module path tests already monkeypatch.	2026-04-28 14:43:49 -04:00
anti	99adbebe75	refactor(db): extract SwarmMixin Moves the 7 swarm-host CRUD methods into sqlmodel_repo/swarm.py.	2026-04-28 14:42:58 -04:00
anti	85c914e754	refactor(db): extract AttackerIntelMixin Moves upsert_attacker_intel, get_attacker_intel_by_uuid, and get_unenriched_attackers into sqlmodel_repo/attacker_intel.py. Composed onto SQLModelRepository via mixin inheritance.	2026-04-28 14:40:36 -04:00
anti	e16f47ad24	refactor(db): extract _safe_session/_detach_close to _helpers.py Module-level session helpers move into sqlmodel_repo/_helpers.py. __init__.py re-exports them so external import paths (decnet.web.db.sqlmodel_repo._safe_session) keep resolving.	2026-04-28 14:38:26 -04:00
anti	4167345d51	refactor(db): convert sqlmodel_repo.py to a package Pure rename — the old monolithic 3505-line file becomes decnet/web/db/sqlmodel_repo/__init__.py. No code changes. Subsequent commits will extract per-domain mixins out of __init__.py to mirror the topical layout used by decnet/web/db/models/.	2026-04-28 14:37:18 -04:00
anti	6d8c90777d	chore: remove vulture-flagged dead code, add whitelist - plain.py: drop `or True` short-circuit + unreachable return; drop now-unused _HASH_HINTS - ingester.py: drop unused `current_position` param from _flush_batch - vulture_whitelist.py: document remaining false positives (FastAPI Depends side-effects, IMAP uid_mode where UID==seq)	2026-04-28 14:30:12 -04:00
anti	72cc928ebf	feat(prober-cert): roll up fingerprints onto AttackerIdentity Brings the federation-gossip columns on AttackerIdentity to life — ja3_hashes, hassh_hashes, and the new tls_cert_sha256 — by projecting the union of every member observation's fingerprints JSON onto the identity at clusterer create / link / merge time. - decnet/profiler/identity_rollup.py: pure extract_fp_summaries() reads the production bounty shape (payload.fingerprint_type + payload.{ja3,hash,cert_sha256}) and returns deduped+sorted JSON list[str] per family, or None when a family has no signal so the column stays NULL instead of '[]'. - BaseRepository.update_identity_fingerprints + SQLModel impl: one idempotent write that overwrites the three summary columns and bumps updated_at. - ConnectedComponentsClusterer: after every per-component reconciliation (fresh-create OR existing-merge+link), recomputes and writes the rollup for the target identity. Wrapped in a best-effort helper so a write failure logs but never breaks the tick. - Tests: extract_fp_summaries unit (dedup, sort determinism, unknown types ignored, malformed JSON, nested-stringified payloads, non-string values); end-to-end clusterer ticks populate the columns on create + on later observation links; no-fingerprint clusters keep the columns NULL.	2026-04-28 11:28:54 -04:00
anti	4749c972e5	feat(prober-cert): schema for active TLS cert capture Adds storage for TLS certificate details collected from attacker-run servers by the active prober (sibling to the existing JARM probe). - AttackerIdentity.tls_cert_sha256 / Campaign.tls_cert_sha256: JSON list[str] columns mirroring ja3_hashes / hassh_hashes for federation gossip. - ingester clause 9b: emits a 'tls_certificate' fingerprint bounty when a prober event carries subject_cn (disjoint from the existing sniffer-gated clause). - Prober-side capture (ssl.wrap_socket follow-up after JARM) and profiler rollup land in sibling commits.	2026-04-28 11:09:25 -04:00
anti	ccc8619387	fix(test-schemathesis): disable rate limiter in fuzz subprocess Schemathesis fires up to 3000 examples per endpoint. POST /auth/login caps at 10/5min per IP, so the second example onward returns 429 and the positive_data_acceptance check flags it as RejectedPositiveData (its allowed-status list is hardcoded in schemathesis to 2xx/401/403/404/409/5xx, so OpenAPI tweaks can't fix it). DECNET_LIMITER_ENABLED=false exists for exactly this case (see limiter.py docstring on stress/load testing). Reverts the custom_openapi shim from `5d88346` / `9b1168c` — the endpoint already declares 429 in its responses= map (api_login.py:38), and the shim turned out to address a problem that wasn't there. Drop the companion test along with it.	2026-04-28 09:51:49 -04:00
anti	9b1168ce0b	fix(api): scope 429 OpenAPI injection to rate-limited routes Previous commit advertised 429 on every operation. Only routes decorated with @limiter.limit can actually return slowapi's 429 — currently just POST /api/v1/auth/login. Documenting it elsewhere is dishonest and would mislead clients into expecting a response the server cannot produce. Walk slowapi's _route_limits / _dynamic_route_limits registries to identify decorated endpoints, match them to FastAPI routes by {module}.{name}, and only inject 429 on those. Existing per-route 429 declarations (e.g. SSE connection-cap on events streams via sse_limits) are untouched.	2026-04-28 01:00:34 -04:00
anti	5d883466a2	fix(api): advertise 429 on every operation in OpenAPI SlowAPI middleware can short-circuit any request with 429 once a per-route or per-IP rate limit fires (e.g. POST /api/v1/auth/login is capped at 10/5min). The OpenAPI spec did not declare 429 on any operation, so schemathesis flagged legitimate rate-limit responses as RejectedPositiveData / status-code-nonconformance failures. Override app.openapi to inject a generic 429 response object on every HTTP operation in the generated schema. Add a contract test that fails if any operation drops the 429 advertisement.	2026-04-28 00:58:37 -04:00
anti	5415e98458	sec(api): mode-gate and eager-load JWT secret in lifespan Refuse to start decnet.web.api when DECNET_MODE=agent (unless the operator explicitly opts into dual-role with DECNET_DISALLOW_MASTER= false). The Typer CLI already hides master-only commands on agents, but a misconfigured systemd unit or a direct uvicorn invocation would bypass that — now the lifespan itself refuses, before any worker, DB or bus comes up. Resolve DECNET_JWT_SECRET eagerly at startup so a missing or known- bad value fails at boot rather than on the first auth-gated request. The lazy-load shape stays useful for non-master CLIs.	2026-04-27 21:26:03 -04:00
anti	1a7da33375	sec(env): refuse to start master API with footgun public-binding config Add validate_public_binding() called from the master API lifespan: when DECNET_API_HOST is non-loopback, refuse to start if DECNET_CORS_ORIGINS still contains a loopback origin (catches the "operator flipped to 0.0.0.0 to make it work and forgot to update CORS" footgun) or if DECNET_CANARY_HTTP_BASE is plaintext http:// to a non-loopback host. Log CRITICAL when DECNET_LIMITER_ENABLED=false on a public binding. The validator no-ops under pytest so unrelated suites don't trip on it. Add DECNET_VERIFY_HOSTNAME env knob; AgentClient and UpdaterClient consult it when verify_hostname is None, giving production deploys TLS hostname verification on top of the existing CA + fingerprint pin. Default off so dev enrollments with mismatched SANs keep working.	2026-04-27 21:15:15 -04:00
anti	2cc60bd677	feat(realism): operator-tunable planner weights via realism_config New realism_config table (uuid PK + unique key) + two repo methods (get/set) backs an admin-only GET/PUT /api/v1/realism/config surface. The planner now exposes apply_payload(payload) / current_payload() / reset_to_defaults() and reads its weights through mutable module globals; pick() resolves the live values each call. Validation catches negative weights, zero totals, out-of-range canary_probability, unknown content_class names, and silently drops cross-list entries (canary class on the user list, etc). The orchestrator worker calls _refresh_realism_config(repo) on startup and every 5 ticks (~5min at 60s interval). Operator changes land within one refresh window with no bus signal — the simpler path for a knob whose latency tolerance is minutes.	2026-04-27 18:00:08 -04:00
anti	da3c35c6a4	fix(realism): synthetic_files path fits MySQL utf8mb4 index cap The (decky_uuid VARCHAR(64), path VARCHAR(1024)) UNIQUE constraint generated a 4352-byte composite key under utf8mb4 (4 bytes/char), busting MySQL's 3072-byte cap and crashing decnet api on init with: Specified key was too long; max key length is 3072 bytes Tighten path to VARCHAR(512) — (64+512)*4 = 2304 bytes, well under the cap. Real realism + canary placement paths are short (/home/<persona>/Documents/<file>, ~70 chars); 512 keeps headroom without the index hassle. Pre-v1, no migration helper. Adds a regression test pinning the (decky_uuid + path) byte budget so a future widening fails loudly in CI rather than at MySQL deploy time.	2026-04-27 17:55:35 -04:00
anti	87cb61c8b2	feat(realism): synthetic-files browser API Adds GET /api/v1/realism/synthetic-files (paginated list, filters by decky_uuid, persona, content_class) and GET /api/v1/realism/synthetic-files/{uuid} (single row with last_body and a truncated:bool flag set when the stored body is at the 64KB cap). Repo gains count_synthetic_files() and get_synthetic_file(uuid). The list view drops last_body to keep the wire payload bounded; the detail endpoint is the only path that returns it. Read-only — orchestrator remains the sole writer.	2026-04-27 17:44:53 -04:00
anti	7e9bc6d49a	refactor(realism): enforce synthetic_files 64KB cap at the repo The orchestrator worker clipped last_body at write time, but the repo didn't enforce. A future caller that forgot the clip would write the full body. Move the clip to record_synthetic_file and update_synthetic_file via SYNTHETIC_FILE_BODY_LIMIT in decnet/web/db/models/realism.py. Worker now passes the full body and trusts the repo. Tests retargeted to assert repo enforcement.	2026-04-27 17:37:36 -04:00
anti	32eeb0c813	refactor(orchestrator): collapse decnet-emailgen.service into orchestrator Stage 5 of the realism migration. Email generation is no longer a separate worker / systemd unit / CLI subcommand — the orchestrator's single tick loop covers SSH traffic, file plants, and email drops. Going from 21 services to 20. Worker: - _one_tick rolls between traffic / file / email (45/45/10 weights). The 10% email weight at a 60s orchestrator interval produces ~one email per 10 minutes, close to the pre-collapse 5-minute cadence. - get_driver_for(action) (stage 4) handles SSH vs Email dispatch. - Quiet branches fall through so a (decky-set, persona-pool, mail-decky) shape that silences one branch doesn't waste the tick. - Periodic prune covers both orchestrator_events and orchestrator_emails tables. Deletions: - deploy/decnet-emailgen.service.j2 - decnet/orchestrator/emailgen/worker.py - decnet/cli/emailgen.py - tests/orchestrator/emailgen/test_worker_integration.py Renames (history-preserving): - decnet/web/router/emailgen/ -> decnet/web/router/realism/ - tests/api/emailgen/ -> tests/api/realism/ - tests/cli/test_emailgen_* -> tests/cli/test_realism_* Public surface changes (clean break, pre-v1): - API URL /api/v1/emailgen/personas -> /api/v1/realism/personas - CLI `decnet emailgen import-personas` -> `decnet realism import-personas`. `decnet emailgen run` is gone — the orchestrator covers it. - gating.py: emailgen master-only group replaced by realism. - decnet-orchestrator.service.j2: DECNET_REALISM_* env block added. - decnet.target: decnet-emailgen.service entry removed. - frontend: PersonaGeneration.tsx fetches /realism/personas.	2026-04-27 16:33:04 -04:00
anti	cb1872c52f	feat(realism): synthetic_files table + planner wiring + scheduler swap Stage 3 of the realism migration. Replaces orchestrator/scheduler.py's hardcoded _FILE_TEMPLATES/_USERS (3 templates emitting epoch-suffixed filenames like notes-1777315854.txt with identical bodies per template) with a persona-driven realism engine. New surface: - SyntheticFile SQLModel (synthetic_files table, UNIQUE on decky_uuid+path) — per-(decky, path) state for the future edit-in-place flow. Pre-v1, no _migrate_* helper. - BaseRepository methods: record_synthetic_file, update_synthetic_file, list_synthetic_files, pick_random_synthetic_file_for_edit (used by stage 3b). - realism/naming.py: per-content-class filename templates, persona-conditioned. /var/log/cron.log + logrotate skeleton for system-class; /home/<persona>/TODO.md, scratch.md, etc. for user-class. Anti-regression test pins "no 8+ digit decimals in basenames" (the realism failure today). - realism/bodies.py: deterministic body templates per content_class. TODO body uses checkbox markdown, script body has a shebang, cron body matches syslog cron shape ("CRON[PID]: (user) CMD (...)"). - realism/planner.py: pick(deckies, now, rng) returns a Plan. Diurnal-gated, weighted user/system content split (70/30 user bias). Create-only in stage 3; edit branch lands in stage 3b. Scheduler split: - scheduler.pick is now traffic-only (sync). - scheduler.pick_file is async, takes a repo, resolves personas (Topology.email_personas for topology-source deckies; global realism.personas_pool otherwise), and maps Plan -> FileAction. - FileAction gains persona/content_class/mtime fields. Worker: - _one_tick rolls 50/50 between traffic and file each tick. After a successful FileAction plant, _record_synthetic_file persists or patches the synthetic_files row (catching the unique-constraint collision on re-plant of the same path). - SSHDriver._run_file passes action.mtime through to plant_file so files don't all stamp at wall-clock-now.	2026-04-27 16:22:07 -04:00
anti	0b9873982d	refactor(realism): move emailgen LLM/personas/prompt into shared library Lift the format-agnostic pieces from decnet/orchestrator/emailgen/ into the new decnet/realism/ library so file-class content generation (stage 3 of the realism migration) can reuse them. Email-specific delivery (RFC 2822 EML, IMAP/POP3 spool, thread chains) stays in orchestrator/. Renames (history-preserving git mv): emailgen/personas.py -> realism/personas.py emailgen/prompt.py -> realism/prompts/email.py emailgen/global_pool.py -> realism/personas_pool.py emailgen/llm/ -> realism/llm/ Env-var clean break (pre-v1, no aliases): DECNET_EMAILGEN_LLM -> DECNET_REALISM_LLM DECNET_EMAILGEN_MODEL -> DECNET_REALISM_MODEL DECNET_EMAILGEN_TIMEOUT -> DECNET_REALISM_TIMEOUT DECNET_EMAILGEN_PERSONAS -> DECNET_REALISM_PERSONAS DECNET_EMAILGEN_FAKE_OUTPUT -> DECNET_REALISM_FAKE_OUTPUT Importers rewritten in: orchestrator/emailgen/scheduler.py, orchestrator/drivers/email.py, web/router/{emailgen,topology}/ api_personas.py, cli/emailgen.py. Tests for moved modules relocated to tests/realism/; tests for stay-put modules updated in place. API URL `/api/v1/emailgen/personas` and CLI `decnet emailgen import-personas` keep their public names until the service-collapse commit (stage 5).	2026-04-27 16:05:43 -04:00
anti	6c4ea706f8	feat(api): canary token CRUD router (/api/v1/canary) + tests Two sub-routers under /api/v1/canary: blobs (operator-uploaded artifacts, deduped by sha256): - POST /blobs (multipart upload; admin) - GET /blobs (list with token_count; admin) - DELETE /blobs/{uuid} (refcount-aware; 409 when referenced; admin) tokens (per-decky planted artifacts): - POST /tokens (generate or instrument + plant; admin) - GET /tokens?decky_name=&kind=&state= (filter; viewer) - GET /tokens/{uuid} (detail; viewer) - GET /tokens/{uuid}/preview (instrumented bytes; admin) - GET /tokens/{uuid}/triggers (paged callback log; viewer) - DELETE /tokens/{uuid} (revoke + bus event; admin) XOR validation: exactly one of blob_uuid / generator must be set. Path validation rejects relative/NUL/newlines/.. segments. Every body-bearing route documents 400 plus 401/403/404 as applicable. Stdlib MIME sniffer (no python-magic dep) covers PNG/JPEG/GIF/PDF/ HTML/XML/DOCX/XLSX/JSON/YAML/TOML/text/plain; everything else falls through to passthrough. Tests run end-to-end through the live FastAPI app (planter docker exec is patched); 17 cases covering dedup, refcount, lifecycle, XOR validation, path validation, and 404 paths.	2026-04-27 13:18:00 -04:00
anti	6a0d140e91	feat(db): canary token repository CRUD Adds the abstract surface on BaseRepository and the SQLModel-backed implementation (shared by SQLite and MySQL) for: - canary blobs (upsert-by-sha256, list-with-refcount, refcount-aware delete) - canary tokens (create, slug lookup, list with filters, state update) - canary triggers (record+bump-counters atomically, list, attribute) The triggers path is a single session that inserts the row and bumps the parent token's counters together, so a subscriber that reads the token right after the bus event sees the updated count. Blob delete refuses while any token (including revoked) still references the blob; pre-v1 revoked tokens stick around for forensic value.	2026-04-27 12:48:24 -04:00
anti	813f14bf2a	feat(db): canary token tables (blob/token/trigger) Three new tables for the canary tokens feature: - canary_blobs — operator-uploaded source artifacts, deduped by sha256 - canary_tokens — one planted artifact in one decky; carries the callback slug, generator/instrumenter, and lifecycle - canary_triggers — append-only log of every callback hit; attacker_id back-filled by the correlator Pydantic request/response shapes live in the same file per the single-source-of-truth convention. No migrations file — pre-v1 SQLModel.metadata.create_all() covers it.	2026-04-27 12:45:41 -04:00
anti	f046634d6e	feat(web): Persona Generation page under AUTOMATION New dashboard surface for editing the global emailgen persona pool — the JSON file fleet (MACVLAN/IPVLAN) and SWARM-shard mail deckies pull from. MazeNET topology personas are out of scope here; they're configured per-topology in the topology editor. Backend: * GET/PUT /api/v1/emailgen/personas — admin-write, viewer-read. PUT validates with the same Pydantic schema the worker uses (parse_personas), drops invalid entries with a warning, returns 400 only when the entire payload fails. Path is operator-discoverable on every response so a CLI-driven backup workflow stays visible. Frontend: * PersonaGeneration.tsx + .css — table + add/edit modal with the full EmailPersona schema (name, email, role, tone, mannerisms list, language, signature, active hours, reply latency, uses_llms_heavily). Local edits are batched; explicit "SAVE CHANGES" writes back, with a dirty-indicator pill and a "DISCARD" reset. Email uniqueness is enforced client-side so the scheduler never picks the same persona as both sender + recipient. * Sidebar AUTOMATION group gains a "Persona Generation" entry next to Orchestrator; route registered at /persona-generation. The worker reads the same on-disk file the API writes — see decnet.orchestrator.emailgen.global_pool. The API resets the in-process cache on every read/write so the worker picks up dashboard edits within its next tick rather than waiting on mtime.	2026-04-27 09:55:42 -04:00
anti	818aebadfc	feat(web): emailgen events in Orchestrator page The SSE pipe at /orchestrator/events/stream was already streaming 'orchestrator.email.{decky_uuid}' events (the subscription is for the 'orchestrator.>' wildcard), but the consumer side dropped them on the floor. Three fixes to close the loop: * useOrchestratorStream.ts now registers an 'email' SSE listener — the EventSource silently ignores frames whose event name has no listener, so missing this entry meant every email frame was dropped before reaching the page's onEvent handler. * /api/v1/orchestrator/events accepts kind=email and dispatches to list_orchestrator_emails, adapting rows to the existing wire shape: subject -> action, sender_email -> src_decky_uuid, recipient_email -> dst_decky_uuid, plus email-specific extras (thread_id, language, mail_decky_uuid, message_id, in_reply_to) ride along as top-level keys. * Orchestrator.tsx gains an 'email' tab in the kind filter and a branch in the row renderer / inspector that: - shows full sender / recipient (no UUID truncation), - chips the language code next to the subject, - relabels ACTION as SUBJECT in the inspector and surfaces thread / in-reply-to / mail-decky details. The 'all' tab continues to show traffic+file only (today's behavior); operators see emails by switching to the email tab. A union view at the API layer is the obvious follow-up but not necessary for now.	2026-04-26 22:56:48 -04:00
anti	4badc75fb2	feat(emailgen): global persona pool + Date-stamped EML mtimes Two changes that unwind earlier MazeNET-only assumptions and fix a realism tell: 1. Persona resolution is now per-decky-source, not topology-only. The scheduler walks the union view (list_running_deckies, including fleet MACVLAN/IPVLAN + SWARM shards) and picks the right persona list for each source: * topology decky -> Topology.email_personas (per-topology richness preserved) * fleet / shard -> a single host-wide pool loaded from disk (DECNET_EMAILGEN_PERSONAS, /etc/decnet/email_personas.json, or ~/.decnet/email_personas.json) Operators install the global pool via 'decnet emailgen import-personas <file>' which validates with the same Pydantic schema the worker uses. 2. The driver now runs 'touch -d <Date>' inside the docker exec right after the EML write so file mtime matches the email's RFC 2822 Date: header. Without this an attacker 'ls -lt'ing the spool sees every email clustered inside the worker's tick window — the cluster itself was a stylometric tell. CLI now exposes 'decnet emailgen' as a sub-app with 'run' (default, backwards-compatible with bare 'decnet emailgen') and 'import-personas'. list_running_deckies carries topology_id through so consumers can resolve the parent topology without a second round-trip.	2026-04-26 22:39:16 -04:00
anti	3ee55ec341	feat(emailgen): Ollama-driven fake email worker for IMAP/POP3 deckies Second orchestrator worker (decnet emailgen) that drips persona-driven, threaded, multi-language fake emails into running mail deckies. Personas live on Topology.email_personas; topology-wide language_default falls through to any persona that doesn't pin its own. Em-dashes are suppressed at the prompt layer by default and only lifted for personas explicitly marked uses_llms_heavily — em-dashes are an LLM tell and a flat corpus of em-dashed mail is a giveaway. EML delivery writes into /var/spool/decnet-emails/<thread>/<msg>.eml on the mail decky via docker exec; wiring the IMAP/POP3 templates to read from that spool (replacing the hardcoded _BAIT_EMAILS) is the next step.	2026-04-26 22:16:19 -04:00
anti	9650366d34	fix(orchestrator): drop topology_deckies FK on event src/dst columns Once the orchestrator started seeing fleet + SWARM shard sources via list_running_deckies (`a844148`), every event row landing on a fleet decky broke the FK to topology_deckies — the column now carries opaque ids ("local:omega-decky" for fleet, "host_uuid:decky_name" for shards) that will never match topology_deckies.uuid. Symptom on the operator's mothership: IntegrityError 1452 — orchestrator_events_ibfk_2 FK violated on every tick once the reconciler populated fleet_deckies. Index on dst_decky_uuid is preserved (the dashboard reads "events for this decky" frequently); only the FK is removed. Keeps data integrity loose by design — events are append-only history that should outlive the deckies they reference. Existing MySQL deployments need the FK dropped manually: ALTER TABLE orchestrator_events DROP FOREIGN KEY orchestrator_events_ibfk_2, DROP FOREIGN KEY orchestrator_events_ibfk_1; SQLite users are unaffected — SQLite doesn't enforce FKs by default.	2026-04-26 21:40:06 -04:00
anti	c3518e3159	feat(workers): surface clusterer, campaign-clusterer, reconciler in panel The Workers panel (Config → Workers tab) hardcodes its row list in KNOWN_WORKERS — by design, so a rogue publisher can't inject UI rows. Three heartbeat-emitting workers were missing: * clusterer — behavioral clustering (decnet/clustering/) * campaign-clusterer — campaign assembly (decnet/clustering/campaign/) * reconciler — host-local fleet convergence (added in `430262e`) Each already publishes on system.<name>.health via run_health_heartbeat, so they show up live the moment they're added to the registry — no frontend or subscriber wiring needed (Config.tsx renders whatever /workers returns). Also added to _PREFERRED_ORDER in start-all so START ALL WORKERS brings them up in dependency-friendly order: data-plane → reconciler → intel → clustering → output → orchestrator. Three deployable units (listener, web, swarmctl) intentionally remain absent from KNOWN_WORKERS — they don't emit heartbeats (CLI / static server / one-shot tooling), so they'd permanently render as UNKNOWN and confuse operators. Adding them is a separate decision that needs a "synthesize installed-but-silent rows" pass on the registry.	2026-04-26 21:31:34 -04:00
anti	8814902999	docs(api): clarify fleet_deckies + JSON dual-write happens in engine.deployer The unihost API path delegates to engine.deployer.deploy(), which now writes both decnet-state.json (existing) and the fleet_deckies DB table (added in `646aeec`). Comment makes the single-sink design explicit so future maintainers don't add a parallel save_state / upsert_fleet_decky call here. No behavioral change — every fleet-creation path on every host (CLI deploy, this unihost API path, and per-worker SWARM agent deploys) already routes through the engine.deployer single sink.	2026-04-26 21:08:44 -04:00
anti	095500ae9a	feat(db): FleetDecky table mirrors decnet-state.json into the DB Adds a fleet_deckies table so DB-only consumers (orchestrator, web dashboard, REST API) can see unihost / MACVLAN / IPVLAN deckies without reading the JSON state file. Mirrors DeckyShard field-for-field. Composite PK (host_uuid, name) future-proofs for a mothership that runs both a local fleet and acts as a swarm master. host_uuid defaults to the "local" sentinel — no FK to swarm_hosts because the local mothership isn't enrolled as a worker. Repo additions: upsert_fleet_decky, delete_fleet_decky, list_fleet_deckies, list_running_fleet_deckies, update_fleet_decky_state, plus list_running_deckies which unions topology + fleet + shard sources for the orchestrator. Smoke-tested round-trip against MySQL: upsert, list_running, union view (source="fleet"), delete.	2026-04-26 21:00:01 -04:00
anti	5b5ff54fa2	feat(web): orchestrator events read API + SSE stream GET /api/v1/orchestrator/events — paginated list with optional kind=traffic\|file filter. GET /api/v1/orchestrator/events/stream — SSE: snapshot on connect, live forward of orchestrator.> bus events mapped to 'traffic' / 'file' SSE event names. Repo gains list_orchestrator_events(limit, offset, kind?, since_ts?), count_orchestrator_events(kind?), and prune_orchestrator_events (per_dst_cap=10000) for periodic worker-side trimming.	2026-04-26 19:58:12 -04:00
anti	4c37ece39e	feat(orchestrator): MVP synthetic life-injection worker (SSH only) Adds a new decnet orchestrate worker whose job is to keep the honeypot ecosystem from looking suspiciously static — a frozen LAN with no inter-host traffic and no filesystem aging is its own honeypot tell. MVP scope: - New OrchestratorEvent table + repo methods (purpose-built sibling to Log so synthetic events stay separable from attacker-driven ones). - New orchestrator.{activity,file}.<decky_id> bus topics + system.orchestrator.health heartbeat. - SSH-only driver. Traffic action runs python3 inside src container to TCP-connect dst:22 and read the SSH banner — real on-the-wire SSH-protocol traffic without shipping creds. File action drops or refreshes a small file via docker exec on the destination. - Random scheduler (50/50 traffic/file when >=2 SSH-capable deckies are running). Diurnal shaping, role-aware pairing, and session-aware backoff are explicit non-goals for MVP. - CLI registration, systemd unit (SupplementaryGroups=docker), worker-registry entry so the dashboard shows orchestrator health. - 11 tests: scheduler policy, driver argv shape + injection-safety, end-to-end one-tick integration with FakeBus + SQLite.	2026-04-26 19:43:20 -04:00
anti	d531cea536	feat(web): read-only campaigns API + SSE + frontend API: /api/v1/campaigns (paginated list), /api/v1/campaigns/{uuid} (soft-merge chain follow), /api/v1/campaigns/{uuid}/identities (member identities), and /api/v1/campaigns/events (SSE under campaign.> + JWT-via-?token=, snapshot-on-connect). Mirror of the identity router; same auth, same shape, same OpenAPI tags pattern. Frontend: CampaignDetail.tsx page (same visual vocabulary as IdentityDetail), useCampaignStream hook (mirror of useIdentityStream), /campaigns/:id route, IdentityDetail's CAMPAIGN badge becomes clickable and navigates to the campaign. useIdentityStream now listens for identity.campaign.assigned so the badge appears live without a manual refresh.	2026-04-26 09:20:17 -04:00
anti	75af00c9c8	test(clustering): full-bound passes through production campaign clusterer Runs the chained identity + campaign clustering pipeline against all seven fixtures via from_synthetic / from_synthetic_identity adapters and ratchets every YAML floor to 1.0 — the production clusterer (and the reference clusterers used in the per-fixture tests) all score perfectly across ARI / homogeneity / completeness / singleton_recall on each fixture. Three substrate fixes surfaced by the ratchet: - Tuning: shared_infra now Jaccards payload+C2 only; decky_set moved into cohort_weight to prevent fleet-scarcity false-merges (F1's shared_wordlist failure mode). Tier weight raised to 1.0 so shared payload+C2 alone crosses threshold (F5's intended pass). - Adapter: from_synthetic_identity now reads SyntheticSession started_at + duration_s for session_windows and per-decky timestamps (the production-row adapter still uses start_ts/end_ts when available). - Fixture data: paused_campaign.yaml's JA3 collided exactly with vpn_hopping.yaml's (same TLS extension list). The collision fused two unrelated campaigns under the chained identity layer in the noise_floor composite. Made paused's JA3 distinct. Also wires Campaign / CampaignsResponse into models/__init__.py's __all__ that was missed in the schema commit.	2026-04-26 09:13:59 -04:00
anti	0a1cf65ddb	feat(db): Campaign SQLModel + repo write/read methods Adds the campaigns table and the BaseRepository / SQLModelRepository methods that the campaign-clusterer worker (next commit) needs to populate it. Mirrors the AttackerIdentity layer: schema_version from day one for federation gossip, soft-merge via merged_into_uuid with a chain-walking get_campaign_by_uuid, list_campaigns excluding merged- out rows while list_all_campaigns returns the unfiltered set for the revoke pass. attacker_identities.campaign_id gets a real FK now that the target table exists.	2026-04-26 08:54:28 -04:00
anti	97aa57faed	feat(api): SSE stream for identity events at /api/v1/identities/events Mirrors GET /api/v1/topologies/{id}/events: subscribes to identity.> on the bus for the duration of the request and forwards each event as a named SSE frame (formed / observation.linked / merged / unmerged). The endpoint is broadly scoped (every identity event, not per-uuid) because both AttackerDetail and IdentityDetail need the same firehose: AttackerDetail watches for an identity.formed that finally binds its identity_id; IdentityDetail watches for observation.linked / merged / unmerged against its current row. A per-uuid filter would force the client to know its identity before subscribing, which it doesn't always. JWT via ?token= (EventSource can't set headers), require_stream_viewer gate, sse_connection_slot per-user cap, snapshot-on-connect with the first 50 identities so the client buffer renders without a separate REST call. Bus-disabled / unreachable path keeps the connection alive on keepalives so the client doesn't reconnect-storm; it can re-poll the REST API on its own timer.	2026-04-26 08:36:17 -04:00
anti	e364ef8859	feat(clustering): revocable merges (merge + unmerge) Reworks the clusterer's tick to handle multi-identity components and re-evaluate prior merges. Two passes per tick: Pass 1 — per-component reconciliation: * Fresh component → mint identity (commit 4 path). * Single-identity component → link unassigned observations. * Multi-identity component → soft-merge: pick the smallest-uuid winner deterministically, set merged_into_uuid on each loser, link unassigned observations to the winner. Observations stay FK'd to their original identity row — the merge is a soft pointer, not a re-point. Audit trail preserved; cached subscribers resolve through the chain. Pass 2 — revocable-merge undo: * For each merged-out identity, check whether its observations still cluster with its winner's. If not, the merge is contradicted by new evidence — clear merged_into_uuid and emit identities_unmerged. The resurrected identity keeps its original uuid, so subscribers that cached it during the merged interval re-attach without a new lookup. A pre-built merge-chain dict feeds Pass 1 so the effective-identity lookup is O(1) per observation. The chain has a hop cap (paranoia against accidental cycles in the underlying state). Repo additions on BaseRepository + SQLModelRepository: * list_all_identities() — includes merged-out rows. * update_identity_merged_into(uuid, winner_or_None) — single setter for both merge and unmerge. DummyRepo coverage stub updated. Tests: * Two distinct identities bridged by a new observation merge with the smaller uuid as winner. * A pre-seeded soft-merge whose underlying observations diverge gets revoked; resurrected uuid emerges with merged_into_uuid cleared. * Tick is idempotent under no state changes.	2026-04-26 08:33:32 -04:00
anti	de2f4c3a62	feat(clustering): wire high-weight edges end-to-end The connected-components clusterer now writes attacker_identities rows + sets attackers.identity_id when high-weight signals (JA3 / HASSH / payload-hash / C2-endpoint exact match) agree across observations. Singletons stay un-fingerprinted and un-clustered. Algorithm split: - cluster_observations(observations) — pure union-find over the high-weight edge function. Same code path for fixture validation and production tick. - from_attacker_row(row) — production-row adapter; recovers JA3 + HASSH from Attacker.fingerprints JSON. Payload + C2 join from logs in later commits; the function shape doesn't change. Repo additions on BaseRepository + SQLModelRepository: - list_attackers_for_clustering(limit=None) - create_attacker_identity(row) - set_attacker_identity_id(attacker_uuid, identity_uuid) DummyRepo coverage stub updated. v1 behavior is conservative: only assigns identities to observations whose identity_id is currently NULL. Multi-identity components are skipped this pass — merge / re-assign lands in commit 10 with revocable merges. Fixture bounds tightened against the production clusterer: - lone_wolf (F3) — singletons stay singletons - shared_wordlist (F1) — credential-only overlap doesn't cluster (high-weight tier doesn't include credentials) - vpn_hopping (F2, identity-level) — 5 rotated IPs with stable JA3 + HASSH fold into one identity, ARI = 1.0, completeness = 1.0	2026-04-26 08:19:56 -04:00
anti	dc3d08dd41	feat(web): read-only /api/v1/identities/* endpoints + repo methods Second of the five-step identity-resolution substrate. Ships the API surface against the empty AttackerIdentity table from commit 1 — every endpoint returns empty/404 cleanly until the clusterer populates rows. Routes (auth-gated, viewer role): * GET /api/v1/identities — paginated list, excludes merged-out rows * GET /api/v1/identities/{uuid} — detail; transparently follows merged_into_uuid to surface the canonical winner * GET /api/v1/identities/{uuid}/observations — Attacker rows FK'd to the (resolved) identity uuid Repository (BaseRepository abstract + SQLModelRepository concrete): * get_identity_by_uuid (with merge-chain following, hop-bounded) * list_identities / count_identities (excluding merged-out) * list_observations_for_identity / count_observations_for_identity Tests: 12 new (empty-table behavior, seeded data, merge-chain resolution, repo-level smoke against real SQLite). Also fixes the pre-existing test_base_repo_coverage failure (DEBT-041 added abstract methods without updating the DummyRepo stub) — included here because this PR adds 5 more abstract methods, fixing it as a bonus. 474 db/web/profiler/correlation tests green.	2026-04-26 07:08:55 -04:00
anti	84c1ca9c9b	feat(identity): AttackerIdentity table + nullable attackers.identity_id FK Schema-only commit, first of the five-step substrate for identity resolution. The clusterer that populates identities lands later; this ships the table empty and the FK uniformly NULL on existing rows. * decnet/web/db/models/attackers.py — new AttackerIdentity SQLModel (uuid PK, schema_version, fingerprint summary lists, kd_digraph_simhash, merged_into_uuid self-FK, all clusterer-populated fields nullable). Attacker grows a nullable indexed identity_id FK + docstring marking it as the per-IP observation row. * decnet/web/db/models/__init__.py — re-exports AttackerIdentity. * tests/db/test_identity_schema.py — 9 schema invariants: table exists, identity_id nullable + indexed, FK targets attacker_identities.uuid, schema_version defaults to 1, attacker rows inserted with NULL identity_id, FK constraint blocks orphans. 463 unrelated db/web/profiler/correlation tests still green. See development/IDENTITY_RESOLUTION.md for the full design.	2026-04-26 07:00:24 -04:00
anti	3eb67c9400	refactor(intel): re-key attacker_intel on attacker_uuid (closes DEBT-041) The threat-intel surface was IP-keyed on day one as an expedient — the worker is woken by IP-bearing bus events. ANTI's call: don't carry that debt. NO IPs as primary keys anywhere on the attacker-intel surface. Schema: - attacker_uuid is now the canonical key — UNIQUE + FK to attackers.uuid. - attacker_ip stays as a denormalised, indexed, NON-UNIQUE value column. Updated on every upsert; useful for SIEM payloads and audit lookups, but explicitly NOT a key. Model docstring says so. - Pre-v1, no Alembic migration needed. SQLModel.metadata.create_all() builds the new shape on fresh DBs. Repo: - upsert_attacker_intel now keys on attacker_uuid. - get_attacker_intel_by_ip → get_attacker_intel_by_uuid. - get_unenriched_attacker_ips → get_unenriched_attackers, returning [{uuid, ip}] tuples so the worker writes by UUID and dispatches provider calls by IP without a second round-trip. Worker: - _enrich_one(uuid, ip, ...) — UUID lands on the row, IP rides for provider egress. - attacker.intel.enriched bus payload gains attacker_uuid alongside attacker_ip — webhook → SIEM consumers benefit; no removal. API: - GET /api/v1/attackers/{ip}/intel deleted outright (rip-and-replace, never deployed beyond dev). - GET /api/v1/attackers/{uuid}/intel is the only public route, matching every other /attackers/* route. Frontend: - <IntelPanel uuid={id!} /> uses the URL param directly, fetches in parallel with the rest of AttackerDetail rather than waiting on attacker.ip. Tests: re-keyed in place, 39 passed (same coverage as before the refactor). Provider-impl tests untouched. DEBT-041: closed in DEBT.md (entry preserved as historical rationale, summary table flipped to ✅, remaining-open list shortened by one).	2026-04-26 05:35:29 -04:00
anti	8a6d632ab0	feat(deploy): systemd unit for decnet-enrich + register in worker panel Mirrors decnet-reuse-correlator.service.j2: same hardening posture (NoNewPrivileges, ProtectSystem=full, etc.), same restart policy, same log file convention. The decnet init renderer picks it up automatically via the decnet-*.service.j2 glob. Also reconciles a naming inconsistency I shipped earlier: the heartbeat name was 'intel' (the package) but the CLI command and unit are 'enrich' (the action). Renamed the heartbeat to 'enrich' so the workers panel displays the same string the operator types and the same string in the systemd unit file. Convention across the project: heartbeat name = registry key = unit basename = CLI command name. Registers 'enrich' in worker_registry.KNOWN_WORKERS and in the start-all preferred order. The decnet.target Wants= list also picks up the new unit so 'systemctl start decnet.target' brings everything up together.	2026-04-26 05:20:54 -04:00
anti	d3d9bd5aa7	feat(intel): `decnet enrich` CLI + GET /attackers/{ip}/intel endpoint CLI command mirrors the reuse-correlate shape (--poll-interval, --ttl-hours, --daemon). Run it under systemd as a sibling worker. The API endpoint returns the most recent cached row for an attacker IP or 404. Auth-gated via require_viewer like every other attacker route. Also extends the worker test with a real FakeBus so the attacker.intel.enriched publish path is exercised end-to-end (no longer a no-op against NullBus).	2026-04-26 05:17:25 -04:00
anti	0dd3811436	feat(intel): attacker_intel table + repo helpers New TTL-cached threat-intel row keyed by attacker IP, with per-provider verdict/raw/queried_at columns for GreyNoise, AbuseIPDB, abuse.ch Feodo Tracker and ThreatFox. Carries schema_version from day one (federation wire-format precedent set by SessionProfile). Repo gains upsert_attacker_intel, get_attacker_intel_by_ip, and a get_unenriched_attacker_ips backfill primitive that picks fresh + stale rows for the forthcoming 'decnet enrich' worker. Also documents the open-source intel-source backlog in DEVELOPMENT_V2.	2026-04-26 04:56:47 -04:00
anti	50870f2e7a	feat(creds): surface plaintext/b64 secret on reuse findings The CredentialReuse table only stores the sha256+kind hash of the secret; the printable + b64 forms live on the underlying Credential rows. The dashboard drawer was therefore showing only the hash, which defeats most of the value of having a reuse view in the first place. Repo helpers list_credential_reuses + get_credential_reuse_by_id now issue one batched SELECT against credentials keyed on the sha256s in the result page and graft secret_printable + secret_b64 onto each row before returning. The drawer renders the same printable/b64 code-block the credentials inspector uses.	2026-04-26 04:34:19 -04:00

1 2 3 4 5 ...

274 Commits