DECNET

Author	SHA1	Message	Date
anti	4749c972e5	feat(prober-cert): schema for active TLS cert capture Adds storage for TLS certificate details collected from attacker-run servers by the active prober (sibling to the existing JARM probe). - AttackerIdentity.tls_cert_sha256 / Campaign.tls_cert_sha256: JSON list[str] columns mirroring ja3_hashes / hassh_hashes for federation gossip. - ingester clause 9b: emits a 'tls_certificate' fingerprint bounty when a prober event carries subject_cn (disjoint from the existing sniffer-gated clause). - Prober-side capture (ssl.wrap_socket follow-up after JARM) and profiler rollup land in sibling commits.	2026-04-28 11:09:25 -04:00
anti	2cc60bd677	feat(realism): operator-tunable planner weights via realism_config New realism_config table (uuid PK + unique key) + two repo methods (get/set) backs an admin-only GET/PUT /api/v1/realism/config surface. The planner now exposes apply_payload(payload) / current_payload() / reset_to_defaults() and reads its weights through mutable module globals; pick() resolves the live values each call. Validation catches negative weights, zero totals, out-of-range canary_probability, unknown content_class names, and silently drops cross-list entries (canary class on the user list, etc). The orchestrator worker calls _refresh_realism_config(repo) on startup and every 5 ticks (~5min at 60s interval). Operator changes land within one refresh window with no bus signal — the simpler path for a knob whose latency tolerance is minutes.	2026-04-27 18:00:08 -04:00
anti	da3c35c6a4	fix(realism): synthetic_files path fits MySQL utf8mb4 index cap The (decky_uuid VARCHAR(64), path VARCHAR(1024)) UNIQUE constraint generated a 4352-byte composite key under utf8mb4 (4 bytes/char), busting MySQL's 3072-byte cap and crashing decnet api on init with: Specified key was too long; max key length is 3072 bytes Tighten path to VARCHAR(512) — (64+512)*4 = 2304 bytes, well under the cap. Real realism + canary placement paths are short (/home/<persona>/Documents/<file>, ~70 chars); 512 keeps headroom without the index hassle. Pre-v1, no migration helper. Adds a regression test pinning the (decky_uuid + path) byte budget so a future widening fails loudly in CI rather than at MySQL deploy time.	2026-04-27 17:55:35 -04:00
anti	7e9bc6d49a	refactor(realism): enforce synthetic_files 64KB cap at the repo The orchestrator worker clipped last_body at write time, but the repo didn't enforce. A future caller that forgot the clip would write the full body. Move the clip to record_synthetic_file and update_synthetic_file via SYNTHETIC_FILE_BODY_LIMIT in decnet/web/db/models/realism.py. Worker now passes the full body and trusts the repo. Tests retargeted to assert repo enforcement.	2026-04-27 17:37:36 -04:00
anti	cb1872c52f	feat(realism): synthetic_files table + planner wiring + scheduler swap Stage 3 of the realism migration. Replaces orchestrator/scheduler.py's hardcoded _FILE_TEMPLATES/_USERS (3 templates emitting epoch-suffixed filenames like notes-1777315854.txt with identical bodies per template) with a persona-driven realism engine. New surface: - SyntheticFile SQLModel (synthetic_files table, UNIQUE on decky_uuid+path) — per-(decky, path) state for the future edit-in-place flow. Pre-v1, no _migrate_* helper. - BaseRepository methods: record_synthetic_file, update_synthetic_file, list_synthetic_files, pick_random_synthetic_file_for_edit (used by stage 3b). - realism/naming.py: per-content-class filename templates, persona-conditioned. /var/log/cron.log + logrotate skeleton for system-class; /home/<persona>/TODO.md, scratch.md, etc. for user-class. Anti-regression test pins "no 8+ digit decimals in basenames" (the realism failure today). - realism/bodies.py: deterministic body templates per content_class. TODO body uses checkbox markdown, script body has a shebang, cron body matches syslog cron shape ("CRON[PID]: (user) CMD (...)"). - realism/planner.py: pick(deckies, now, rng) returns a Plan. Diurnal-gated, weighted user/system content split (70/30 user bias). Create-only in stage 3; edit branch lands in stage 3b. Scheduler split: - scheduler.pick is now traffic-only (sync). - scheduler.pick_file is async, takes a repo, resolves personas (Topology.email_personas for topology-source deckies; global realism.personas_pool otherwise), and maps Plan -> FileAction. - FileAction gains persona/content_class/mtime fields. Worker: - _one_tick rolls 50/50 between traffic and file each tick. After a successful FileAction plant, _record_synthetic_file persists or patches the synthetic_files row (catching the unique-constraint collision on re-plant of the same path). - SSHDriver._run_file passes action.mtime through to plant_file so files don't all stamp at wall-clock-now.	2026-04-27 16:22:07 -04:00
anti	813f14bf2a	feat(db): canary token tables (blob/token/trigger) Three new tables for the canary tokens feature: - canary_blobs — operator-uploaded source artifacts, deduped by sha256 - canary_tokens — one planted artifact in one decky; carries the callback slug, generator/instrumenter, and lifecycle - canary_triggers — append-only log of every callback hit; attacker_id back-filled by the correlator Pydantic request/response shapes live in the same file per the single-source-of-truth convention. No migrations file — pre-v1 SQLModel.metadata.create_all() covers it.	2026-04-27 12:45:41 -04:00
anti	3ee55ec341	feat(emailgen): Ollama-driven fake email worker for IMAP/POP3 deckies Second orchestrator worker (decnet emailgen) that drips persona-driven, threaded, multi-language fake emails into running mail deckies. Personas live on Topology.email_personas; topology-wide language_default falls through to any persona that doesn't pin its own. Em-dashes are suppressed at the prompt layer by default and only lifted for personas explicitly marked uses_llms_heavily — em-dashes are an LLM tell and a flat corpus of em-dashed mail is a giveaway. EML delivery writes into /var/spool/decnet-emails/<thread>/<msg>.eml on the mail decky via docker exec; wiring the IMAP/POP3 templates to read from that spool (replacing the hardcoded _BAIT_EMAILS) is the next step.	2026-04-26 22:16:19 -04:00
anti	9650366d34	fix(orchestrator): drop topology_deckies FK on event src/dst columns Once the orchestrator started seeing fleet + SWARM shard sources via list_running_deckies (`a844148`), every event row landing on a fleet decky broke the FK to topology_deckies — the column now carries opaque ids ("local:omega-decky" for fleet, "host_uuid:decky_name" for shards) that will never match topology_deckies.uuid. Symptom on the operator's mothership: IntegrityError 1452 — orchestrator_events_ibfk_2 FK violated on every tick once the reconciler populated fleet_deckies. Index on dst_decky_uuid is preserved (the dashboard reads "events for this decky" frequently); only the FK is removed. Keeps data integrity loose by design — events are append-only history that should outlive the deckies they reference. Existing MySQL deployments need the FK dropped manually: ALTER TABLE orchestrator_events DROP FOREIGN KEY orchestrator_events_ibfk_2, DROP FOREIGN KEY orchestrator_events_ibfk_1; SQLite users are unaffected — SQLite doesn't enforce FKs by default.	2026-04-26 21:40:06 -04:00
anti	095500ae9a	feat(db): FleetDecky table mirrors decnet-state.json into the DB Adds a fleet_deckies table so DB-only consumers (orchestrator, web dashboard, REST API) can see unihost / MACVLAN / IPVLAN deckies without reading the JSON state file. Mirrors DeckyShard field-for-field. Composite PK (host_uuid, name) future-proofs for a mothership that runs both a local fleet and acts as a swarm master. host_uuid defaults to the "local" sentinel — no FK to swarm_hosts because the local mothership isn't enrolled as a worker. Repo additions: upsert_fleet_decky, delete_fleet_decky, list_fleet_deckies, list_running_fleet_deckies, update_fleet_decky_state, plus list_running_deckies which unions topology + fleet + shard sources for the orchestrator. Smoke-tested round-trip against MySQL: upsert, list_running, union view (source="fleet"), delete.	2026-04-26 21:00:01 -04:00
anti	5b5ff54fa2	feat(web): orchestrator events read API + SSE stream GET /api/v1/orchestrator/events — paginated list with optional kind=traffic\|file filter. GET /api/v1/orchestrator/events/stream — SSE: snapshot on connect, live forward of orchestrator.> bus events mapped to 'traffic' / 'file' SSE event names. Repo gains list_orchestrator_events(limit, offset, kind?, since_ts?), count_orchestrator_events(kind?), and prune_orchestrator_events (per_dst_cap=10000) for periodic worker-side trimming.	2026-04-26 19:58:12 -04:00
anti	4c37ece39e	feat(orchestrator): MVP synthetic life-injection worker (SSH only) Adds a new decnet orchestrate worker whose job is to keep the honeypot ecosystem from looking suspiciously static — a frozen LAN with no inter-host traffic and no filesystem aging is its own honeypot tell. MVP scope: - New OrchestratorEvent table + repo methods (purpose-built sibling to Log so synthetic events stay separable from attacker-driven ones). - New orchestrator.{activity,file}.<decky_id> bus topics + system.orchestrator.health heartbeat. - SSH-only driver. Traffic action runs python3 inside src container to TCP-connect dst:22 and read the SSH banner — real on-the-wire SSH-protocol traffic without shipping creds. File action drops or refreshes a small file via docker exec on the destination. - Random scheduler (50/50 traffic/file when >=2 SSH-capable deckies are running). Diurnal shaping, role-aware pairing, and session-aware backoff are explicit non-goals for MVP. - CLI registration, systemd unit (SupplementaryGroups=docker), worker-registry entry so the dashboard shows orchestrator health. - 11 tests: scheduler policy, driver argv shape + injection-safety, end-to-end one-tick integration with FakeBus + SQLite.	2026-04-26 19:43:20 -04:00
anti	75af00c9c8	test(clustering): full-bound passes through production campaign clusterer Runs the chained identity + campaign clustering pipeline against all seven fixtures via from_synthetic / from_synthetic_identity adapters and ratchets every YAML floor to 1.0 — the production clusterer (and the reference clusterers used in the per-fixture tests) all score perfectly across ARI / homogeneity / completeness / singleton_recall on each fixture. Three substrate fixes surfaced by the ratchet: - Tuning: shared_infra now Jaccards payload+C2 only; decky_set moved into cohort_weight to prevent fleet-scarcity false-merges (F1's shared_wordlist failure mode). Tier weight raised to 1.0 so shared payload+C2 alone crosses threshold (F5's intended pass). - Adapter: from_synthetic_identity now reads SyntheticSession started_at + duration_s for session_windows and per-decky timestamps (the production-row adapter still uses start_ts/end_ts when available). - Fixture data: paused_campaign.yaml's JA3 collided exactly with vpn_hopping.yaml's (same TLS extension list). The collision fused two unrelated campaigns under the chained identity layer in the noise_floor composite. Made paused's JA3 distinct. Also wires Campaign / CampaignsResponse into models/__init__.py's __all__ that was missed in the schema commit.	2026-04-26 09:13:59 -04:00
anti	0a1cf65ddb	feat(db): Campaign SQLModel + repo write/read methods Adds the campaigns table and the BaseRepository / SQLModelRepository methods that the campaign-clusterer worker (next commit) needs to populate it. Mirrors the AttackerIdentity layer: schema_version from day one for federation gossip, soft-merge via merged_into_uuid with a chain-walking get_campaign_by_uuid, list_campaigns excluding merged- out rows while list_all_campaigns returns the unfiltered set for the revoke pass. attacker_identities.campaign_id gets a real FK now that the target table exists.	2026-04-26 08:54:28 -04:00
anti	84c1ca9c9b	feat(identity): AttackerIdentity table + nullable attackers.identity_id FK Schema-only commit, first of the five-step substrate for identity resolution. The clusterer that populates identities lands later; this ships the table empty and the FK uniformly NULL on existing rows. * decnet/web/db/models/attackers.py — new AttackerIdentity SQLModel (uuid PK, schema_version, fingerprint summary lists, kd_digraph_simhash, merged_into_uuid self-FK, all clusterer-populated fields nullable). Attacker grows a nullable indexed identity_id FK + docstring marking it as the per-IP observation row. * decnet/web/db/models/__init__.py — re-exports AttackerIdentity. * tests/db/test_identity_schema.py — 9 schema invariants: table exists, identity_id nullable + indexed, FK targets attacker_identities.uuid, schema_version defaults to 1, attacker rows inserted with NULL identity_id, FK constraint blocks orphans. 463 unrelated db/web/profiler/correlation tests still green. See development/IDENTITY_RESOLUTION.md for the full design.	2026-04-26 07:00:24 -04:00
anti	3eb67c9400	refactor(intel): re-key attacker_intel on attacker_uuid (closes DEBT-041) The threat-intel surface was IP-keyed on day one as an expedient — the worker is woken by IP-bearing bus events. ANTI's call: don't carry that debt. NO IPs as primary keys anywhere on the attacker-intel surface. Schema: - attacker_uuid is now the canonical key — UNIQUE + FK to attackers.uuid. - attacker_ip stays as a denormalised, indexed, NON-UNIQUE value column. Updated on every upsert; useful for SIEM payloads and audit lookups, but explicitly NOT a key. Model docstring says so. - Pre-v1, no Alembic migration needed. SQLModel.metadata.create_all() builds the new shape on fresh DBs. Repo: - upsert_attacker_intel now keys on attacker_uuid. - get_attacker_intel_by_ip → get_attacker_intel_by_uuid. - get_unenriched_attacker_ips → get_unenriched_attackers, returning [{uuid, ip}] tuples so the worker writes by UUID and dispatches provider calls by IP without a second round-trip. Worker: - _enrich_one(uuid, ip, ...) — UUID lands on the row, IP rides for provider egress. - attacker.intel.enriched bus payload gains attacker_uuid alongside attacker_ip — webhook → SIEM consumers benefit; no removal. API: - GET /api/v1/attackers/{ip}/intel deleted outright (rip-and-replace, never deployed beyond dev). - GET /api/v1/attackers/{uuid}/intel is the only public route, matching every other /attackers/* route. Frontend: - <IntelPanel uuid={id!} /> uses the URL param directly, fetches in parallel with the rest of AttackerDetail rather than waiting on attacker.ip. Tests: re-keyed in place, 39 passed (same coverage as before the refactor). Provider-impl tests untouched. DEBT-041: closed in DEBT.md (entry preserved as historical rationale, summary table flipped to ✅, remaining-open list shortened by one).	2026-04-26 05:35:29 -04:00
anti	0dd3811436	feat(intel): attacker_intel table + repo helpers New TTL-cached threat-intel row keyed by attacker IP, with per-provider verdict/raw/queried_at columns for GreyNoise, AbuseIPDB, abuse.ch Feodo Tracker and ThreatFox. Carries schema_version from day one (federation wire-format precedent set by SessionProfile). Repo gains upsert_attacker_intel, get_attacker_intel_by_ip, and a get_unenriched_attacker_ips backfill primitive that picks fresh + stale rows for the forthcoming 'decnet enrich' worker. Also documents the open-source intel-source backlog in DEVELOPMENT_V2.	2026-04-26 04:56:47 -04:00
anti	ce4be68501	feat(creds): cred-reuse foundation + vectorstore scaffold Lays the storage and bus substrate for the "credential reuse patterns" task in DEVELOPMENT.md and scaffolds decnet/vectorstore/ as the future substrate for statistical attacker re-identification over behavioral fingerprints. No correlator, profiler, API, or dashboard wiring in this commit — see TODO.md for the handoff. Schema: - Credential.attacker_uuid (nullable FK to attackers.uuid), backfilled by the profiler post-write to avoid coupling the capture path to the profiler's ordering. - CredentialReuse table — UUID PK, JSON list columns for the accumulating attacker_uuids/ips/deckies/services, target_count (the discriminative scalar), confidence reserved for a future fuzzy-credential pass. Repo: - upsert_credential_reuse / list_credential_reuses / get_credential_reuse_by_id / update_credential_attacker_uuid. - Renamed pre-existing get_credential_reuse(secret_sha256) to get_credential_attempts_for_secret(secret_sha256) — the new findings table needs the cleaner name. Bus topics: - credential.captured (one per Credential upsert) - credential.reuse.detected (correlator-emitted on insert/grow) Vectorstore subpackage (decnet/vectorstore/, flat layout mirroring decnet/bus/): - BaseVectorStore ABC keyed by (kind, id) — kind discriminator means new feature families are additive, no schema migration. - FakeVectorStore (in-memory L2 KNN), NullVectorStore (no-op for DECNET_VECTORSTORE_ENABLED=false), SqliteVecVectorStore (lazy sqlite_vec extension load, one vec0 virtual table per kind). - get_vectorstore() env-driven dispatch with graceful fallback to FakeVectorStore when the sqlite-vec extension isn't on the host, so workers don't crash on a missing optional dep. Tests: 26 new (11 cred-reuse repo, 15 vectorstore). Existing credentials and base-repo tests updated for the rename. Total: 34 passing on the touched files.	2026-04-26 03:18:34 -04:00
anti	6b16c844b6	fix(creds): MQTT regression + secret_kind for hash credentials Honest correction to the "every cred-emitting service" claim. Audit of templates/* found three gaps: 1. MQTT — was working through the legacy adapter, silently dropped when Phase 3 (`e696c2b`) deleted it. Now migrated to encode_secret() alongside the others. 2. Postgres — `auth, pw_hash=…` event captures the MD5 challenge-response the attacker sent. Plaintext irrecoverable, so it never fit the (principal, secret_b64=raw_bytes) shape. Lands in Credential as secret_kind="postgres_md5_challenge". 3. VNC — `auth_response, response=…hex` event captures the 16-byte DES-encrypted challenge. Same situation as Postgres: plaintext irrecoverable. Lands as secret_kind="vnc_des_response". Adds a `secret_kind` discriminator column to Credential (default "plaintext", indexed). The dedup tuple gains secret_kind so two credentials with the same sha256 but different kinds are fundamentally different rows — different challenges produce different bytes for the same plaintext password, so cross-kind reuse matches are meaningless and would only confuse analytics. The model now genuinely covers every cred-emitting service in the fleet: plaintext SSH, Telnet, FTP, POP3, IMAP, SMTP, Redis, LDAP, MQTT postgres_md5_* Postgres vnc_des_response VNC Username-only services (MySQL/MSSQL — TDS pre-encryption captures the user but never sees the password byte) intentionally don't feed Credential — they're recon signals, not cred attempts. 40 tests pass in the touched scope. New cases: secret_kind dedups independently in the repo; Postgres MD5 + VNC DES emitters thread through; MQTT round-trips through the native branch.	2026-04-25 06:16:57 -04:00
anti	2f47f67eef	feat(creds): future-proof Credential storage model Replaces the opaque Bounty.bounty_type='credential' path with a dedicated `credentials` table whose schema is forward-compatible across every auth-bearing service in the fleet. Hoisted indexed columns (secret_sha256, principal, service, attacker_ip) carry the universal reuse-analytics signal; service-specific JSON keys ride in `fields`. Cross-service reuse queries become an indexed lookup on secret_sha256 instead of JSON_EXTRACT scans. Schema decisions baked in (per ANTI): - New `Credential` table, not extension to Bounty - Hoisted `principal` column for cross-service principal-reuse - Standardized JSON keys: every payload carries secret_b64 + secret_printable + principal universally; service-specific extras (user, domain, dn, mech, …) ride alongside The auth-helper SD-block emits the new shape natively. The ingester forks at _extract_bounty: - Native shape (SSH/Telnet, future emitters): secret_b64 present → direct upsert_credential - Legacy shape (FTP/POP3/IMAP/SMTP today): username + password → adapter synthesizes secret_{b64,sha256,printable} on the fly, upserts into the same Credential table. Tracked as DEBT-039; one-shot bridge until those service templates migrate. Defense-in-depth across five layers (input validation): - C helper: bytes outside [0x20, 0x7f) collapse to '?', RFC 5424 escape rules for \\, ", ]; b64 preserves exact bytes - Ingester native branch: rejects malformed secret_b64 (regex), drops the credential row but keeps the underlying Log - Ingester legacy adapter: same printable-ASCII filter as the C code; sha256 + b64 over the original utf-8 bytes (lossless, even when secret_printable is sanitized) - DB column caps with truncation warning; sha256 always over the full pre-truncation bytes so reuse queries match across truncation - JSON serialized with ensure_ascii=True so utf8mb4 columns stay safe even with non-ASCII service-specific keys Bounty.bounty_type='credential' is no longer written. Pre-v1: no historical backfill; existing rows stay untouched but unused. 595 tests pass; new tests cover the model + repo (upsert dedup, null-principal independence, cross-service reuse, filters), both ingester branches, b64 validation, sanitization preserving the fingerprinting signal in b64.	2026-04-25 05:29:26 -04:00
anti	bcf460d2a5	feat(profiler): write ASN + AS name onto attacker rows Adds asn (int), as_name (varchar 128), asn_source (varchar 16) to the Attacker SQLModel — direct columns, no _migrate_* helper per feedback_no_new_migrations_prev1. Profiler worker now calls decnet.asn.enrich_ip alongside the existing geoip enrich_ip; both feed the upsert payload. Failure is total — if either lookup throws or the IP is private/unannounced, the field stays None and the row still writes. Both lookups are independent: a CGNAT address can have a country (RIR allocation) but no ASN (no BGP origin), and vice-versa for unrouted RIR-allocated space. Storing them separately preserves that signal.	2026-04-25 04:01:28 -04:00
anti	ee176a6f79	Revert "feat(mazenet): per-LAN swarm host pin" This reverts commit `0d92170a57`.	2026-04-25 03:26:19 -04:00
anti	0d92170a57	feat(mazenet): per-LAN swarm host pin Adds nullable LAN.host_uuid (FK swarm_hosts.uuid). Resolution order when deploying a LAN: lan.host_uuid → topology.target_host_uuid → master. A LAN is one Docker bridge so the bridge cannot span hosts; this pin forces every decky in the LAN onto the named host. LANCreateRequest / LANUpdateRequest accept host_uuid; both validate that the host exists, returning 400 on unknown UUIDs. PATCH still gated by the existing pending-only guard, so reassignment of a live LAN is not yet possible (deferred to mutator support). LANRow surfaces the field so the frontend can render per-host badges.	2026-04-25 03:04:23 -04:00
anti	5a34371009	feat(attackers): PTR record (reverse DNS) enrichment Resolve each attacker IP's rDNS name once at first sighting, store on Attacker.ptr_record, render on AttackerDetail under ORIGIN. Many attackers run infrastructure with forgotten rDNS that instantly identifies them once surfaced: scan-node-42.shodan.io, shady-vps.leasecloud.net, etc. Resolver lives in decnet/geoip/ptr.py — colocated with enrich_ip because the shape matches (take an IP, return supplementary metadata, never raise). Uses the OS resolver via socket.gethostbyaddr offloaded to the default executor, wrapped with asyncio.wait_for timeout=2s so a slow authoritative NS can't stall the profiler tick. Profiler side: _WorkerState grows a ptr_attempted: set[str] bounding resolution to once per worker lifetime. Cold-start batches resolve concurrently (Semaphore(_PTR_CONCURRENCY=10)) so a backlog doesn't serialize 2s ceilings. _build_record gains a keyword-only ptr_record parameter that, when _UNSET, omits the key from the record dict — upsert_attacker's attribute-merge loop then preserves whatever's stored on the row. Explicit None is a "fresh failed attempt" signal and gets written through. Env kill-switch DECNET_PTR_ENABLED=false for locked-down deploys where egress DNS is forbidden. Private / loopback / link-local / multicast / reserved addresses short-circuit before any DNS call. IPv6 reverse DNS works transparently through the stdlib resolver. Schema change — run once on upgrade: ALTER TABLE attackers ADD COLUMN ptr_record VARCHAR(256) NULL DEFAULT NULL; Or drop-and-recreate on dev boxes (db-reset's SQLModel.metadata-driven table discovery now picks it up automatically since `ba155b7`). tests/conftest.py disables DECNET_PTR_ENABLED globally for the same reason it disables DECNET_GEOIP_ENABLED — unit tests must never hit the network. tests/geoip/test_ptr.py re-enables explicitly via an autouse fixture.	2026-04-24 17:26:40 -04:00
anti	2bcef50ac5	feat(webhooks): circuit breaker auto-disables misbehaving subscriptions After DECNET_WEBHOOK_CIRCUIT_THRESHOLD (default 5) consecutive failed deliveries, the worker calls trip_webhook_circuit(uuid, ts) which flips enabled=False and stamps auto_disabled_at. The worker sets its reload flag so the next dispatch epoch stops consuming events for the tripped sub entirely — one dead receiver can't poison the shared egress pool anymore. Operator clears the trip via PATCH — setting enabled=True when the sub was previously disabled clears auto_disabled_at, zeros consecutive_failures, and clears last_error. Admin-pause → re-enable hits the same path harmlessly. Three observable states now distinguishable in the UI: - Active enabled=True, auto_disabled_at=NULL - Admin-paused enabled=False, auto_disabled_at=NULL - Tripped enabled=False, auto_disabled_at=<ts> UI surfaces a TRIPPED · <ts> chip on the row (red, alert-styled) and a "N TRIPPED" count in the page header. Hover tooltip tells the operator how to reset ("Re-enable via Edit"). record_webhook_failure now returns the new consecutive_failures count so the worker can compare against the threshold without a second roundtrip. trip_webhook_circuit is idempotent — re-tripping just re-stamps auto_disabled_at. Closes THREAT_MODEL WH-02 and DEBT-037 §1.	2026-04-24 16:24:33 -04:00
anti	638236113d	feat(webhooks): non-blocking http:// warning + WH-03 accepted risk WebhookResponse now carries a `warnings: list[str]` field. When the subscription's URL starts with http://, an `insecure_url` advisory is surfaced on every GET/CREATE without blocking the request. HMAC still detects tampering regardless of transport — only read-confidentiality is lost over plaintext — and test/dev environments without TLS stay usable. Matches the operator-trust posture already established by DA-06 (admin-on-admin protection is out of scope). The alternative — hard rejection at admin time — was considered and declined; warning-plus- visibility is the right shape. THREAT_MODEL WH-03 accepted risk registered; revisit triggers are multi-admin delegation, a regulated customer, or an operator ticket asking for a DECNET_WEBHOOK_REQUIRE_HTTPS enforcement knob.	2026-04-24 15:53:30 -04:00
anti	b70845a85d	feat(webhooks): subscription CRUD + HMAC-signed delivery client Introduces the webhook egress foundation — a new WebhookSubscription table, admin-gated CRUD under /api/v1/webhooks, and the shared delivery client that both the test-ping route and the upcoming worker will use. No worker yet; this commit is API + model + client only. Simple-mode enum (AttackerDetail / DeckyStatus / SystemStatus) expands to bus-topic patterns at the router layer; storage is always the raw pattern list. Advanced mode lets admins supply raw NATS-style patterns directly. Filter-at-subscribe: the worker (next commit) will subscribe to the union of patterns across enabled subscriptions. Delivery client handles HMAC-SHA256 signing (X-DECNET-Signature), retry on 429/5xx/network errors with jittered backoff, no-retry on 4xx. Secrets never leave the server on GET/LIST — only the create response carries the secret for copy-out. CRUD routes publish WEBHOOK_SUBSCRIPTIONS_CHANGED on the bus after every mutation so the (future) worker can hot-reload. Opens DEBT-037 for the deferred items (circuit breaker, dead-letter, batch delivery, payload templates, secret-at-rest).	2026-04-24 15:30:05 -04:00
anti	df84981954	feat(api): pin response_model on dict-returning mutation routes Every mutation route that returned an untyped dict now declares response_model at the decorator. MessageResponse covers the eight {"message": ...} envelopes (change-password, mutate-decky, mutate- interval, update-deployment-limit, update-global-mutation-interval, delete-user, update-user-role, reset-user-password). Purpose-built models cover the richer shapes (DeployResponse for /deckies/deploy, PurgeResponse for /config/reinit, ReapReportResponse for /reap-orphans, UserResponse for /config/users). 204-No-Content and Response/ ORJSONResponse routes stay as-is. The wire shape for clients is unchanged — the envelopes already only shipped a message field. What changes is that a handler which accidentally returns a richer dict (e.g. a full user row including password_hash) would be silently stripped to the declared fields at serialization time. Also flips F4/D "expensive LIKE" to accepted (new DA-09) — the /logs and /attackers search routes LIKE-scan unbounded columns, but both are admin-gated, limit-capped, and operator rate-limit scope per DA-04. FTS5 stays a performance TODO, not a security blocker.	2026-04-24 14:27:58 -04:00
anti	1e7703d64d	refactor(db): name the keystroke-dynamics thresholds + add max_pause_gap Follow-ups on `9232031` per review: - Module-level constants KD_PAUSE_BURST_MAX_S (0.2s), KD_PAUSE_THINK_MAX_S (1.5s), KD_START_OF_ACTION_IDLE_S (2.0s). Docstrings reference them by name; future calibration against real session data only has to touch one place. Threshold for "started a new action" raised from 1s → 2s — 1s catches too much mid-command hesitation to be empirically bimodal. - New column kd_max_pause_gap (seconds). The distracted bucket count alone can't distinguish one 3s pause from three 60s pauses; max-gap carries that signal in one cheap scalar (vs widening the histogram to a fourth bucket). - Scope-framing docstring above the whole kd_* section: intended use is session clustering / tooling attribution, explicitly NOT biometric identity, admission decisions, or ML-driven user ID. Keeps a future well-intentioned contributor from walking the project into legal/ethics territory by accident. - TODO comment on kd_top_bigrams: v1's JSON-in-TEXT is fine for "show the top digraphs on the attacker page". If bigram-similarity queries become hot, promote to a session_bigram_stats(sid, bigram, count, mean_iat_s) table or Postgres JSONB + GIN. Neither changes the write-side ingester materially. No new migration helper — pre-v1 schema additions go through create_all on fresh DBs; the existing _migrate_session_profile_table stays but does not get extended. Alembic lands at v1 and sweeps all the ad-hoc migrations at once.	2026-04-24 10:49:38 -04:00
anti	9232031ec7	feat(db): extend SessionProfile schema with DEBT-036 keystroke features Adds the three signal columns motivated by the manual keystroke analysis in DEBT-036 directly to the SessionProfile table. Pre-v1 so we modify the schema in place — Alembic arrives at v1. Columns: - kd_top_bigrams (TEXT) — JSON of top-N most-common digraphs with mean IAT per bigram. Complements kd_digraph_simhash ("same typist?") with "same typist in same mental state?" (tired / rested / distracted shifts bigram-specific IATs measurably). - kd_start_of_action_latency (REAL/DOUBLE) — median IAT of the first keystroke after an idle gap > 1s. Separates "initiating a command" from "executing a remembered one"; real humans have measurable start-of-action latency, bots don't. - kd_pause_hist_burst / _think / _distracted (INT) — three-bucket histogram (counts, <0.2s / 0.2-1.5s / >1.5s). More discriminating than the existing flat burst_ratio / think_ratio pair: C2 operators concentrate in burst with a thin tail; opportunistic humans have a fat think bucket and a long distracted tail. Both backends get an idempotent ADD COLUMN migration (_migrate_session_profile_table) wired into initialize() alongside the existing _migrate_attackers_table path — guards on PRAGMA table_info (SQLite) / information_schema.COLUMNS (MySQL) so reruns are safe. PII discipline comment on kd_digraph_simhash and kd_top_bigrams: both operate on bigram CHARACTERS, never on raw input stream content. Attacker passwords typed over SSH must not land here. Test updated for the MySQL initialize() migration-order contract.	2026-04-24 10:45:48 -04:00
anti	26d04d5eb8	fix(db): SessionProfile.kd_digraph_simhash must be BINARY(8), not BLOB MySQL can't index a BLOB/TEXT column without a prefix length, so create_all() on a fresh MySQL schema blew up with "BLOB/TEXT column 'kd_digraph_simhash' used in key specification without a key length". SimHashes are a fixed 8 bytes — the variable-length type was a SQLAlchemy-side auto-mapping from 'Optional[bytes]', not an actual schema requirement. Switch to BINARY(8), which is portable: MySQL gets a fixed-width indexable BINARY, SQLite treats it as BLOB and doesn't care about key length.	2026-04-23 22:06:38 -04:00
anti	ffc275f051	feat(geoip): country-code enrichment via RIR delegated-stats Populates Attacker.country_code + country_source (MVP) using the five RIR delegated-stats files (ARIN/RIPE/APNIC/LACNIC/AFRINIC). Offline, license-free, no outbound traffic that could burn honeypot stealth. - decnet.geoip package with factory/base/lookup + rir/ subpackage (fetch/parse/provider) mirroring the db + bus factory convention - Profiler._build_record calls enrich_ip on every upsert - Idempotent ALTER TABLE migrations for both SQLite and MySQL - decnet geoip refresh/lookup CLI (master-only) - /var/lib/decnet/geoip seeded by decnet init - DECNET_GEOIP_ENABLED=false kill-switch; set in tests/conftest.py so unit tests never trigger the first-access fetch	2026-04-23 21:12:38 -04:00
anti	d43303251d	feat(profiler): track SMTP victim domains per attacker New SmtpTarget table records each (attacker, domain) pair observed via the SMTP honeypots. Only the domain is stored — local-parts are dropped at ingestion, so this table holds no user-identifying data beyond the target organisation's identity. The profiler worker extracts domains from rcpt_to / rcpt_denied / message_accepted events, normalizes them (lowercase, strip local-part, drop blocked TLDs), and upserts one row per pair with a running count + first_seen / last_seen. Three repo methods shipped: * increment_smtp_target(attacker, domain) — upsert + bump * list_smtp_targets(attacker) — per-attacker view * smtp_target_seen(domain) — cross-attacker aggregate, shaped as the federation-gossip RPC that V2 will expose. The gossip-query shape is load-bearing: each operator can answer "have any of your attackers targeted corp1.com?" without leaking which attackers or when — the aggregate returns a bool + total count + first/last seen, nothing else.	2026-04-22 22:23:27 -04:00
anti	d47a84c90b	refactor(models): split models.py into topical submodules decnet/web/db/models.py was approaching 1000 lines across User/Log/ Attacker/Swarm/Topology/Workers/Updater/Health domains. Split into a package with one module per domain; __init__.py re-exports every symbol so all 52 call sites keep importing from decnet.web.db.models unchanged.	2026-04-22 21:55:41 -04:00

33 Commits