DECNET

Author	SHA1	Message	Date
anti	f86dc79990	feat(canary): ship Node helper with wheel + install-toolchain CLI The fingerprint canaries' obfuscator shells out to a Node helper that require()s javascript-obfuscator. Without this commit, a fresh pip install decnet would land the .py modules but not the .js helper / package.json, and there'd be no documented way to provision Node side. * pyproject.toml - extend tool.setuptools.package-data to ship canary/_obfuscate_helper.js, canary/fingerprint_payload.js, and canary/package.json with the wheel. * decnet/cli/canary.py - new "decnet canary-install-toolchain" subcommand. Resolves decnet.canary.__file__'s dir, runs npm install --omit=dev there, exits non-zero with a clear message if npm is missing or install fails. Idempotent - safe to call every API service start. * deploy/decnet-api.service.j2 - non-fatal ExecStartPre that calls the new subcommand. Leading '-' so a missing Node toolchain only degrades fingerprint canaries (loud at mint time) without keeping the API from booting. * tests/canary/test_cli.py - registration smoke test, missing-npm exit path, and a mocked-subprocess test asserting the right argv and cwd land on npm. Realism cultivator already has a broad except Exception around cultivate() in scheduler.py:195-211, so a missing toolchain on a host running the realism tick degrades to an inert noise file with no extra plumbing.	2026-04-29 16:53:27 -04:00
anti	907ade9142	feat(realism): wire fingerprint_html/svg through taxonomy + UI The two new fingerprint canary generators existed at the API level since `f64e78f` but weren't visible to the realism engine or the operator-facing dashboard. Threads them through every place that enumerates canary content classes. Backend: * realism/taxonomy.py - two new ContentClass members (CANARY_FINGERPRINT_HTML, CANARY_FINGERPRINT_SVG); enum is wire-visible (synthetic_files.content_class column + bus discrim) so we add at the bottom, never reorder. * canary/cultivator.py - class-to-generator dispatch, kind mapping (both http), and default placement paths (~/Documents/asset_directory.html and network_topology.svg). * realism/naming.py + bodies.py - _name_canary / _body_canary entries. * realism/planner.py - added to _DEFAULT_CANARY_CLASS_WEIGHTS and the _CANARY_CLASSES classification set. Frontend: * decnet_web/src/realism/labels.ts - display labels. * decnet_web/src/components/RealismConfig/RealismConfig.tsx - default canary weight rows so operators see them in the realism config UI. * decnet_web/src/components/SyntheticFiles/SyntheticFiles.tsx - added to the CONTENT_CLASSES allow-list so filter dropdowns show them. Also: re-applied the nosec B404/B603 markers on canary/obfuscator.py; the first commit's pre-commit autoformatter stripped them. Tests: extended tests/realism/test_taxonomy.py's stability assertion to include the two new values. Full canary + realism suites pass (362 / 2 skipped).	2026-04-29 16:44:03 -04:00
anti	de6d5cd1a8	fix(canary): include fingerprint_* in KNOWN_GENERATORS stability test	2026-04-29 16:26:09 -04:00
anti	dd807bc55e	feat(canary): worker decodes ?d=/?o=/?s=&i=&n=&d= fingerprint params The fingerprint payload beacons fingerprint data as base64url JSON in GET query params: ?o=1 for the bare-open beacon, ?d=<blob> for a single-shot dump, or ?s/i/n/d=<chunk> for chunked dumps. Until now those params were buried inside request_path; consumers had to parse the URL themselves. Worker now extracts them in _extract_fingerprint and merges into raw_headers under reserved _fp* keys: * _fp_open — bare-open marker * _fp — decoded fingerprint dict (single-shot path) * _fp_sid/idx/total/chunk — chunked metadata + raw base64 (reassembly is a downstream concern, not the worker's job) * _fp_decode_error / _fp_oversize — failure markers for trash dumps Per-chunk size capped at 8KB so an attacker spamming /c/<known_slug> can't inflate trigger rows indefinitely. Decode failures degrade gracefully — the trigger row still records the hit, just with a _fp_decode_error flag instead of structured fingerprint data. Tests cover the single-shot decode, bare-open flag, chunked metadata, malformed input, and oversize drop paths.	2026-04-29 16:25:17 -04:00
anti	f64e78f78c	feat(canary): fingerprint_html + fingerprint_svg generators Two new synthesised-artifact generators that bake the obfuscated fingerprint payload into plausible-looking decoy files: * fingerprint_html — a mundane "Internal Asset Directory" page with a small table of fake hosts; the obfuscated payload is inlined at the bottom of <body>. Visible content (row pool slice, sync timestamp) also varies per mint via SHA-256-derived stable ints, so two extracted canaries don't diff to zero even on the rendered surface. * fingerprint_svg — standalone SVG with an embedded <script> CDATA block. SVG <script> only fires for top-level loads / <object> / <iframe>; <img>-referenced renders are safely inert. Both derive the mint UUID via uuid.uuid5 from the callback token, so re-mints are byte-identical (preserving the generator determinism contract) AND the same token produces the same mint UUID across HTML and SVG variants — the worker can correlate beacons across artifact shapes. Wired into the factory + KNOWN_GENERATORS, default placement paths under ~/Documents/asset_directory.html and ~/Documents/network_topology.svg for both linux and windows personas. Tests cover determinism, per-token divergence, structural validity (DOCTYPE/SVG headers), and that the beacon URL stays inside the obfuscated string array (not in plaintext). The two new entries skip in test_generators.py when Node toolchain is absent so bare CI checkouts still pass.	2026-04-29 16:22:18 -04:00
anti	12cd7ad9cb	feat(canary): per-mint JS obfuscator wrapper + fingerprint payload Adds the load-bearing primitives for obfuscated browser-fingerprinting canaries. Step 3 (HTML/SVG generators) and step 4 (worker-side fingerprint ingestion) build on top of these. * decnet/canary/obfuscator.py - javascript-obfuscator wrapper. Seed and polymorphic config bits both derive from the callback token, so output is byte-identical for the same mint (preserving the generator determinism contract from base.py) and structurally distinct across mints. * decnet/canary/fingerprint_payload.js - port of canary-self-test.html with the rendering UI stripped. Two placeholders (BEACON_URL, MINT_UUID) substituted before obfuscation. MVP beacon strategy: bare-open GET pixel first, then base64url-encoded fingerprint as query params on subsequent GETs (chunked above ~6KB) so the existing worker records hits before step-4 lands. * decnet/canary/_obfuscate_helper.js - Node subprocess helper that reads code+options JSON from stdin and writes obfuscated JS to stdout. Vendored javascript-obfuscator under decnet/canary/. * tests/canary/test_obfuscator.py - determinism, per-mint divergence, template substitution, Node syntax check, error path.	2026-04-29 16:16:37 -04:00
anti	eefab020d4	fix(swarm): propagate service mutations to worker agent via shard re-dispatch Add/remove/update_config on a fleet decky living on a swarm worker — and on an agent-pinned topology — used to run the master's local docker-compose only, which has no containers for the remote decky. The mutation persisted on master and silently no-op'd on the worker. - Fleet swarm: lookup DeckyShard.host_uuid; if found, rebuild a single-host shard from master state and call dispatch_decnet_config — same proven path as POST /swarm/deploy. Skip local _compose (no containers to touch). - Topology agent-pinned: call decnet.engine.deployer.resync_agent_topology (existing helper) to push the latest hydrated blob to the worker. - Local-only deckies: behaviour unchanged. - Tests: 5 new in tests/engine/test_services_live_swarm.py covering all three mutations on a swarm fleet decky (no local _compose, dispatch fires with the right host's deckies), plus apply=False save-only path (no dispatch), plus regression that local-only fleet add still runs local compose. Bus signal `decky.{name}.service_config_changed` keeps publishing as an audit trail; it is not the propagation trigger.	2026-04-29 12:51:16 -04:00
anti	94b06ee862	feat(services): initial config on ADD SERVICE — schema modal in DeckyCard, MazeNET drag, and Inspector - DeckyServiceAddRequest gains an optional `config: dict` field, validated against the service's config_schema before any state mutation (400 on bad type, no half-written rows). - Engine: add_service threads `config` into _add_topology_service / _add_fleet_service, persisting validated cfg to decky_config.service_config BEFORE compose regen so the first `up -d --build` materialises the env on the new container. No follow-up apply needed. - Frontend: shared AddServiceConfigModal — same wizard accordion shape, used by: * DeckyCard's ADD SERVICE picker (Fleet & MazeNET inspectors via shared component) * MazeNET Inspector's ADD SERVICE picker * MazeNET palette drag-drop onto a deployed decky Empty-schema services short-circuit to a one-click add (no modal flash). Operator can cancel; errors surface in the modal. - Tests: add_service config plumbing — persist, drop unknown keys, 400-equivalent on bad types, back-compat empty-config. - Drive-by: fix stale repo-method names in test_services_live.py (create_topology_decky → add_topology_decky, get_topology_decky → list+pick helper, service.added → service_added topic).	2026-04-29 12:44:47 -04:00
anti	77ceb9d6f3	feat(services): config schemas for the rest of the registry + textarea base64 transport - Declarative config_schema on RDP, Telnet, MySQL, Redis, SMTP, SMTP_Relay matching the keys each service already reads at compose time. - TODO marker on the 19 services that accept service_cfg but never read it, so future contributors know where to plug schemas in. - Wizard base64-wraps all textarea values at INI emit (DeckyFleet buildIni); validate_cfg detects the b64: sentinel and decodes back to UTF-8. Plain raw strings still pass through for direct API submitters. - HTTPS image entrypoint accepts PEM content or path in TLS_CERT/TLS_KEY: detects a BEGIN header, writes content to /opt/tls/, and re-exports the on-disk path so server.py keeps reading paths. - Tests cover schema/compose alignment for each new service plus textarea base64 round-trip (incl. UTF-8) and HTTPS PEM end-to-end.	2026-04-29 12:23:56 -04:00
anti	8d3f5c646a	fix(network): accept CAP_NET_ADMIN in lieu of euid==0 for macvlan setup The systemd unit grants AmbientCapabilities=CAP_NET_ADMIN so the API service can program host-side macvlan/ipvlan interfaces without running as root, but setup_host_macvlan/_ipvlan rejected with euid!=0 before even trying — making web-driven 'decnet deploy' impossible under the privilege model the unit advertises. Replace _require_root with _require_net_admin, which reads CapEff from /proc/self/status and accepts the cap (bit 12) as well as euid==0. No libcap dep — pure /proc parse.	2026-04-29 11:56:40 -04:00
anti	75b1ce3a31	feat(api): per-service config schema endpoint + PUT/POST update+apply for fleet & topology - GET /topologies/services/{name}/schema serves the declared ServiceConfigField metadata so the Inspector can auto-render forms. - PUT /(topologies/{id}/)deckies/{decky}/services/{svc}/config persists the validated dict (DB + compose); container untouched (Save). - POST /(topologies/{id}/)deckies/{decky}/services/{svc}/apply persists then force-recreates <decky>-<svc> so the new env takes effect (Apply, destructive). - New engine helper update_service_config wires both fleet and topology paths through the existing _persist_fleet_change / _rerender_topology_compose machinery; emits decky.<name>.service_config_changed on the bus.	2026-04-29 11:38:06 -04:00
anti	54b1fbed14	feat(services): declarative config_schema on BaseService + SSH/HTTP/HTTPS descriptors ServiceConfigField dataclass + BaseService.validate_cfg coerce/drop submitted service_cfg dicts against per-service typed schemas. SSH/HTTP/HTTPS now declare the keys they already read in compose_fragment, so the upcoming Inspector form has metadata to render from instead of hardcoded inputs per service.	2026-04-29 11:28:53 -04:00
anti	d314470d7f	fix(stats): keep TopologyDecky.state in sync with docker so ACTIVE DECKIES counts right Dashboard's ACTIVE DECKIES (active_deckies in get_stats_summary) counts TopologyDecky rows where state='running'. No code path was flipping that state away from the default 'pending', so the count read 0/N even when every container was running fine — the dashboard was lying. Two complementary fixes: 1. deploy_topology — after the post-deploy compose ps verification, reconcile each TopologyDecky.state from the corresponding base container's docker state. running → 'running'; anything else → 'failed'. Reuses the ps_rows already gathered for the ACTIVE-vs-DEGRADED status decision; no extra docker hit. 2. apply_add_decky — _materialise_decky_spawn now returns True/False; on True the row is updated to state='running' before _assert_valid_after. Catches the case where a decky added via the live mutator queue stays at 'pending' indefinitely (the deployer's reconcile only runs on a fresh deploy_topology pass). Existing topology deckies in active topologies will still read as 'pending' until the next deploy_topology runs, since this is forward-only. An operator-side fix is to teardown + redeploy or run the (forthcoming) reconcile-on-startup pass.	2026-04-29 11:09:32 -04:00
anti	57e527534c	fix(mutator): auto-fall-back to legacy builder when buildx wedges live decky add apply_add_decky's compose-up was hard-failing whenever the operator's ~/.docker/buildx/activity/ landed on a read-only mount — the wedge detection in _compose_with_retry correctly refuses to retry (would just leak more mounts), but for live materialisation we don't want a wedged buildx state to abort an admin's mutation. ANTI hit it on adding decky-a977: 'failed to update builder last activity time: ... read-only file system → buildx wedge detected → returned non-zero'. _compose_up_with_buildkit_fallback wraps _compose_with_retry: on a CalledProcessError whose stderr matches both wedge signatures (_BUILDX_WEDGE_SIGNATURE + _BUILDX_EROFS_SIGNATURE), it logs a warning with the manual recovery steps + retries once with DOCKER_BUILDKIT=0 set. The legacy non-buildx builder doesn't use the activity dir and isn't affected. Wired into the two paths that pass --build: * _materialise_decky_spawn (apply_add_decky) * _materialise_decky_services_diff (apply_update_decky service add) _materialise_decky_recreate_base doesn't build — it just recreates a container from an existing image — so it's not affected. Operator-facing log message points at the manual fix (rm -rf ~/.docker/buildx/activity + docker buildx create) so they can recover at their leisure; we don't ATTEMPT the recovery because the activity dir might be RO for a reason (zfs/btrfs snapshot, etc.) that an automated rm would be wrong to fight.	2026-04-29 10:59:04 -04:00
anti	892219ec87	feat(mutator): refuse forwards_l3 promotion on non-DMZ deckies apply_update_decky's flip path now refuses to promote a decky to gateway unless its home LAN is a DMZ. The compose generator publishes host ports for forwards_l3=True; a non-DMZ gateway would shadow the host's port space without anything legitimately able to reach the service. Same posture as the existing 'forwards_l3 flip on live requires force=true' guard — refused before any DB write so a bad mutation leaves zero side-effects. The check is intentionally NOT a standing _RULES invariant — the codebase uses forwards_l3 for two semantics: 1. Generic L3 forwarding (internal bridge deckies routing between their multi-home LANs). The generator writes this on internal bridges via bridge_forward_probability; legitimately non-DMZ. 2. DMZ gateway (host-port publisher). Only meaningful on DMZ. Standing validation can't enforce DMZ-homing without breaking case 1. The guard fires only on the explicit user-driven flip path where the operator's intent is unambiguously case 2. Generator output and internal-bridge attachments bypass the check. check_gateway_homed_in_dmz lives in validate.py for callers that want the explicit form (and for the test surface), but is not a standing rule — comment in _RULES explains the asymmetry.	2026-04-29 00:38:51 -04:00
anti	a27e3f5e0f	fix(tests+mutator): unbreak the docker-shadow test env + let mutator delete from active Two related fixes that came out of running the W5 tests locally: 1. tests/__init__.py — empty file, makes 'tests/' a package so pytest stops inserting it into sys.path. Without it, 'tests/docker/' (the docker-image test category) shadowed the installed docker SDK on every engine-touching test in the repo: module 'docker' has no attribute 'DockerClient' Pytest's default --import-mode=prepend was the culprit; making tests/ a package is the cheapest fix and doesn't change --import-mode for the whole tree. 2. delete_topology_decky / delete_topology_edge / delete_lan grow an 'enforce_pending: bool = True' kwarg. Default preserves the HTTP CRUD guard (api_decky_crud / api_edge_crud / api_lan_crud get the 409 for free). apply_remove_decky / apply_detach_decky / apply_remove_lan now pass enforce_pending=False — the mutator queue is the live-editing surface and has its own active-topology gating; the repo's pending-only guard was for design-time CRUD that mustn't bypass it. Without this, apply_remove_decky was silently broken on active topologies pre-W5; W5's new test surfaced it on first run. 10/10 new W5 tests pass; 58/58 across mutator + topology suites.	2026-04-29 00:24:17 -04:00
anti	98c929894c	feat(mutator): selective materialisation for apply_update_decky + tests apply_update_decky now discriminates three sub-cases: * services list changed → diff old vs new and call _materialise_decky_services_diff (compose up -d for added, stop + rm -f for removed). Mirrors services_live's pattern but doesn't import it — mutator-routed mutations carry a different bus surface (mutation.applied) than the direct API path (decky.<name>.service_added). * forwards_l3 flipped → port publishing changes, which docker can only apply at container-create time. Gated on payload['force'] is true; default raises MutationError so a half-thinking operator can't stomp a live decky. When force=true, _materialise_decky_recreate_base does compose up -d --no-deps --force-recreate. Pre-checked BEFORE the DB write so a refused mutation leaves zero side-effects. * coord-only (x/y) → DB only, no docker work. Ships tests/mutator/test_ops_materialisation.py with focused coverage for every new helper: add_decky/remove_decky/attach_decky/ detach_decky/update_decky/update_lan paths against an active topology, with compose primitives + docker SDK mocked at the source modules so the helpers' lazy imports pick up the stubs. Also covers the pending-topology skip and the force-flag gating.	2026-04-29 00:18:20 -04:00
anti	6ac8cac908	feat(deckies): live service add/remove without full redeploy decnet.engine.services_live exposes add_service / remove_service for both fleet and topology decky scopes. The host's _compose() wrapper already supported per-service targeting (up --no-deps -d <svc>, stop, rm -f); what was missing was the orchestration around it: * add: validate against decnet.services.registry (rejects unknown + fleet_singleton); persist the new services list; re-render the per-scope compose file (so future redeploys reflect the change); run docker compose up -d --no-deps --build <decky>-<svc>. * remove: stop + rm -f the service container; persist; re-render compose so a future up -d doesn't bring it back. Both publish decky.<name>.service.added / .removed on the bus, with the post-mutation services list. Topic constants added to decnet.bus.topics; the matching wiki entry in wiki-checkout/Service-Bus.md ships in a separate commit on the wiki repo (wiki-checkout/ is gitignored). Four new admin endpoints: * POST/DELETE /api/v1/deckies/{name}/services{,/svc} * POST/DELETE /api/v1/topologies/{id}/deckies/{name}/services{,/svc} ServiceMutationError messages are mapped at the API boundary to 404 (decky/topology missing), 409 (idempotency violation), 422 (unknown or fleet_singleton service).	2026-04-28 22:51:42 -04:00
anti	0bc4b05c73	feat(deckies): generic file drops on fleet + MazeNET deckies Extracts the docker-exec-with-base64-stdin pattern out of canary/planter and orchestrator/drivers/ssh into a shared decnet.decky_io package. Both consumers now delegate; the canary planter test still proves the contract end-to-end. Adds POST/DELETE /api/v1/deckies/files for arbitrary file drops. Container resolution is shared with the canary path: topology_id absent means fleet (<name>-ssh), present routes through resolve_decky_container which picks <name>-ssh when the topology decky exposes ssh, else the topology base container decnet_t_<id8>_<name>. Path validation rejects relative paths and '..' traversal at the request model layer. Bad base64 → 400; unknown topology → 404; decky not in topology → 422; docker exec failure → 409.	2026-04-28 22:43:34 -04:00
anti	3fe999d706	feat(canary): allow custom canaries on MazeNET deckies via API POST /api/v1/canary/tokens grows an optional topology_id field. When present, the server hydrates the topology, validates the named decky is in it, and resolves the docker container via planter.resolve_topology_container — <name>-ssh if the decky exposes ssh, else the topology base container. Absent ⇒ fleet semantics, unchanged. The token row gets a nullable topology_id column (no migration helper per pre-v1 policy). GET /api/v1/canary/tokens accepts ?topology_id= as a filter. DELETE re-resolves the container at revoke time so a redeployed topology is still reachable. 422 when the named decky isn't in the topology; 404 when the topology itself doesn't exist.	2026-04-28 22:34:45 -04:00
anti	5802de1f86	feat(canary): seed baseline canaries on MazeNET deckies Topology deploys now plant the configured canary baseline set on every decky in the topology, mirroring the fleet-deploy hook. Containers are resolved via resolve_topology_container — <decky>-ssh when the decky exposes an ssh service, else the topology base container decnet_t_<id8>_<decky>. The planter's plant/revoke/seed_baseline grow an optional container= kwarg; default preserves the fleet <name>-ssh resolution.	2026-04-28 22:30:11 -04:00
anti	e3ddeb0395	feat(bounty): surface file drops and stored mail in the Vault The Bounty Vault page only read from the Bounty table, but inotifywait-captured file drops (event_type=file_captured) and SMTP quarantined messages (event_type=message_stored) were only landing in the Logs table. AttackerDetail's tabs queried logs directly, so they showed up per-attacker but were invisible on the global Vault page. Mirror both events into Bounty as bounty_type=artifact with payload.kind ∈ {file, mail} so the existing dedup (bounty_type, attacker_ip, payload) collapses repeats by sha256. Add an ARTIFACTS segment to the Vault filter row, plus dedicated render branches: file drops show orig_path + size + writer attribution; mail shows subject + From + attachment count + size, with the Mail icon distinguishing them from FileText for file drops. Forward-only — existing logs stay where they are. A backfill pass would be straightforward (read Log WHERE event_type IN ('file_captured', 'message_stored') and feed each row through _extract_bounty) but is out of scope here.	2026-04-28 19:42:54 -04:00
anti	88f276e9e7	feat(collector): drop native unix daemon syslog from ingestion sshd, pam_unix, sudo, CRON, systemd, kernel, rsyslogd, and dbus-daemon all share the SSH/telnet decky containers and write to the same syslog socket as DECNET's own emitters. Their output was being parsed and ingested into the JSON stream, the dashboard, and the profiler — pure noise: sshd's "Failed password for root from X" duplicates the auth-helper's structured auth_attempt event, pam_unix repeats it again, CRON/systemd say nothing about attacker behavior. Drop these APP-NAMEs in _should_ingest before the JSON write and bus publish. Raw .log file still captures everything for forensics. The denylist is overridable with DECNET_COLLECTOR_DROP_APPS so operators can extend it without code changes.	2026-04-28 19:21:39 -04:00
anti	d4591b38dc	fix(profiler): aggregate bash PROMPT_COMMAND lines into attacker profile SSH/telnet decky containers emit shell commands via `logger -t bash "CMD …"` which produces RFC 5424 lines with MSGID=NIL. Both parsers were leaving event_type="-", so the behavioral profiler's `_COMMAND_EVENT_TYPES` filter silently dropped them — the IP profile existed but no command transcripts or artifacts. Confirmed in the wild: 44/48 events from one attacker were event_type="-". Rewrite event_type to "command" in both parsers when MSGID=NIL and the msg starts with "CMD ". Correlation parser also extracts the cmd= payload into fields["command"] so the profiler can build the transcript; collector parser leaves fields={} to avoid duplicate pills in the dashboard.	2026-04-28 19:09:41 -04:00
anti	862e4dbb31	merge: testing → main (reconcile 2-week divergence)	2026-04-28 18:36:00 -04:00
anti	3b4b0a1016	merge: resolve conflicts between testing and main (remove tracked settings, fix pyproject deps)	2026-04-13 07:48:37 -04:00
anti	f2cc585d72	fix: align tests with model validation and API error reporting	2026-04-13 01:43:52 -04:00
anti	03f5a7826f	Fix: resolved sqlite concurrency errors (table users already exists) by moving DDL to explicit async initialize() and implementing lazy singleton dependency.	2026-04-12 08:01:21 -04:00
anti	b2e4706a14	Refactor: implemented Repository Factory and Async Mutator Engine. Decoupled storage logic and enforced Dependency Injection across CLI and Web API. Updated documentation. Some checks failed CI / Lint (ruff) (push) Successful in 12s Details CI / SAST (bandit) (push) Successful in 13s Details CI / Dependency audit (pip-audit) (push) Successful in 22s Details CI / Test (Standard) (3.11) (push) Failing after 54s Details CI / Test (Standard) (3.12) (push) Successful in 1m35s Details CI / Test (Live) (3.11) (push) Has been skipped Details CI / Test (Fuzz) (3.11) (push) Has been skipped Details CI / Merge dev → testing (push) Has been skipped Details CI / Prepare Merge to Main (push) Has been skipped Details CI / Finalize Merge to Main (push) Has been skipped Details	2026-04-12 07:48:17 -04:00
anti	4064e19af1	merge: resolve conflicts between testing and main Some checks failed PR Gate / Lint (ruff) (pull_request) Failing after 11s Details PR Gate / Test (pytest) (3.11) (pull_request) Failing after 10s Details PR Gate / Test (pytest) (3.12) (pull_request) Failing after 10s Details PR Gate / SAST (bandit) (pull_request) Successful in 12s Details PR Gate / Dependency audit (pip-audit) (pull_request) Failing after 13s Details	2026-04-12 04:09:17 -04:00
anti	0f63820ee6	chore: fix unused imports in tests and update development roadmap Some checks failed CI / Lint (ruff) (push) Successful in 16s Details CI / Test (pytest) (3.11) (push) Failing after 34s Details CI / Test (pytest) (3.12) (push) Failing after 36s Details CI / SAST (bandit) (push) Successful in 12s Details CI / Merge dev → testing (push) Has been cancelled Details CI / Open PR to main (push) Has been cancelled Details CI / Dependency audit (pip-audit) (push) Has been cancelled Details	2026-04-12 03:46:23 -04:00
anti	ff38d58508	Testing: Stabilized test suite and achieved 93% total coverage. - Fixed CLI tests by patching local imports at source (psutil, os, Path). - Fixed Collector tests by globalizing docker.from_env mock. - Stabilized SSE stream tests via AsyncMock and immediate generator termination to prevent hangs. - Achieved >80% coverage on CLI (84%), Collector (97%), and DB Repository (100%). - Implemented SMTP Relay service tests (100%).	2026-04-12 03:30:06 -04:00
anti	f78104e1c8	fix: resolve all ruff lint errors and SQLite UNIQUE constraint issue Ruff fixes (20 errors → 0): - F401: Remove unused imports (DeckyConfig, random_hostname, IniConfig, COMPOSE_FILE, sys, patch) across cli.py, mutator/engine.py, templates/ftp, templates/rdp, test_mysql.py, test_postgres.py - F541: Remove extraneous f-prefixes on strings with no placeholders in templates/imap, test_ftp_live, test_http_live - E741: Rename ambiguous variable 'l' to descriptive names (line, entry, part) across conftest.py, test_ftp_live, test_http_live, test_mongodb_live, test_pop3, test_ssh SQLite fix: - Change _initialize_sync() admin seeding from SELECT-then-INSERT to INSERT OR IGNORE, preventing IntegrityError when admin user already exists from a previous run	2026-04-12 02:17:50 -04:00
anti	662a5e43e8	feat(tests): add live subprocess integration test suite for services Spins up each service's server.py in a real subprocess via a free ephemeral port (PORT env var), connects with real protocol clients, and asserts both correct protocol behavior and RFC 5424 log output. - 44 live tests across 10 services: http, ftp, smtp, redis, mqtt, mysql, postgres, mongodb, pop3, imap - Shared conftest.py: _ServiceProcess (bg reader thread + queue), free_port, live_service fixture, assert_rfc5424 helper - PORT env var added to all 10 targeted server.py templates - New pytest marker `live`; excluded from default addopts run - requirements-live-tests.txt: flask, twisted + protocol clients	2026-04-12 01:34:16 -04:00
anti	d63e396410	fix(protocols): guard against zero/malformed length fields in binary protocol parsers MongoDB had the same infinite-loop bug as MSSQL (msg_len=0 → buffer never shrinks in while loop). Postgres, MySQL, and MQTT had related length-field issues (stuck state, resource exhaustion, overlong remaining-length). Also fixes an existing MongoDB _op_reply struct.pack format bug (extra 'q' specifier caused struct.error on any OP_QUERY response). Adds 53 regression + protocol boundary tests across MSSQL, MongoDB, Postgres, MySQL, and MQTT, including a _run_with_timeout threading harness to catch infinite loops and @pytest.mark.fuzz hypothesis tests for each.	2026-04-12 01:01:13 -04:00
anti	65d585569b	fix(telnet): replace Cowrie with real busybox telnetd + rsyslog logging Cowrie was exposing an SSH daemon on port 22 alongside the telnet service even when COWRIE_SSH_ENABLED=false, contaminating deployments that did not request an SSH service. New implementation mirrors the SSH service pattern: - busybox telnetd in foreground mode on port 23 - /bin/login for real PAM authentication (brute-force attempts logged) - rsyslog RFC 5424 bridge piped to stdout for Docker log capture - Configurable root password and hostname via env vars - No Cowrie dependency	2026-04-12 00:34:45 -04:00
anti	c384a3103a	refactor: separate engine, collector, mutator, and fleet into independent subpackages - decnet/engine/ — container lifecycle (deploy, teardown, status); _kill_api removed - decnet/collector/ — Docker log streaming (moved from web/collector.py) - decnet/mutator/ — mutation engine (no longer imports from cli or duplicates deployer code) - decnet/fleet.py — shared decky-building logic extracted from cli.py Cross-contamination eliminated: - web router no longer imports from decnet.cli - mutator no longer imports from decnet.cli - cli no longer imports from decnet.web - _kill_api() moved to cli (process management, not engine concern) - _compose_with_retry duplicate removed from mutator	2026-04-12 00:26:22 -04:00
anti	c79f96f321	refactor(ssh): consolidate real_ssh into ssh, remove duplication real_ssh was a separate service name pointing to the same template and behaviour as ssh. Merged them: ssh is now the single real-OpenSSH service. - Rename templates/real_ssh/ → templates/ssh/ - Remove decnet/services/real_ssh.py - Deaddeck archetype updated: services=["ssh"] - Merge test_real_ssh.py into test_ssh.py (includes deaddeck + logging tests) - Drop decnet.services.real_ssh from test_build module list	2026-04-11 19:51:41 -04:00
anti	a6063efbb9	fix(collector): daemonize background subprocesses with start_new_session Collector and mutator watcher subprocesses were spawned without start_new_session=True, leaving them in the parent's process group. SIGHUP (sent when the controlling terminal closes) killed both processes silently — stdout/stderr were DEVNULL so the crash was invisible. Also update test_services and test_composer to reflect the ssh plugin no longer using Cowrie env vars (replaced with SSH_ROOT_PASSWORD / SSH_HOSTNAME matching the real_ssh plugin).	2026-04-11 19:36:46 -04:00
anti	d4ac53c0c9	feat(ssh): replace Cowrie with real OpenSSH + rsyslog logging pipeline Scraps the Cowrie emulation layer. The real_ssh template now runs a genuine sshd backed by a three-layer logging stack forwarded to stdout as RFC 5424 for the DECNET collector: auth,authpriv.* → rsyslogd → named pipe → stdout (logins/failures) user.* → rsyslogd → named pipe → stdout (PROMPT_COMMAND cmds) sudo syslog=auth → rsyslogd → named pipe → stdout (privilege escalation) sudo logfile → /var/log/sudo.log (local backup with I/O) The ssh.py service plugin now points to templates/real_ssh and drops all COWRIE_* / NODE_NAME env vars, sharing the same compose fragment shape as real_ssh.py.	2026-04-11 19:12:54 -04:00
anti	babad5ce65	refactor(collector): use state file for container detection, drop label heuristics _load_service_container_names() reads decnet-state.json and builds the exact set of expected container names ({decky}-{service}). is_service_container() and is_service_event() do a direct set lookup — no regex, no label inspection, no heuristics.	2026-04-11 03:58:52 -04:00
anti	7abae5571a	fix(collector): fix container detection and auto-start on deploy Two bugs caused the log file to never be written: 1. is_service_container() used regex '^decky-\d+-\w' which only matched the old decky-01-smtp naming style. Actual containers are named omega-decky-smtp, relay-decky-smtp, etc. Fixed by using Docker Compose labels instead: com.docker.compose.project=decnet + non-empty depends_on discriminates service containers from base (sleep infinity) containers reliably regardless of decky naming convention. Added is_service_event() for the Docker events path. 2. The collector was only started when --api was used. Added a 'collect' CLI subcommand (decnet collect --log-file <path>) and wired it into deploy as an auto-started background process when --api is not in use. Default log path: /var/log/decnet/decnet.log	2026-04-11 03:56:53 -04:00
anti	c7713c6228	feat(imap,pop3): full IMAP4rev1 + POP3 bait mailbox implementation IMAP: extended to full IMAP4rev1 — 10 bait emails (AWS keys, DB creds, tokens, VPN config, root pw etc.), LIST/LSUB/STATUS/FETCH/UID FETCH/ SEARCH/CLOSE/NOOP, proper SELECT untagged responses (EXISTS, UIDNEXT, FLAGS, PERMANENTFLAGS), CAPABILITY with IDLE/LITERAL+/AUTH=PLAIN. FETCH correctly handles sequence sets (1:, 1:3, ), item dispatch (FLAGS, ENVELOPE, BODY[], RFC822, RFC822.SIZE), and places body literals last per RFC 3501. POP3: extended with same 10 bait emails, fixed banner env var key (POP3_BANNER not IMAP_BANNER), CAPA fully populated (TOP/UIDL/USER/ RESP-CODES/SASL), TOP (headers + N body lines), UIDL (msg-N format), DELE/RSET with _deleted set tracking, NOOP. _active_messages() helper excludes DELE'd messages from STAT/LIST/UIDL. Both: DEBT-026 stub added (_EMAIL_SEED_PATH env var, documented in DEBT.md for next-session JSON seed file wiring). Tests: test_imap.py expanded to 27 cases, test_pop3.py to 22 cases — 860 total tests passing.	2026-04-11 03:12:32 -04:00
anti	1196363d0b	feat(os_fingerprint): Phase 2 — add icmp_ratelimit + icmp_ratemask sysctls Windows: both 0 (no ICMP rate limiting — matches real Windows behavior) Linux: 1000ms / mask 6168 (kernel defaults) BSD: 250ms / mask 6168 (FreeBSD default is faster than Linux) Embedded/Cisco: both 0 (most firmware doesn't rate-limit ICMP) These affect nmap's IE and U1 probe groups which measure ICMP error response timing to closed UDP ports. Windows responds to all probes instantly while Linux throttles to ~1/sec. Tests: 10 new cases (5 per sysctl). Suite: 822 passed.	2026-04-10 16:41:23 -04:00
anti	6df2c9ccbf	revert(os_fingerprint): undo ip_no_pmtu_disc=1 for windows — was incorrect ip_no_pmtu_disc controls PMTU discovery for UDP/ICMP paths only. TI=Z originates from ip_select_ident() in the kernel TCP stack setting IP ID=0 for DF=1 TCP packets — a namespace-scoped sysctl cannot change this. The previous commit was based on incorrect root-cause analysis.	2026-04-10 16:29:44 -04:00
anti	b1f6c3b84a	fix(os_fingerprint): set ip_no_pmtu_disc=1 for windows to eliminate TI=Z When ip_no_pmtu_disc=0 the Linux kernel sets DF=1 on TCP packets and uses IP ID=0 (RFC 6864). nmap's TI=Z fingerprint has no Windows match in its DB, causing 91% confidence guesses of 'Linux 2.4/2.6 embedded' regardless of TTL being 128. Setting ip_no_pmtu_disc=1 allows non-zero IP ID generation. Trade-off: DF bit is not set on outgoing packets (slightly wrong for Windows) but TI=Z is far more damaging to the spoof than losing DF accuracy.	2026-04-10 16:19:32 -04:00
anti	5e83c9e48d	feat(os_fingerprint): Phase 1 — extend OS sysctls with 6 new fingerprint knobs Add tcp_timestamps, tcp_window_scaling, tcp_sack, tcp_ecn, ip_no_pmtu_disc, and tcp_fin_timeout to every OS profile in OS_SYSCTLS. All 6 are network-namespace-scoped and safe to set per-container without --privileged. They directly influence nmap's OPS, WIN, ECN, and T2-T6 probe groups, making OS family detection significantly more convincing. Key changes: - tcp_timestamps=0 for windows/embedded/cisco (strongest Windows discriminator) - tcp_ecn=2 for linux (ECN offer), 0 for all others - tcp_sack=0 / tcp_window_scaling=0 for embedded/cisco - ip_no_pmtu_disc=1 for embedded/cisco (DF bit ICMP behaviour) - Expose _REQUIRED_SYSCTLS frozenset for completeness assertions Tests: 88 new test cases across all OS families and composer integration. Total suite: 812 passed.	2026-04-10 16:06:36 -04:00
anti	f583b3d699	fix(services): Resolve protocol realism gaps and update technical debt register - Add dynamic challenge nonces to Postgres, VNC, and SIP. - Add basic keyspace lookup and mock data to Redis. - Correct MSSQL TDS pre-login offset bounds. - Support MongoDB OP_MSG handshake version checking. - Suppress Werkzeug HTTP server headers and normalize FTPAnonymousShell response. - Add tracking for Dynamic Bait Store (DEBT-027) via DEBT.md.	2026-04-10 02:16:42 -04:00
anti	08242a4d84	Implement ICS/SCADA and IMAP Bait features	2026-04-10 01:50:08 -04:00
anti	63fb477e1f	feat: add smtp_relay service; add service_testing/ init - decnet/services/smtp_relay.py: open relay variant of smtp, same template with SMTP_OPEN_RELAY=1 baked into the environment - tests/service_testing/__init__.py: init so pytest discovers the subdirectory	2026-04-10 01:09:15 -04:00

1 2

92 Commits