DECNET

Author	SHA1	Message	Date
anti	1e8ca4cc05	feat(swarm-cli): add `decnet swarm {enroll,list,decommission}` + `deploy --mode swarm` New sub-app talks HTTP to the local swarm controller (127.0.0.1:8770 by default; override with --url or $DECNET_SWARMCTL_URL). - enroll: POSTs /swarm/enroll, prints fingerprint, optionally writes ca.crt/worker.crt/worker.key to --out-dir for scp to the worker. - list: renders enrolled workers as a rich table (with --status filter). - decommission: looks up uuid by --name, confirms, DELETEs. deploy --mode swarm now: 1. fetches enrolled+active workers from the controller, 2. round-robin-assigns host_uuid to each decky, 3. POSTs the sharded DecnetConfig to /swarm/deploy, 4. renders per-worker pass/fail in a results table. Exits non-zero if no workers exist or any worker's dispatch failed.	2026-04-18 19:52:37 -04:00
anti	a6430cac4c	feat(swarm): add `decnet forwarder` CLI to run syslog-over-TLS forwarder The forwarder module existed but had no runner — closes that gap so the worker-side process can actually be launched and runs isolated from the agent (asyncio.run + SIGTERM/SIGINT → stop_event). Guards: refuses to start without a worker cert bundle or a resolvable master host ($DECNET_SWARM_MASTER_HOST or --master-host).	2026-04-18 19:41:37 -04:00
anti	39d2077a3a	feat(swarm): syslog-over-TLS log pipeline (RFC 5425, TCP 6514) Worker-side log_forwarder tails the local RFC 5424 log file and ships each line as an octet-counted frame to the master over mTLS. Offset is persisted in a tiny local SQLite so master outages never cause loss or duplication — reconnect resumes from the exact byte where the previous session left off. Impostor workers (cert not signed by DECNET CA) are rejected at TLS handshake. Master-side log_listener terminates mTLS on 0.0.0.0:6514, validates the client cert, extracts the peer CN as authoritative worker provenance, and appends each frame to the master's ingest log files. Attacker- controlled syslog HOSTNAME field is ignored — the CA-controlled CN is the only source of provenance. 7 tests added covering framing codec, offset persistence across reopens, end-to-end mTLS delivery, crash-resilience (offset survives restart, no duplicate shipping), and impostor-CA rejection. DECNET_SWARM_SYSLOG_PORT / DECNET_SWARM_MASTER_HOST env bindings added.	2026-04-18 19:33:58 -04:00
anti	811136e600	refactor(swarm): one file per endpoint, matching existing router layout Splits the three grouped router files into eight api_<verb>_<resource>.py modules under decnet/web/router/swarm/ to match the convention used by router/fleet/ and router/config/. Shared request/response models live in _schemas.py. Keeps each endpoint easy to locate and modify without stepping on siblings.	2026-04-18 19:23:06 -04:00
anti	63b0a58527	feat(swarm): master-side SWARM controller (swarmctl) + agent CLI Adds decnet/web/swarm_api.py as an independent FastAPI app with routers for host enrollment, deployment dispatch (sharding DecnetConfig across enrolled workers via AgentClient), and active health probing. Runs as its own uvicorn subprocess via 'decnet swarmctl', mirroring the isolation pattern used by 'decnet api'. Also wires up 'decnet agent' CLI entry for the worker side. 29 tests added under tests/swarm/test_swarm_api.py cover enrollment (including bundle generation + duplicate rejection), host CRUD, sharding correctness, non-swarm-mode rejection, teardown, and health probes with a stubbed AgentClient.	2026-04-18 19:18:33 -04:00
anti	cd0057c129	feat(swarm): DeckyConfig.host_uuid + fix agent log/status field refs - decnet.models.DeckyConfig grows an optional 'host_uuid' (the SwarmHost that runs this decky). Defaults to None so legacy unihost state files deserialize unchanged. - decnet.agent.executor: replace non-existent config.name references with config.mode / config.interface in logs and status payload. - tests/swarm/test_state_schema.py covers legacy-dict roundtrip, field default, and swarm-mode assignments.	2026-04-18 19:10:25 -04:00
anti	0c77cdab32	feat(swarm): master AgentClient — mTLS httpx wrapper around worker API decnet.swarm.client exposes: - MasterIdentity / ensure_master_identity(): the master's own CA-signed client bundle, issued once into ~/.decnet/ca/master/. - AgentClient: async-context httpx wrapper that talks to a worker agent over mTLS. health/status/deploy/teardown methods mirror the agent API. SSL context is built from a bare ssl.SSLContext(PROTOCOL_TLS_CLIENT) instead of httpx.create_ssl_context — the latter layers on default-CA and purpose logic that broke private-CA mTLS. Server cert is pinned by CA + chain, not DNS (workers enroll with arbitrary SANs). tests/swarm/test_client_agent_roundtrip.py spins uvicorn in-process with real certs on disk and verifies: - A CA-signed master client passes health + status calls. - An impostor whose cert comes from a different CA cannot connect.	2026-04-18 19:08:36 -04:00
anti	8257bcc031	feat(swarm): worker agent + fix pre-existing base_repo coverage test Worker agent (decnet.agent): - mTLS FastAPI service exposing /deploy, /teardown, /status, /health, /mutate. uvicorn enforces CERT_REQUIRED with the DECNET CA pinned. - executor.py offloads the blocking deployer onto asyncio.to_thread so the event loop stays responsive. - server.py refuses to start without an enrolled bundle in ~/.decnet/agent/ — unauthenticated agents are not a supported mode. - docs/openapi disabled on the agent — narrow attack surface. tests/test_base_repo.py: DummyRepo was missing get_attacker_artifacts (pre-existing abstractmethod) and so could not be instantiated. Added the stub + coverage for the new swarm CRUD surface on BaseRepository.	2026-04-18 07:15:53 -04:00
anti	d3b90679c5	feat(swarm): PKI module — self-managed CA for master/worker mTLS decnet.swarm.pki provides: - generate_ca() / ensure_ca() — self-signed root, PKCS8 PEM, 4096-bit. - issue_worker_cert() — per-worker keypair + cert signed by the CA with serverAuth + clientAuth EKU so the same identity backs the agent's HTTPS endpoint AND the syslog-over-TLS upstream. - write_worker_bundle() / load_worker_bundle() — persist with 0600 on private keys. - fingerprint() — SHA-256 DER hex for master-side pinning. tests/swarm/test_pki.py covers: - CA idempotency on disk. - Signed chain validates against CA subject. - SAN population (DNS + IP). - Bundle roundtrip with 0600 key perms. - End-to-end mTLS handshake between two CA-issued peers. - Cross-CA client rejection (handshake fails).	2026-04-18 07:09:58 -04:00
anti	7864c72948	test(ssh): add unit coverage for emit_capture RFC 5424 output Exercises the JSON → syslog formatter end to end: flat fields ride as SD params, bulky nested metadata collapses into the meta_json_b64 blob, and the event_type / hostname / service mapping lands in the right RFC 5424 header slots.	2026-04-18 05:37:42 -04:00
anti	8bdc5b98c9	feat(collector): parse real PROCID and extract IPs from logger kv pairs - Relaxed RFC 5424 regex to accept either NILVALUE or a numeric PROCID; sshd / sudo go through rsyslog with their real PID, while syslog_bridge emitters keep using '-'. - Added a fallback pass that scans the MSG body for IP-shaped key=value tokens. This rescues attacker attribution for plain logger callers like the SSH PROMPT_COMMAND shim, which emits 'CMD … src=IP …' without SD-element params.	2026-04-18 05:37:08 -04:00
anti	41fd496128	feat(web): attacker artifacts endpoint + UI drawer Adds the server-side wiring and frontend UI to surface files captured by the SSH honeypot for a given attacker. - New repository method get_attacker_artifacts (abstract + SQLModel impl) that joins the attacker's IP to `file_captured` log rows. - New route GET /attackers/{uuid}/artifacts. - New router /artifacts/{decky}/{service}/{stored_as} that streams a quarantined file back to an authenticated viewer. - AttackerDetail grows an ArtifactDrawer panel with per-file metadata (sha256, size, orig_path) and a download action. - ssh service fragment now sets NODE_NAME=decky_name so logs and the host-side artifacts bind-mount share the same decky identifier.	2026-04-18 05:36:48 -04:00
anti	39dafaf384	feat(ssh-stealth): hide capture artifacts via XOR+gzip entrypoint blob The /opt/emit_capture.py, /opt/syslog_bridge.py, and /usr/libexec/udev/journal-relay files were plaintext and world-readable to any attacker root-shelled into the SSH honeypot — revealing the full capture logic on a single cat. Pack all three into /entrypoint.sh as XOR+gzip+base64 blobs at build time (_build_stealth.py), then decode in-memory at container start and exec the capture loop from a bash -c string. No .py files under /opt, no journal-relay file under /usr/libexec/udev, no argv_zap name anywhere. The LD_PRELOAD shim is installed as /usr/lib/x86_64-linux-gnu/libudev-shared.so.1 — sits next to the real libudev.so.1 and blends into the multiarch layout. A 1-byte random XOR key is chosen at image build so a bare 'base64 -d \| gunzip' probe on the visible entrypoint returns binary noise instead of readable Python. Docker-dependent tests live under tests/docker/ behind a new 'docker' pytest marker (excluded from the default run, same pattern as fuzz / live / bench).	2026-04-18 05:34:50 -04:00
anti	b0e00a6cc4	fix(ssh-capture): drop relay FIFO, rsyslog→/proc/1/fd/1 direct The named pipe at /run/systemd/journal/syslog-relay had two problems beyond its argv leak: any root-in-container process could (a) `cat` the pipe and watch the live SIEM feed, and (b) write to it and inject forged log lines. Since an attacker with a shell is already root inside the honeypot, file permissions can't fix it. Point rsyslog's auth/user actions directly at /proc/1/fd/1 — the container-stdout fd Docker attached to PID 1 — and delete the mkfifo + cat relay from the entrypoint. No pipe on disk, nothing to read, nothing to inject, and one fewer cloaked process in `ps`.	2026-04-18 02:12:32 -04:00
anti	2843aafa1a	fix(ssh-capture): hide watcher bash argv and sanitize script header Two leaks remained after the inotifywait argv fix: 1. The bash running journal-relay showed its argv[1] (the script path) in /proc/PID/cmdline, producing a line like 'journal-relay /usr/libexec/udev/journal-relay' Apply argv_zap.so to that bash too. 2. argv_zap previously hardcoded PR_SET_NAME to 'kmsg-watch', which was wrong for any caller other than inotifywait. The comm name now comes from ARGV_ZAP_COMM so each caller can pick its own (kmsg-watch for inotifywait, journal-relay for the watcher bash). 3. The capture.sh header started with 'SSH honeypot file-catcher' — fatal if an attacker runs 'cat' on it. Rewritten as a plausible systemd-journal relay helper; stray 'attacker' / 'honeypot' words in mid-script comments stripped too.	2026-04-18 02:06:36 -04:00
anti	766eeb3d83	feat(ssh): add ping/nmap/ca-certificates to base image A lived-in Linux box ships with iputils-ping, ca-certificates, and nmap available. Their absence is a cheap tell, and they're handy for letting the attacker move laterally in ways we want to observe. iproute2 (ip a) was already installed for attribution — noted here for completeness.	2026-04-18 01:53:33 -04:00
anti	f462835373	feat(ssh-capture): LD_PRELOAD shim to zero inotifywait argv The kmsg-watch (inotifywait) process was the last honest giveaway in `ps aux` — its watch paths and event flags betrayed the honeypot. The argv_zap.so shim hooks __libc_start_main, heap-copies argv for the real main, then memsets the contiguous argv[1..] region to NUL so the kernel's cmdline reader returns just argv[0]. gcc is installed and purged in the same Docker layer to keep the image slim. The shim also calls prctl(PR_SET_NAME) so /proc/self/comm mirrors the argv[0] disguise.	2026-04-18 01:52:30 -04:00
anti	8dd4c78b33	refactor: strip DECNET tokens from container-visible surface Rename the container-side logging module decnet_logging → syslog_bridge (canonical at templates/syslog_bridge.py, synced into each template by the deployer). Drop the stale per-template copies; setuptools find was picking them up anyway. Swap useradd/USER/chown "decnet" for "logrelay" so no obvious token appears in the rendered container image. Apply the same cloaking pattern to the telnet template that SSH got: syslog pipe moves to /run/systemd/journal/syslog-relay and the relay is cat'd via exec -a "systemd-journal-fwd". rsyslog.d conf rename 99-decnet.conf → 50-journal-forward.conf. SSH capture script: /var/decnet/captured → /var/lib/systemd/coredump (real systemd path), logger tag decnet-capture → systemd-journal. Compose volume updated to match the new in-container quarantine path. SD element ID shifts decnet@55555 → relay@55555; synced across collector, parser, sniffer, prober, formatter, tests, and docs so the host-side pipeline still matches what containers emit.	2026-04-17 22:57:53 -04:00
anti	69510fb880	fix(ssh-capture): cloak syslog relay pipe and cat process Rename the rsyslog→stdout pipe from /var/run/decnet-logs (dead giveaway) to /run/systemd/journal/syslog-relay, and launch the relay via exec -a "systemd-journal-fwd" so ps shows a plausible systemd forwarder instead of a bare cat. Casual ps/ls inspection now shows nothing with "decnet" in the name.	2026-04-17 22:51:34 -04:00
anti	09d9f8595e	fix(ssh-capture): disguise watcher as udev helper in ps output Old ps output was a dead giveaway: two "decnet-capture" bash procs and a raw "inotifywait". Install script at /usr/libexec/udev/journal-relay and invoke inotifywait through a /usr/libexec/udev/kmsg-watch symlink so both now render as plausible udev/journal helpers under casual inspection.	2026-04-17 22:44:47 -04:00
anti	a773dddd5c	feat(ssh): capture attacker-dropped files with session attribution inotifywait watches writable paths in the SSH decky and mirrors any file close_write/moved_to into a per-decky host-mounted quarantine dir. Each artifact carries a .meta.json with attacker attribution resolved by walking the writer PID's PPid chain to the sshd session leader, then cross-referencing ss and utmp for source IP/user/login time. Also emits an RFC 5424 syslog line per capture for SIEM correlation.	2026-04-17 22:20:05 -04:00
anti	255c2e5eb7	perf: cache auth user-lookup and admin list_users The per-request SELECT users WHERE uuid=? in require_role was the hidden tax behind every authed endpoint — it kept _execute at ~60% across the profile even after the page caches landed. Even /health (with its DB and Docker probes cached) was still 52% _execute from this one query. - dependencies.py: 10s TTL cache on get_user_by_uuid, well below JWT expiry. invalidate_user_cache(uuid) is called on password change, role change, and user delete. - api_get_config.py: 5s TTL cache on the admin branch's list_users() (previously fetched every /config call). Invalidated on user create/update/delete. - api_change_pass.py + api_manage_users.py: invalidation hooks on all user-mutating endpoints.	2026-04-17 19:56:39 -04:00
anti	2dd86fb3bb	perf: cache /bounty, /logs/histogram, /deckies; bump /config TTL to 5s Round-2 follow-up: profile at 500c/u showed _execute still dominating the uncached read endpoints (/bounty 76%, /logs/histogram 73%, /deckies 56%). Same router-level TTL pattern as /stats — 5s window, asyncio.Lock to collapse concurrent calls into one DB hit. - /bounty: cache default unfiltered page (limit=50, offset=0, bounty_type=None, search=None). Filtered requests bypass. - /logs/histogram: cache default (interval_minutes=15, no filters). Filtered / non-default interval requests bypass. - /deckies: cache full response (endpoint takes no params). - /config: bump _STATE_TTL from 1.0 to 5.0 — admin writes are rare, 1s was too short for bursts to coalesce at high concurrency.	2026-04-17 19:30:11 -04:00
anti	3cc5ba36e8	fix(cli): keep FileNotFoundError handling on decnet api Popen moved inside the try so a missing uvicorn falls through to the existing error message instead of crashing the CLI. test_cli was still patching the old subprocess.run entrypoint; switched both api command tests to patch subprocess.Popen / os.killpg to match the current path.	2026-04-17 19:09:15 -04:00
anti	6301504c0e	perf(api): TTL-cache /stats + unfiltered pagination counts Every /stats call ran SELECT count(*) FROM logs + SELECT count(DISTINCT attacker_ip) FROM logs; every /logs and /attackers call ran an unfiltered count for the paginator. At 500 concurrent users these serialize through aiosqlite's worker threads and dominate wall time. Cache at the router layer (repo stays dialect-agnostic): - /stats response: 5s TTL - /logs total (only when no filters): 2s TTL - /attackers total (only when no filters): 2s TTL Filtered paths bypass the cache. Pattern reused from api_get_config and api_get_health (asyncio.Lock + time.monotonic window + lazy lock).	2026-04-17 19:09:15 -04:00
anti	b5d7bf818f	feat(health): 3-tier status (healthy / degraded / unhealthy) Only database, docker, and ingestion_worker now count as critical (→ 503 unhealthy). attacker/sniffer/collector failures drop overall status to degraded (still 200) so the dashboard doesn't panic when a non-essential worker isn't running.	2026-04-17 17:48:42 -04:00
anti	a10aee282f	perf(ingester): batch log writes into bulk commits The ingester now accumulates up to DECNET_BATCH_SIZE rows (default 100) or DECNET_BATCH_MAX_WAIT_MS (default 250ms) before flushing through repo.add_logs — one transaction, one COMMIT per batch instead of per row. Under attacker traffic this collapses N commits into ⌈N/100⌉ and takes most of the SQLite writer-lock contention off the hot path. Flush semantics are cancel-safe: _position only advances after a batch commits successfully, and the flush helper bails without touching the DB if the enclosing task is being cancelled (lifespan teardown). Un-flushed lines stay in the file and are re-read on next startup. Tests updated to assert on add_logs (bulk) instead of the per-row add_log that the ingester no longer uses, plus a new test that 250 lines flush in ≤5 calls.	2026-04-17 16:37:34 -04:00
anti	11b9e85874	feat(db): bulk add_logs for one-commit ingestion batches Adds BaseRepository.add_logs (default: loops add_log for backwards compatibility) and a real single-session/single-commit implementation on SQLModelRepository. Introduces DECNET_BATCH_SIZE (default 100) and DECNET_BATCH_MAX_WAIT_MS (default 250) so the ingester can flush on either a size or a time bound when it adopts the new method. Ingester wiring is deferred to a later pass — the single-log path was deadlocking tests when flushed during lifespan teardown, so this change ships the DB primitive alone.	2026-04-17 16:23:09 -04:00
anti	d3f4bbb62b	perf(locust): skip change-password in on_start when not required Previously every user did login → change-pass → re-login in on_start regardless of whether the server actually required a password change. With bcrypt at ~250ms/call that's 3 bcrypt-bound requests per user. At 2500 users the on_start queue was ~10k bcrypt ops — users never escaped warmup, so @task endpoints never fired. Login already returns must_change_password; only run the change-pass + re-login dance when the server says we have to. Cuts on_start from 3 requests to 1 for every user after the first DB initialization.	2026-04-17 15:15:59 -04:00
anti	f1e14280c0	perf: 1s TTL cache for /health DB probe and /config state reads Locust hit /health and /config on every @task(3), so each request was firing repo.get_total_logs() and two repo.get_state() calls against aiosqlite — filling the driver queue for data that changes on the order of seconds, not milliseconds. Both caches follow the shape already used by the existing Docker cache: - asyncio.Lock with double-checked TTL so concurrent callers collapse into one DB hit per 1s window. - _reset_* helpers called from tests/api/conftest.py::setup_db so the module-level cache can't leak across tests. tests/test_health_config_cache.py asserts 50 concurrent callers produce exactly 1 repo call, and the cache expires after TTL.	2026-04-17 15:05:18 -04:00
anti	3945e72e11	perf: run bcrypt on a thread so it doesn't block the event loop verify_password / get_password_hash are CPU-bound and take ~250ms each at rounds=12. Called directly from async endpoints, they stall every other coroutine for that window — the single biggest single-worker bottleneck on the login path. Adds averify_password / ahash_password that wrap the sync versions in asyncio.to_thread. Sync versions stay put because _ensure_admin_user and tests still use them. 5 call sites updated: login, change-password, create-user, reset-password. tests/test_auth_async.py asserts parallel averify runs concurrently (~1x of a single verify, not 2x).	2026-04-17 14:52:22 -04:00
anti	bd406090a7	fix: re-seed admin password when still unfinalized (must_change_password=True) _ensure_admin_user was strict insert-if-missing: once a stale hash landed in decnet.db (e.g. from a deploy that used a different DECNET_ADMIN_PASSWORD), login silently 401'd because changing the env var later had no effect. Now on startup: if the admin still has must_change_password=True (they never finalized their own password), re-sync the hash from the current env var. Once the admin sets a real password, we leave it alone. Found via locustfile.py login storm — see tests/test_admin_seed.py. Note: this commit also bundles uncommitted pool-management work already present in sqlmodel_repo.py from prior sessions.	2026-04-17 14:49:13 -04:00
anti	cb12e7c475	fix: logging handler must not crash its caller on reopen failure When decnet.system.log is root-owned (e.g. created by a pre-fix 'sudo decnet deploy') and a subsequent non-root process tries to log, the InodeAwareRotatingFileHandler raised PermissionError out of emit(), which propagated up through logger.debug/info and killed the collector's log stream loop ('log stream ended ... reason=[Errno 13]'). Now matches stdlib behaviour: wrap _open() in try/except OSError and defer to handleError() on failure. Adds a regression test. Also: scripts/profile/view.sh 'pyinstrument' keyword was matching memray-flamegraph-.html files. Exclude the memray- prefix.	2026-04-17 14:01:36 -04:00
anti	bf4afac70f	fix: RotatingFileHandler reopens on external deletion/rotation Mirrors the inode-check fix from `935a9a5` (collector worker) for the stdlib-handler-based log paths. Both decnet.system.log (config.py) and decnet.log (logging/file_handler.py) now use a subclass that stats the target path before each emit and reopens on inode/device mismatch — matching the behavior of stdlib WatchedFileHandler while preserving size-based rotation. Previously: rm decnet.system.log → handler kept writing to the orphaned inode until maxBytes triggered; all lines between were lost.	2026-04-17 13:42:15 -04:00
anti	4b15b7eb35	fix: chown log files to sudo-invoking user so non-root API can append 'sudo decnet deploy' needs root for MACVLAN, but the log files it creates (decnet.log and decnet.system.log) end up owned by root. A subsequent non-root 'decnet api' then crashes on PermissionError appending to them. New decnet.privdrop helper reads SUDO_UID/SUDO_GID and chowns files/dirs back to the invoking user. Best-effort: no-op when not root, not under sudo, path missing, or chown fails. Applied at both log-file creation sites (config.py system log, logging/file_handler.py syslog file).	2026-04-17 13:39:09 -04:00
anti	140d2fbaad	fix: gate embedded sniffer behind DECNET_EMBED_SNIFFER (default off) The API's lifespan unconditionally spawned a MACVLAN sniffer task, which duplicated the standalone 'decnet sniffer --daemon' process that 'decnet deploy' always starts — causing two workers to sniff the same interface, double events, and wasted CPU. Mirror the existing DECNET_EMBED_PROFILER pattern: sniffer is OFF by default, opt in explicitly. Static regression tests guard against accidental removal of the gate.	2026-04-17 13:35:43 -04:00
anti	6572c5cbaf	added: scripts/profile/view.sh — auto-pick newest artifact and open viewer Dispatches by extension: .prof -> snakeviz, memray .bin -> memray flamegraph (overridable via VIEW=table\|tree\|stats\|summary\|leaks), .svg/.html -> xdg-open. Positional arg can be a file path or a type keyword (cprofile, memray, pyspy, pyinstrument).	2026-04-17 13:20:05 -04:00
anti	ba448bae13	docs: py-spy 0.4.1 lacks Python 3.14 support; wrapper aborts early Root cause of 'No python processes found in process <pid>': py-spy needs per-release ABI knowledge and 0.4.1 (latest PyPI) predates 3.14. Wrapper now detects the interpreter and points users at pyinstrument/memray/cProfile.	2026-04-17 13:17:23 -04:00
anti	1a18377b0a	fix: mysql url builder tests expect asyncmy, not aiomysql The builder in decnet/web/db/mysql/database.py emits 'mysql+asyncmy://' URLs (asyncmy is the declared dep in pyproject.toml). Tests were stale from a prior aiomysql era.	2026-04-17 13:13:36 -04:00
anti	319c1dbb61	added: profiling toolchain (py-spy, pyinstrument, pytest-benchmark, memray, snakeviz) New `profile` optional-deps group, opt-in Pyinstrument ASGI middleware gated by DECNET_PROFILE_REQUESTS, bench marker + tests/perf/ micro-benchmarks for repository hot paths, and scripts/profile/ helpers for py-spy/cProfile/memray.	2026-04-17 13:13:00 -04:00
anti	9b59f8672e	chores: cleanup; added: viteconfig	2026-04-16 02:09:30 -04:00
anti	296979003d	fix: pytest -m live works without extra flags Root cause: test_schemathesis.py mutates decnet.web.auth.SECRET_KEY at module-level import time, poisoning JWT verification for all other tests in the same process — even when fuzz tests are deselected. - Add pytest_ignore_collect hook in tests/api/conftest.py to skip collecting test_schemathesis.py unless -m fuzz is selected - Add --dist loadscope to addopts so xdist groups by module (protects module-scoped fixtures in live tests) - Remove now-unnecessary xdist_group markers from live test classes	2026-04-16 01:55:38 -04:00
anti	89099b903d	fix: resolve schemathesis and live test failures - Add 403 response to all RBAC-gated endpoints (schemathesis UndefinedStatusCode) - Add 400 response to all endpoints accepting JSON bodies (malformed input) - Add required 'title' field to schemathesis.toml for schemathesis 4.15+ - Add xdist_group markers to live tests with module-scoped fixtures to prevent xdist from distributing them across workers (fixture isolation)	2026-04-16 01:39:04 -04:00
anti	d1a88e75bd	fix: dynamic TracedRepository proxy + disable tracing in test suite Replace brittle explicit method-by-method proxy with __getattr__-based dynamic proxy that forwards all args/kwargs to the inner repo. Fixes TypeError on get_logs_after_id() where concrete repo accepts extra kwargs beyond the ABC signature. Pin DECNET_DEVELOPER_TRACING=false in conftest.py so .env.local settings don't leak into the test suite.	2026-04-15 23:46:46 -04:00
anti	65ddb0b359	feat: add OpenTelemetry distributed tracing across all DECNET services Gated by DECNET_DEVELOPER_TRACING env var (default off, zero overhead). When enabled, traces flow through FastAPI routes, background workers (collector, ingester, profiler, sniffer, prober), engine/mutator operations, and all DB calls via TracedRepository proxy. Includes Jaeger docker-compose for local dev and 18 unit tests.	2026-04-15 23:23:13 -04:00
anti	a1ca5d699b	fix: use dedicated thread pools for collector and sniffer workers The collector spawned one permanent thread per Docker container via asyncio.to_thread(), saturating the default asyncio executor. This starved short-lived to_thread(load_state) calls in get_deckies() and get_stats_summary(), causing the SSE stream and deckies endpoints to hang indefinitely while other DB-only endpoints worked fine. Give the collector and sniffer their own ThreadPoolExecutor so they never compete with the default pool.	2026-04-15 22:57:03 -04:00
anti	e9d151734d	feat: deduplicate bounties on (bounty_type, attacker_ip, payload) Before inserting a bounty, check whether an identical row already exists. Drops silent duplicates to prevent DB saturation from aggressive scanners.	2026-04-15 18:02:52 -04:00
anti	11d749f13d	fix: wire prober tcpfp_fingerprint events into sniffer_rollup for OS/hop detection The active prober emits tcpfp_fingerprint events with TTL, window, MSS etc. from the attacker's SYN-ACK. These were invisible to the behavioral profiler for two reasons: 1. target_ip (prober's field name for attacker IP) was not in _IP_FIELDS in collector/worker.py or correlation/parser.py, so the profiler re-parsed raw_lines and got attacker_ip=None, never attributing prober events to the attacker profile. 2. sniffer_rollup only handled tcp_syn_fingerprint (passive sniffer) and ignored tcpfp_fingerprint (active prober). Prober events use different field names: window_size/window_scale/sack_ok vs window/wscale/has_sack. Changes: - Add target_ip to _IP_FIELDS in collector and parser - Add _PROBER_TCPFP_EVENT and _INITIAL_TTL table to behavioral.py - sniffer_rollup now processes tcpfp_fingerprint: maps field names, derives OS from TTL via _os_from_ttl, computes hop_distance = initial_ttl - observed - Expand prober DEFAULT_TCPFP_PORTS to [22,80,443,8080,8443,445,3389] for better SYN-ACK coverage on attacker machines - Add 4 tests covering prober OS detection, hop distance, and field mapping	2026-04-15 17:36:40 -04:00
anti	89887ec6fd	fix: serialize HTTP headers as JSON so tool detection and bounty extraction work templates/decnet_logging.py calls str(v) on all SD-PARAM values, turning a headers dict into Python repr ('{'User-Agent': ...}') rather than JSON. detect_tools_from_headers() called json.loads() on that string and silently swallowed the error, returning [] for every HTTP event. Same bug prevented the ingester from extracting User-Agent bounty fingerprints. - templates/http/server.py: wrap headers dict in json.dumps() before passing to syslog_line so the value is a valid JSON string in the syslog record - behavioral.py: add ast.literal_eval fallback for existing DB rows that were stored with the old Python repr format - ingester.py: parse headers as JSON string in _extract_bounty so User-Agent fingerprints are stored correctly going forward - tests: add test_json_string_headers and test_python_repr_headers_fallback to exercise both formats in detect_tools_from_headers	2026-04-15 17:03:52 -04:00
anti	02e73a19d5	fix: promote TCP-fingerprinted nmap to tool_guesses (detects -sC sans HTTP)	2026-04-15 16:44:45 -04:00

1 2 3 4

198 Commits