DECNET

Author	SHA1	Message	Date
anti	e4bf8fa012	feat(creds): Phase 3 — HTTP/HTTPS POST form body cred extraction Login forms (wp-login.php, phpMyAdmin, Joomla, etc.) ship a `Content-Type: application/x-www-form-urlencoded` body with field names like username/user/email/log/pwd/password. The HTTP/HTTPS templates already captured the body as opaque bytes; now they parse common login-form shapes into the universal credential SD shape. Adds canonical templates/syslog_bridge.py: extract_form_credentials(body, content_type) -> dict \| None. Field-name matching is case-insensitive and covers: Principal: username, user, email, login, userid, account, log, user_login (WordPress), uname / pma_username (phpMyAdmin) Secret: password, pass, pwd, passwd, passwort, mot_de_passe, user_password (WordPress), pma_password (phpMyAdmin) The HTTP/HTTPS log_request handlers now call: cred = classify_authorization(...) or extract_form_credentials(...) — Authorization wins when present (current session credential beats a follow-up form change), but POSTs to /wp-login.php with no Auth header still surface their cleartext creds. Secret-without-principal is intentional: a reset-confirm or auto- fill abuse may carry a password without any field that maps to our principal list. The cred row writes with principal=None — the sha256 still correlates across services for reuse analytics. The body capture cap bumped from 512 → 4096 chars so reasonable form bodies aren't truncated before the cred extractor sees them; the body stored in fields.body stays at 512 chars (display-friendly). 36 helper + emitter tests pass. Phases 4-7 still pending.	2026-04-25 07:10:05 -04:00
anti	0c1316f74c	feat(creds): Phase 2 — MySQL handshake hash + MSSQL Login7 plaintext Closes the cred-coverage gap for two database services that had been capturing only the username: - MySQL — extends _handle_packet to read the auth-response after the null-terminated username. mysql_native_password puts a 1-byte length followed by 20 bytes: SHA1(password) XOR SHA1(salt + SHA1(SHA1(password))). Plaintext irrecoverable, lands as secret_kind="mysql_native_password" with the 20 hash bytes in secret_b64. Hash is canonical for "hashcat -m 11200" if an operator ever wants to crack offline. - MSSQL — fixes a pre-existing bug AND adds password capture. The prior _parse_login7_username read offsets 36/38, which is actually ibHostName/cchHostName in the Login7 layout — username sat at 40/42 and was never touched. Replaced with _parse_login7_creds() reading the correct offsets (40 username, 44 password). Login7 password is XOR-then-nibble-swap obfuscated against 0xa5; _deobfuscate_login7_password reverses it. Plaintext-recoverable, lands as secret_kind="plaintext". The pre-existing test_login7_auth_logged_and_closes only verified the error response ships and the connection closes; it didn't validate the parsed username, so the hostname-as-username bug was silent. New tests cover both the deobfuscation algorithm directly and the full ingester round-trip for both services. Sync: copies the canonical syslog_bridge.py into mysql/ and mssql/ template build contexts so service_testing tests load the version with classify_authorization + encode_secret available. 37 tests pass in the touched scope. Phases 3-7 still pending.	2026-04-25 07:07:33 -04:00
anti	3404e3b3a6	feat(creds): Phase 1 — Authorization header + SNMP community capture Closes the cred-coverage gap for 7 services that already had the data on the wire but never landed it in the Credential table: - SNMP — community string lands as secret_kind="snmp_community", principal=None (v1/v2c has no per-user identity, the community IS the auth). - SIP — Digest response hash, previously buried in the auth= header dump, now classify_authorization()-extracted. - HTTP / HTTPS — Authorization header was in the headers JSON but never extracted. Now Basic decodes to plaintext, Bearer → http_bearer (principal=None), Digest → http_digest_md5. - K8s — already extracted Authorization but didn't normalize. Service- account JWTs flow through as Bearer. - Docker API — headers absent entirely. Adds the headers JSON dump and runs Authorization through the classifier. - Elasticsearch — five distinct request handlers; each gains a per-handler _cred_fields() helper. Adds canonical templates/syslog_bridge.py:classify_authorization(). Recognised: Basic / Bearer / Token / Digest. Unknown schemes (NTLM, AWS4-HMAC, Negotiate) return None; the header still rides in the ambient SD-block but isn't normalized as a credential. The SD shape on the wire collapses sip_digest_md5 into http_digest_md5 — same algorithm, so cross-protocol reuse correlates correctly when (rare) nonce collisions allow. Drive-by repair of tests/core/test_fingerprinting.py: - The pre-existing `test_http_useragent_extracted` asserted both that add_bounty was called exactly once AND that the UA payload carried `path` and `method` fields. Both wrong since this session opened: the http_quirks fingerprint added later fires too, and the UA payload never actually included path/method despite the assertion. - Adds `path`/`method` to the UA fingerprint payload (real operator value: "Nikto hit /admin" beats "Nikto seen on this decky"). - Replaces `assert_awaited_once` with a `_find_ua_bounty()` helper that filters add_bounty calls by `fingerprint_type`. New fingerprint families landing later won't retroactively break old tests. - Updates the two credential-bearing tests to use the post-DEBT-039 native shape (`secret_b64` / `principal`) and `upsert_credential`, not the deleted legacy `username+password` adapter. Also rebuilds the per-service fake `syslog_bridge` modules in tests/service_testing/{conftest,test_imap,test_pop3,test_snmp,test_mqtt,test_smtp}.py to expose `encode_secret` + `classify_authorization`. Service templates that import either now no longer fail at test collection. 173 tests pass in the touched scope. Phases 2-7 still pending.	2026-04-25 07:04:10 -04:00
anti	6b16c844b6	fix(creds): MQTT regression + secret_kind for hash credentials Honest correction to the "every cred-emitting service" claim. Audit of templates/* found three gaps: 1. MQTT — was working through the legacy adapter, silently dropped when Phase 3 (`e696c2b`) deleted it. Now migrated to encode_secret() alongside the others. 2. Postgres — `auth, pw_hash=…` event captures the MD5 challenge-response the attacker sent. Plaintext irrecoverable, so it never fit the (principal, secret_b64=raw_bytes) shape. Lands in Credential as secret_kind="postgres_md5_challenge". 3. VNC — `auth_response, response=…hex` event captures the 16-byte DES-encrypted challenge. Same situation as Postgres: plaintext irrecoverable. Lands as secret_kind="vnc_des_response". Adds a `secret_kind` discriminator column to Credential (default "plaintext", indexed). The dedup tuple gains secret_kind so two credentials with the same sha256 but different kinds are fundamentally different rows — different challenges produce different bytes for the same plaintext password, so cross-kind reuse matches are meaningless and would only confuse analytics. The model now genuinely covers every cred-emitting service in the fleet: plaintext SSH, Telnet, FTP, POP3, IMAP, SMTP, Redis, LDAP, MQTT postgres_md5_* Postgres vnc_des_response VNC Username-only services (MySQL/MSSQL — TDS pre-encryption captures the user but never sees the password byte) intentionally don't feed Credential — they're recon signals, not cred attempts. 40 tests pass in the touched scope. New cases: secret_kind dedups independently in the repo; Postgres MD5 + VNC DES emitters thread through; MQTT round-trips through the native branch.	2026-04-25 06:16:57 -04:00
anti	abb4dd9fc0	feat(templates): migrate six cred emitters to native shape Phase 2/3 of DEBT-039. Switches FTP, POP3, IMAP, SMTP, Redis, and LDAP from the legacy `username=` + `password=` SD-block shape to the universal credential shape (`principal=` + `secret_printable=` + `secret_b64=`) the new Credential storage model expects. Pattern is uniform across all six services: _log("auth_attempt", username=u, principal=u, **encode_secret(pw)) Each service emits the canonical SD keys. The ingester's native-shape branch (introduced in `2f47f67`) now writes their cred attempts directly without going through the legacy adapter. Once Phase 3 removes the adapter the contract becomes single-shape. Per-service notes: - POP3 / IMAP — `status="success"\|"failed"` renamed to `outcome="success"\|"failure"` to match Credential.outcome's vocabulary; the ingester reads outcome directly. - SMTP — AUTH path migrated; in addition the existing mail_from event now exposes a parsed `domain=` field alongside the original `value=` so future "what domains do attackers spoof from" analytics have an indexed field. Not stored in Credential — regular Log row. - Redis — was silently dropped by the legacy adapter (no `username` field). Native branch handles `principal=None` correctly. BONUS FIX: the Redis 6+ ACL syntax `AUTH <user> <pw>` now captures the ACL username as principal (was previously discarded). - LDAP — was silently dropped by the legacy adapter (no `password` recognition for the `bind` event). Now lands as `principal=<dn>`. BONUS FIX. Tests (tests/services/test_cred_emitters.py, 9 cases): - per-service native-shape ingest path produces correct Credential rows; outcome maps for POP3/IMAP; principal=None for legacy Redis AUTH; principal=dn for LDAP. - mail_from event does NOT trigger a credential write (it's a Log-only observation, not auth). - 0xff/NUL/ANSI bytes in passwords survive losslessly through secret_b64 even when secret_printable is sanitized. Phase 3 deletes the legacy adapter once all migrations land — the adapter has no live emitters to handle anymore.	2026-04-25 05:43:51 -04:00
anti	aebb9f81c6	feat(templates): encode_secret() helper in canonical syslog_bridge Phase 1/3 of DEBT-039. Adds the Python emitter-side counterpart to auth-helper.c's sd_escape + base64 logic so service templates can emit the universal credential SD shape with a single spread: _log("auth_attempt", principal=user, **encode_secret(password)) secret_printable mirrors the C helper's [0x20, 0x7f) → '?' contract; secret_b64 preserves the ORIGINAL utf-8 bytes losslessly so non-ASCII or control-byte payloads survive as fingerprinting signal even when the printable form sanitizes them. The canonical syslog_bridge.py is what _sync_logging_helper() propagates into per-template build contexts at deploy time, so any service that imports its local syslog_bridge picks this up automatically on next rebuild. Phase 2 migrates the six cred-emitting service templates (FTP, POP3, IMAP, SMTP, Redis, LDAP) onto this helper. Phase 3 deletes the ingester's legacy adapter once nothing emits the old shape.	2026-04-25 05:37:44 -04:00
anti	2f47f67eef	feat(creds): future-proof Credential storage model Replaces the opaque Bounty.bounty_type='credential' path with a dedicated `credentials` table whose schema is forward-compatible across every auth-bearing service in the fleet. Hoisted indexed columns (secret_sha256, principal, service, attacker_ip) carry the universal reuse-analytics signal; service-specific JSON keys ride in `fields`. Cross-service reuse queries become an indexed lookup on secret_sha256 instead of JSON_EXTRACT scans. Schema decisions baked in (per ANTI): - New `Credential` table, not extension to Bounty - Hoisted `principal` column for cross-service principal-reuse - Standardized JSON keys: every payload carries secret_b64 + secret_printable + principal universally; service-specific extras (user, domain, dn, mech, …) ride alongside The auth-helper SD-block emits the new shape natively. The ingester forks at _extract_bounty: - Native shape (SSH/Telnet, future emitters): secret_b64 present → direct upsert_credential - Legacy shape (FTP/POP3/IMAP/SMTP today): username + password → adapter synthesizes secret_{b64,sha256,printable} on the fly, upserts into the same Credential table. Tracked as DEBT-039; one-shot bridge until those service templates migrate. Defense-in-depth across five layers (input validation): - C helper: bytes outside [0x20, 0x7f) collapse to '?', RFC 5424 escape rules for \\, ", ]; b64 preserves exact bytes - Ingester native branch: rejects malformed secret_b64 (regex), drops the credential row but keeps the underlying Log - Ingester legacy adapter: same printable-ASCII filter as the C code; sha256 + b64 over the original utf-8 bytes (lossless, even when secret_printable is sanitized) - DB column caps with truncation warning; sha256 always over the full pre-truncation bytes so reuse queries match across truncation - JSON serialized with ensure_ascii=True so utf8mb4 columns stay safe even with non-ASCII service-specific keys Bounty.bounty_type='credential' is no longer written. Pre-v1: no historical backfill; existing rows stay untouched but unused. 595 tests pass; new tests cover the model + repo (upsert dedup, null-principal independence, cross-service reuse, filters), both ingester branches, b64 validation, sanitization preserving the fingerprinting signal in b64.	2026-04-25 05:29:26 -04:00
anti	f1026b4427	feat(telnet): same PAM cred-capture, /etc/pam.d/login Promotes auth-helper.c to decnet/templates/_shared/auth-helper/ and adds _sync_auth_helper_sources() — mirrors the existing sessrec sync pattern that keeps shared sources in step with per-template build contexts. Telnet's image grows the same multi-stage musl build, COPY of the static helper into /usr/sbin/auth-helper, and prepended pam_exec line in /etc/pam.d/login. Pulls in the `login` package (real Debian PAM-aware /bin/login, replacing busybox's PAM-less applet) and libpam-modules transitively for pam_exec.so. Verified inside the rebuilt telnet image: - /bin/login is the real 53KB Debian binary (PAM-aware) - /etc/pam.d/login top line is the auth-helper hook - pam_exec.so present at /usr/lib/x86_64-linux-gnu/security/pam_exec.so - helper smoke-run emits correct RFC 5424 line for `telnetpw` → password_b64="dGVsbmV0cHc=" SSH Dockerfile updated to read auth-helper.c from auth-helper/ subdirectory so both templates use the synced layout. The canonical source lives in _shared/; per-template copies are tracked in git AND synced at deploy time so a drift on either side rebases on the next deploy. Closes the telnet half of DEBT-038's #5 follow-up.	2026-04-25 04:52:35 -04:00
anti	d064125f61	feat(ssh): capture password attempts via pam_exec auth-helper Real OpenSSH doesn't log attempted passwords — only success/failure with username — leaving SSH the sole auth-bearing service in the fleet that contributes nothing to the cred corpus FTP/MySQL/RDP/ VNC/etc. populate. Closes that gap with a tiny pam_exec shim. A static C helper (~80 LoC, musl, ~38KB stripped) is wired into /etc/pam.d/sshd as `auth optional pam_exec.so expose_authtok stdout /usr/sbin/auth-helper`. pam_exec writes the attempted password to the helper's stdin NUL-terminated; the helper formats an RFC 5424 line in the exact shape templates/syslog_bridge.py produces (facility local0, PEN 55555, MSGID auth_attempt — same MSGID FTP uses) and writes it to /proc/1/fd/1 so the existing collector stdout-reader pipeline picks it up. Two password fields ride in the SD-block: - password= RFC 5424 escaped, ASCII-printable only, ? for non- printables. FTP-compatible — existing dashboard rendering picks up SSH attempts unchanged. - password_b64= base64 of the exact PAM_AUTHTOK bytes. Preserves NUL/0xff/control-byte fingerprinting signal that the plain field necessarily drops. Fail-open by design: the PAM line is `optional` so a malfunctioning helper never blocks sshd auth. Better to miss a cred than break the honeypot. Verified end-to-end inside the rebuilt image: - 38KB static ELF, runs without a dynamic linker - correct RFC 5424 line for `hunter2` → b64 `aHVudGVyMg==` - NUL truncation matches pam_exec's contract - 0xff bytes survive losslessly through password_b64 - empty password produces a well-formed line (e.g. pubkey auth path)	2026-04-25 04:42:50 -04:00
anti	2ff392511b	feat(templates): per-instance stealth seeding helper Adds instance_seed.py to every service template (conpot, docker_api, imap, k8s, llmnr, pop3, rdp, sip, smb, snmp, ssh, telnet, tftp, vnc). Derives a stable per-instance seed from NODE_NAME (+ optional INSTANCE_ID) and exposes deterministic helpers for the boring details scanners would otherwise use to fingerprint the whole fleet as one machine: cluster UUIDs, auth salts, uptime fixtures, minor version strings. Connection-time jitter is intentionally NOT seeded — two hits to the same decky must not replay the same latency curve. Identical source across every template; lives next to each service so the Docker build context picks it up without a shared package-data hop.	2026-04-23 21:51:51 -04:00
anti	c50448995b	feat(smtp): capture full messages + attachments to disk SMTP template now writes each accepted DATA body as a .eml file into a bind-mounted per-decky quarantine dir and emits a `message_stored` log with sha256, size, decoded headers, and an attachment manifest (filename + sha256 + size + content-type). Attachment hashing uses the decoded payload so operators can match against VT / MalwareBazaar directly. Body accumulator is capped at SMTP_MAX_BODY_BYTES (default 10 MB, matching the EHLO SIZE advert) so a streaming client can't OOM the container. The existing /api/v1/artifacts/{decky}/{stored_as} endpoint now takes an optional ?service= query param (defaults to ssh for back-compat) and can serve .eml files out of the smtp subdir. Forensic metadata rides the normal log pipeline, same as SSH file_captured.	2026-04-22 22:17:50 -04:00
anti	3fb84ac5d0	feat(templates): per-instance stealth via instance_seed in service servers Every service template now pulls version strings, cluster/node UUIDs, auth salts, greeting banners, and uptime from the seeded per-instance RNG instead of hard-coded defaults. Scanners sweeping the fleet now see legitimately diverging fingerprints per decky while each decky's own responses stay internally consistent across restarts. Covers elasticsearch, ftp, http, https, ldap, mongodb, mqtt, mssql, mysql, postgres, redis, and smtp templates.	2026-04-22 09:24:16 -04:00
anti	51e9e263ca	feat(templates): add instance_seed stealth helper and wire into template builds Each decky now gets a deterministic-per-instance seeded RNG derived from NODE_NAME, so cluster UUIDs, version strings, uptime, and credentials diverge across the fleet while staying stable within one container. The canonical helper lives at decnet/templates/instance_seed.py; the deployer copies it into every active template build context alongside syslog_bridge.py. Dockerfiles COPY it to /opt/ so server.py can import it. Connection-time jitter intentionally stays unseeded — two hits to the same decky must not replay the same latency curve.	2026-04-22 09:24:04 -04:00
anti	a58d42e492	feat(templates): wire SSH+Telnet to sessrec transcript recorder Build login-session into both images as the swapped root shell, add a quarantine bind mount for telnet (symmetric to SSH), seed transcripts/ dir and service discriminant at entrypoint. Deployer syncs sessrec.c + Makefile into each build context alongside the existing syslog_bridge helper. sessrec falls back to /etc/sessrec.service when env is stripped (busybox /bin/login).	2026-04-21 23:03:42 -04:00
anti	4596c1d69a	feat(templates): add sessrec pty transcript recorder New decnet/templates/_shared/sessrec/ — a small C program installed as the login shell in SSH / Telnet deckies. Forkpty-relays /bin/bash, records each chunk as an asciinema v2 event into a shared JSONL day-shard keyed by sid, and emits one RFC 5424 session_recorded line on exit (direct to PID 1's stdout, same pattern syslog_bridge.py uses). Storage: one shard per (decky, UTC day) at /var/lib/systemd/coredump/transcripts/sessions-YYYY-MM-DD.jsonl. Concurrent appends are lock-free: each write is chunked below PIPE_BUF so O_APPEND interleaves atomically. Per-session cap 10 MB with a trunc sentinel; disk- free precheck (<200 MB) falls through to plain bash with a session_skipped log event. Attacker src_ip resolves from \$SSH_CONNECTION, getpeername(0), or utmp in that order. SIGWINCH appends a 'r' resize event so ncurses replays stay aligned. Stealth for v1: /etc/passwd shell-swap to /usr/libexec/login-session (plausible login-machinery path) + prctl comm disguise. Full LD_PRELOAD argv-zap is deferred — sshd strips LD_PRELOAD from the session env, so wiring the existing argv_zap.so into this path needs a separate wrapper. DEBT-033 opened for size-based day-shard rotation; v1's disk-free precheck covers the worst case but can be blinded by a one-shot disk fill.	2026-04-21 22:56:42 -04:00
anti	897ce4035f	fix(sniffer): mark JA3/JA3S MD5 hashing as non-security JA3/JA3S fingerprints are defined by their specs as MD5 digests of the ClientHello/ServerHello feature tuples — they are identifiers, not security primitives. Pass usedforsecurity=False at the two call sites so bandit stops flagging them as B324 High when scanning outside the templates/ exclude.	2026-04-20 23:06:31 -04:00
anti	6708f26e6b	fix(packaging): move templates/ into decnet/ package so they ship with pip install The docker build contexts and syslog_bridge.py lived at repo root, which meant setuptools (include = ["decnet"]) never shipped them. Agents installed via `pip install $RELEASE_DIR` got site-packages/decnet/* but no templates/, so every deploy blew up in deployer._sync_logging_helper with FileNotFoundError on templates/syslog_bridge.py. Move templates/ -> decnet/templates/ and declare it as setuptools package-data. Path resolutions in services/*.py and engine/deployer.py drop one .parent since templates now lives beside the code. Test fixtures, bandit exclude path, and coverage omit glob updated to match.	2026-04-19 19:30:04 -04:00

17 Commits