DECNET

Author	SHA1	Message	Date
anti	e4bf8fa012	feat(creds): Phase 3 — HTTP/HTTPS POST form body cred extraction Login forms (wp-login.php, phpMyAdmin, Joomla, etc.) ship a `Content-Type: application/x-www-form-urlencoded` body with field names like username/user/email/log/pwd/password. The HTTP/HTTPS templates already captured the body as opaque bytes; now they parse common login-form shapes into the universal credential SD shape. Adds canonical templates/syslog_bridge.py: extract_form_credentials(body, content_type) -> dict \| None. Field-name matching is case-insensitive and covers: Principal: username, user, email, login, userid, account, log, user_login (WordPress), uname / pma_username (phpMyAdmin) Secret: password, pass, pwd, passwd, passwort, mot_de_passe, user_password (WordPress), pma_password (phpMyAdmin) The HTTP/HTTPS log_request handlers now call: cred = classify_authorization(...) or extract_form_credentials(...) — Authorization wins when present (current session credential beats a follow-up form change), but POSTs to /wp-login.php with no Auth header still surface their cleartext creds. Secret-without-principal is intentional: a reset-confirm or auto- fill abuse may carry a password without any field that maps to our principal list. The cred row writes with principal=None — the sha256 still correlates across services for reuse analytics. The body capture cap bumped from 512 → 4096 chars so reasonable form bodies aren't truncated before the cred extractor sees them; the body stored in fields.body stays at 512 chars (display-friendly). 36 helper + emitter tests pass. Phases 4-7 still pending.	2026-04-25 07:10:05 -04:00
anti	3404e3b3a6	feat(creds): Phase 1 — Authorization header + SNMP community capture Closes the cred-coverage gap for 7 services that already had the data on the wire but never landed it in the Credential table: - SNMP — community string lands as secret_kind="snmp_community", principal=None (v1/v2c has no per-user identity, the community IS the auth). - SIP — Digest response hash, previously buried in the auth= header dump, now classify_authorization()-extracted. - HTTP / HTTPS — Authorization header was in the headers JSON but never extracted. Now Basic decodes to plaintext, Bearer → http_bearer (principal=None), Digest → http_digest_md5. - K8s — already extracted Authorization but didn't normalize. Service- account JWTs flow through as Bearer. - Docker API — headers absent entirely. Adds the headers JSON dump and runs Authorization through the classifier. - Elasticsearch — five distinct request handlers; each gains a per-handler _cred_fields() helper. Adds canonical templates/syslog_bridge.py:classify_authorization(). Recognised: Basic / Bearer / Token / Digest. Unknown schemes (NTLM, AWS4-HMAC, Negotiate) return None; the header still rides in the ambient SD-block but isn't normalized as a credential. The SD shape on the wire collapses sip_digest_md5 into http_digest_md5 — same algorithm, so cross-protocol reuse correlates correctly when (rare) nonce collisions allow. Drive-by repair of tests/core/test_fingerprinting.py: - The pre-existing `test_http_useragent_extracted` asserted both that add_bounty was called exactly once AND that the UA payload carried `path` and `method` fields. Both wrong since this session opened: the http_quirks fingerprint added later fires too, and the UA payload never actually included path/method despite the assertion. - Adds `path`/`method` to the UA fingerprint payload (real operator value: "Nikto hit /admin" beats "Nikto seen on this decky"). - Replaces `assert_awaited_once` with a `_find_ua_bounty()` helper that filters add_bounty calls by `fingerprint_type`. New fingerprint families landing later won't retroactively break old tests. - Updates the two credential-bearing tests to use the post-DEBT-039 native shape (`secret_b64` / `principal`) and `upsert_credential`, not the deleted legacy `username+password` adapter. Also rebuilds the per-service fake `syslog_bridge` modules in tests/service_testing/{conftest,test_imap,test_pop3,test_snmp,test_mqtt,test_smtp}.py to expose `encode_secret` + `classify_authorization`. Service templates that import either now no longer fail at test collection. 173 tests pass in the touched scope. Phases 2-7 still pending.	2026-04-25 07:04:10 -04:00
anti	2f47f67eef	feat(creds): future-proof Credential storage model Replaces the opaque Bounty.bounty_type='credential' path with a dedicated `credentials` table whose schema is forward-compatible across every auth-bearing service in the fleet. Hoisted indexed columns (secret_sha256, principal, service, attacker_ip) carry the universal reuse-analytics signal; service-specific JSON keys ride in `fields`. Cross-service reuse queries become an indexed lookup on secret_sha256 instead of JSON_EXTRACT scans. Schema decisions baked in (per ANTI): - New `Credential` table, not extension to Bounty - Hoisted `principal` column for cross-service principal-reuse - Standardized JSON keys: every payload carries secret_b64 + secret_printable + principal universally; service-specific extras (user, domain, dn, mech, …) ride alongside The auth-helper SD-block emits the new shape natively. The ingester forks at _extract_bounty: - Native shape (SSH/Telnet, future emitters): secret_b64 present → direct upsert_credential - Legacy shape (FTP/POP3/IMAP/SMTP today): username + password → adapter synthesizes secret_{b64,sha256,printable} on the fly, upserts into the same Credential table. Tracked as DEBT-039; one-shot bridge until those service templates migrate. Defense-in-depth across five layers (input validation): - C helper: bytes outside [0x20, 0x7f) collapse to '?', RFC 5424 escape rules for \\, ", ]; b64 preserves exact bytes - Ingester native branch: rejects malformed secret_b64 (regex), drops the credential row but keeps the underlying Log - Ingester legacy adapter: same printable-ASCII filter as the C code; sha256 + b64 over the original utf-8 bytes (lossless, even when secret_printable is sanitized) - DB column caps with truncation warning; sha256 always over the full pre-truncation bytes so reuse queries match across truncation - JSON serialized with ensure_ascii=True so utf8mb4 columns stay safe even with non-ASCII service-specific keys Bounty.bounty_type='credential' is no longer written. Pre-v1: no historical backfill; existing rows stay untouched but unused. 595 tests pass; new tests cover the model + repo (upsert dedup, null-principal independence, cross-service reuse, filters), both ingester branches, b64 validation, sanitization preserving the fingerprinting signal in b64.	2026-04-25 05:29:26 -04:00
anti	f1026b4427	feat(telnet): same PAM cred-capture, /etc/pam.d/login Promotes auth-helper.c to decnet/templates/_shared/auth-helper/ and adds _sync_auth_helper_sources() — mirrors the existing sessrec sync pattern that keeps shared sources in step with per-template build contexts. Telnet's image grows the same multi-stage musl build, COPY of the static helper into /usr/sbin/auth-helper, and prepended pam_exec line in /etc/pam.d/login. Pulls in the `login` package (real Debian PAM-aware /bin/login, replacing busybox's PAM-less applet) and libpam-modules transitively for pam_exec.so. Verified inside the rebuilt telnet image: - /bin/login is the real 53KB Debian binary (PAM-aware) - /etc/pam.d/login top line is the auth-helper hook - pam_exec.so present at /usr/lib/x86_64-linux-gnu/security/pam_exec.so - helper smoke-run emits correct RFC 5424 line for `telnetpw` → password_b64="dGVsbmV0cHc=" SSH Dockerfile updated to read auth-helper.c from auth-helper/ subdirectory so both templates use the synced layout. The canonical source lives in _shared/; per-template copies are tracked in git AND synced at deploy time so a drift on either side rebases on the next deploy. Closes the telnet half of DEBT-038's #5 follow-up.	2026-04-25 04:52:35 -04:00
anti	2ff392511b	feat(templates): per-instance stealth seeding helper Adds instance_seed.py to every service template (conpot, docker_api, imap, k8s, llmnr, pop3, rdp, sip, smb, snmp, ssh, telnet, tftp, vnc). Derives a stable per-instance seed from NODE_NAME (+ optional INSTANCE_ID) and exposes deterministic helpers for the boring details scanners would otherwise use to fingerprint the whole fleet as one machine: cluster UUIDs, auth salts, uptime fixtures, minor version strings. Connection-time jitter is intentionally NOT seeded — two hits to the same decky must not replay the same latency curve. Identical source across every template; lives next to each service so the Docker build context picks it up without a shared package-data hop.	2026-04-23 21:51:51 -04:00
anti	a58d42e492	feat(templates): wire SSH+Telnet to sessrec transcript recorder Build login-session into both images as the swapped root shell, add a quarantine bind mount for telnet (symmetric to SSH), seed transcripts/ dir and service discriminant at entrypoint. Deployer syncs sessrec.c + Makefile into each build context alongside the existing syslog_bridge helper. sessrec falls back to /etc/sessrec.service when env is stripped (busybox /bin/login).	2026-04-21 23:03:42 -04:00
anti	6708f26e6b	fix(packaging): move templates/ into decnet/ package so they ship with pip install The docker build contexts and syslog_bridge.py lived at repo root, which meant setuptools (include = ["decnet"]) never shipped them. Agents installed via `pip install $RELEASE_DIR` got site-packages/decnet/* but no templates/, so every deploy blew up in deployer._sync_logging_helper with FileNotFoundError on templates/syslog_bridge.py. Move templates/ -> decnet/templates/ and declare it as setuptools package-data. Path resolutions in services/*.py and engine/deployer.py drop one .parent since templates now lives beside the code. Test fixtures, bandit exclude path, and coverage omit glob updated to match.	2026-04-19 19:30:04 -04:00

7 Commits