DECNET/CHANGELOG.md
anti c7d5f3a086 release: bump to v1.1.0; add CHANGELOG
Worker consolidation release. batch + cpu supervisor groups (verified
live, -737MB / 2.57GB->1.83GB). Prefork (process-model change) deferred
to 1.2.0.
2026-06-18 18:52:31 -04:00


Changelog

All notable changes to DECNET are documented here.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

1.1.0 - 2026-06-18

Worker consolidation: cuts the long-running worker fleet's resident memory by hosting co-resident workers in shared supervisor processes instead of one OS process per worker. The change is behaviour-preserving: workers run the same code and only where they are hosted changes; any worker can still be extracted back into its own unit.

Added

  • decnet supervise <group>: hosts a co-resident worker group in one process, paying the Python import floor and the DB connection pool once instead of once per worker. Groups: batch and cpu.
  • decnet.supervisor: in-process supervision primitive. Each worker runs in its own restart loop with exponential backoff (an in-process Restart=on-failure), and the loops run concurrently, so one worker crashing never cancels its siblings. Deliberately not asyncio.TaskGroup, whose all-or-nothing cancellation would break worker isolation.
  • decnet.offload: shared-pool CPU-kernel offload. The cpu group runs its two O(n²) connected-components kernels (cluster_observations, cluster_identities) in one shared ProcessPoolExecutor (forkserver) so they run in parallel instead of serialising under the GIL. Kernels run inline when no pool is installed, so standalone workers and tests are unchanged.
  • systemd units decnet-supervise-batch.service and decnet-supervise-cpu.service (auto-rendered by decnet init); each Conflicts= the individual units it replaces, preventing accidental double-run.
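
The supervision pattern described for decnet.supervisor can be sketched roughly as follows. This is a minimal illustration, not the actual decnet.supervisor code: the names run_with_restart and supervise, the delay parameters, and the injectable sleep are all assumptions made for the example.

```python
import asyncio


async def run_with_restart(worker, *, base_delay=1.0, max_delay=60.0,
                           sleep=asyncio.sleep):
    """Run one worker, restarting it after failures with exponential backoff.

    Hypothetical helper: mirrors systemd's Restart=on-failure, so a clean
    return ends the loop and only an exception triggers a restart.
    """
    delay = base_delay
    while True:
        try:
            await worker()
            return  # clean exit: do not restart
        except asyncio.CancelledError:
            raise  # shutdown request: propagate instead of restarting
        except Exception:
            # The failure is absorbed here, so it never reaches the siblings.
            await sleep(delay)
            delay = min(delay * 2, max_delay)  # exponential backoff, capped


async def supervise(workers, **restart_options):
    """Host several workers in one process, each in its own restart loop.

    asyncio.gather does not cancel the remaining awaitables when one of
    them raises, unlike asyncio.TaskGroup, which cancels every sibling.
    """
    await asyncio.gather(
        *(run_with_restart(worker, **restart_options) for worker in workers)
    )
```

A caller would pass a list of zero-argument coroutine functions, for example `asyncio.run(supervise([reconcile, enrich]))`, where both names are placeholders for real worker entry points.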

Changed

  • decnet.topology no longer eagerly imports the topology generator (and the SQLModel ORM behind it) at package import. generate is now a lazy PEP 562 re-export; the public API is unchanged.

Performance

  • batch group (reconcile + enrich + orchestrate + mutate): 509 MB across 4 processes → 129 MB in one, a saving of 380 MB (75%), verified live.
  • cpu group (clusterer + campaign-clusterer + attribution + reuse-correlate): 502 MB → ~146 MB (incl. forkserver), a saving of 357 MB (71%), verified live.
  • Fleet total: 2.57 GB → ~1.83 GB, a saving of 737 MB.
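
The figures above can be cross-checked with a few lines of arithmetic. The helper name saving is made up for this example, and the cpu "after" value uses the rounded ~146 MB from the list, which is why it yields 356 MB rather than the quoted 357 MB.

```python
def saving(before_mb, after_mb):
    """Return (MB saved, percentage saved rounded to a whole number)."""
    saved = before_mb - after_mb
    return saved, round(100 * saved / before_mb)


batch = saving(509, 129)  # (380, 75)
cpu = saving(502, 146)    # (356, 71), using the rounded ~146 MB figure

# The quoted per-group savings add up to the quoted fleet total.
fleet_total_mb = 380 + 357  # 737
```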

Notes

  • webhook (external-HTTP egress; needs hard timeouts) and canary (manages its own repo) intentionally remain standalone for now.
  • bus, api/web, profiler, and ttp remain separate by design (broker / multiprocess servers / heavy resident state + sustained CPU).

1.0.0 - 2026

Initial 1.0 release. See tag v1.0.0.