Worker consolidation release. batch + cpu supervisor groups (verified live, -737MB / 2.57GB->1.83GB). Prefork (process-model change) deferred to 1.2.0.
59 lines
2.7 KiB
Markdown
59 lines
2.7 KiB
Markdown
# Changelog
|
||
|
||
All notable changes to DECNET are documented here.
|
||
|
||
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
|
||
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
|
||
|
||
## [1.1.0] - 2026-06-18
|
||
|
||
Worker consolidation: cut the long-running worker fleet's resident memory by
|
||
hosting co-resident workers in shared supervisor processes instead of one OS
|
||
process per worker. Behaviour-preserving — workers run the same code; only
|
||
*where* they are hosted changes, and any worker remains extractable back to its
|
||
own unit.
|
||
|
||
### Added
|
||
- `decnet supervise <group>` — hosts a co-resident worker group in one process,
|
||
paying the Python import floor and the DB connection pool once instead of once
|
||
per worker. Groups: `batch` and `cpu`.
|
||
- `decnet.supervisor` — in-process supervision primitive: each worker runs in its
|
||
own restart loop with exponential backoff (in-process `Restart=on-failure`),
|
||
run concurrently so one worker crashing never cancels its siblings.
|
||
Deliberately not `asyncio.TaskGroup`, whose all-or-nothing cancellation would
|
||
break worker isolation.
|
||
- `decnet.offload` — shared-pool CPU-kernel offload. The `cpu` group runs its two
|
||
O(n²) connected-components kernels (`cluster_observations`, `cluster_identities`)
|
||
in one shared `ProcessPoolExecutor` (forkserver) so they run in parallel
|
||
instead of serialising under the GIL. Inline when no pool is installed, so
|
||
standalone workers and tests are unchanged.
|
||
- systemd units `decnet-supervise-batch.service` and `decnet-supervise-cpu.service`
|
||
(auto-rendered by `decnet init`); each `Conflicts=` the individual units it
|
||
replaces, preventing accidental double-run.
|
||
|
||
### Changed
|
||
- `decnet.topology` no longer eagerly imports the topology generator (and the
|
||
SQLModel ORM behind it) at package import. `generate` is now a lazy PEP 562
|
||
re-export; the public API is unchanged.
|
||
|
||
### Performance
|
||
- **batch** group (`reconcile` + `enrich` + `orchestrate` + `mutate`):
|
||
509 MB across 4 processes → **129 MB** in one. **−380 MB (75%)**, verified live.
|
||
- **cpu** group (`clusterer` + `campaign-clusterer` + `attribution` +
|
||
`reuse-correlate`): 502 MB → **~146 MB** (incl. forkserver). **−357 MB (71%)**,
|
||
verified live.
|
||
- Fleet total: **2.57 GB → ~1.83 GB (−737 MB)**.
|
||
|
||
### Notes
|
||
- `webhook` (external-HTTP egress; needs hard timeouts) and `canary` (manages its
|
||
own repo) intentionally remain standalone for now.
|
||
- `bus`, `api`/`web`, `profiler`, and `ttp` remain separate by design (broker /
|
||
multiprocess servers / heavy resident state + sustained CPU).
|
||
|
||
## [1.0.0] - 2026
|
||
|
||
Initial 1.0 release. See tag `v1.0.0`.
|
||
|
||
[1.1.0]: https://git.resacachile.cl/anti/DECNET/compare/v1.0.0...v1.1.0
|
||
[1.0.0]: https://git.resacachile.cl/anti/DECNET/releases/tag/v1.0.0
|