Lift the format-agnostic pieces from decnet/orchestrator/emailgen/
into the new decnet/realism/ library so file-class content generation
(stage 3 of the realism migration) can reuse them. Email-specific
delivery (RFC 2822 EML, IMAP/POP3 spool, thread chains) stays in
orchestrator/.
Renames (history-preserving git mv):
emailgen/personas.py -> realism/personas.py
emailgen/prompt.py -> realism/prompts/email.py
emailgen/global_pool.py -> realism/personas_pool.py
emailgen/llm/ -> realism/llm/
Env-var clean break (pre-v1, no aliases):
DECNET_EMAILGEN_LLM -> DECNET_REALISM_LLM
DECNET_EMAILGEN_MODEL -> DECNET_REALISM_MODEL
DECNET_EMAILGEN_TIMEOUT -> DECNET_REALISM_TIMEOUT
DECNET_EMAILGEN_PERSONAS -> DECNET_REALISM_PERSONAS
DECNET_EMAILGEN_FAKE_OUTPUT -> DECNET_REALISM_FAKE_OUTPUT
Importers rewritten in: orchestrator/emailgen/scheduler.py,
orchestrator/drivers/email.py, web/router/{emailgen,topology}/
api_personas.py, cli/emailgen.py. Tests for moved modules relocated
to tests/realism/; tests for stay-put modules updated in place.
API URL `/api/v1/emailgen/personas` and CLI `decnet emailgen
import-personas` keep their public names until the service-collapse
commit (stage 5).
48 lines
1.4 KiB
Python
48 lines
1.4 KiB
Python
"""Backend protocol shared by every LLM transport.
|
|
|
|
Deliberately narrow: realism consumers need one async ``generate``
|
|
call that takes a prompt string and returns the model's output text
|
|
plus enough metadata to populate per-event payloads (model name,
|
|
latency, success bit). Streaming, embeddings, multi-turn chat — all
|
|
out of scope here; realism only ever does one-shot single-prompt
|
|
generations.
|
|
"""
|
|
from __future__ import annotations
|
|
|
|
from dataclasses import dataclass, field
|
|
from typing import Any, Protocol
|
|
|
|
|
|
class LLMTimeout(Exception):
|
|
"""Raised when a generation exceeds the backend's wall-clock cap.
|
|
|
|
Backends MUST raise this rather than returning silently empty
|
|
output; the driver discriminates timeout from "model produced
|
|
nothing useful" so payloads carry the right ``stage`` value.
|
|
"""
|
|
|
|
|
|
@dataclass
|
|
class LLMResult:
|
|
"""Outcome of one ``generate`` call.
|
|
|
|
``success`` is ``False`` when the backend ran cleanly but produced
|
|
no usable output (e.g. an empty stdout). Hard failures (subprocess
|
|
crash, network error) raise; soft failures land here so the driver
|
|
can persist + log them as one event.
|
|
"""
|
|
success: bool
|
|
text: str
|
|
model: str
|
|
latency_ms: int
|
|
extra: dict[str, Any] = field(default_factory=dict)
|
|
|
|
|
|
class LLMBackend(Protocol):
|
|
"""Minimal contract for a realism LLM provider."""
|
|
|
|
model: str
|
|
timeout: float
|
|
|
|
async def generate(self, prompt: str) -> LLMResult: ...
|