Stage 6 of the realism migration. User-class file bodies (note, todo, draft, script) optionally get LLM-authored content; system classes (cron / daemon logs, /tmp caches) stay template-only because formulaic *is* the right look for them. New surface: - realism.llm.circuit.LLMCircuitBreaker — process-local sliding-window breaker. 3 consecutive failures trip open; 60s cooldown to half-open; half-open success closes, failure re-opens. Protects the orchestrator tick from sustained Ollama wedges (per-call timeout already covers one-shot hangs). - realism.prompts._style — em-dash suppression lifted from the email prompt. Persona.uses_llms_heavily opts out per the feedback_em_dash_llm_tell.md memory. Includes strip_em_dashes belt-and-braces sub for output that slipped past the prompt rule. - realism.prompts.filebody — class-conditioned prompts (note / todo / draft / script) with persona context, language pinning, output shape rule. - realism.bodies.make_body_with_llm — async wrapper around make_body that calls the LLM when one is provided AND the breaker allows. Falls back to template on timeout / error / empty / system-class. Wiring: - scheduler.pick_file accepts optional llm + llm_breaker + llm_timeout. When the planner picks a create action and the content_class is a user-class, the body_hint is replaced with the LLM-authored body (or falls back to the deterministic body_hint). - orchestrator.worker constructs get_llm() at startup gated by DECNET_REALISM_LLM env var (any non-empty value enables; empty / "off" / "none" / "0" disables). Passes llm + breaker through every tick. - decnet orchestrate gains --llm/--no-llm flag overriding the env var.
82 lines
2.3 KiB
Python
82 lines
2.3 KiB
Python
"""LLMCircuitBreaker — process-local sliding-window breaker."""
|
|
from __future__ import annotations
|
|
|
|
from decnet.realism.llm.circuit import LLMCircuitBreaker
|
|
|
|
|
|
def test_starts_closed_and_allows_calls() -> None:
|
|
breaker = LLMCircuitBreaker()
|
|
assert breaker.state == "closed"
|
|
assert breaker.allow_call() is True
|
|
|
|
|
|
def test_trips_open_after_threshold_failures() -> None:
|
|
clock_value = [0.0]
|
|
breaker = LLMCircuitBreaker(
|
|
failure_threshold=3, cooldown_seconds=60.0,
|
|
clock=lambda: clock_value[0],
|
|
)
|
|
breaker.record_failure()
|
|
assert breaker.state == "closed"
|
|
breaker.record_failure()
|
|
assert breaker.state == "closed"
|
|
breaker.record_failure()
|
|
assert breaker.state == "open"
|
|
assert breaker.allow_call() is False
|
|
|
|
|
|
def test_success_resets_consecutive_failure_count() -> None:
|
|
breaker = LLMCircuitBreaker(failure_threshold=3)
|
|
breaker.record_failure()
|
|
breaker.record_failure()
|
|
breaker.record_success()
|
|
breaker.record_failure()
|
|
breaker.record_failure()
|
|
assert breaker.state == "closed" # only 2 since the success
|
|
|
|
|
|
def test_half_open_after_cooldown() -> None:
|
|
clock_value = [0.0]
|
|
breaker = LLMCircuitBreaker(
|
|
failure_threshold=2, cooldown_seconds=10.0,
|
|
clock=lambda: clock_value[0],
|
|
)
|
|
breaker.record_failure()
|
|
breaker.record_failure()
|
|
assert breaker.state == "open"
|
|
assert breaker.allow_call() is False
|
|
|
|
clock_value[0] = 11.0
|
|
assert breaker.allow_call() is True
|
|
assert breaker.state == "half_open"
|
|
|
|
|
|
def test_half_open_failure_re_opens() -> None:
|
|
clock_value = [0.0]
|
|
breaker = LLMCircuitBreaker(
|
|
failure_threshold=2, cooldown_seconds=5.0,
|
|
clock=lambda: clock_value[0],
|
|
)
|
|
breaker.record_failure()
|
|
breaker.record_failure()
|
|
clock_value[0] = 6.0
|
|
breaker.allow_call()
|
|
assert breaker.state == "half_open"
|
|
breaker.record_failure()
|
|
assert breaker.state == "open"
|
|
|
|
|
|
def test_half_open_success_closes() -> None:
|
|
clock_value = [0.0]
|
|
breaker = LLMCircuitBreaker(
|
|
failure_threshold=2, cooldown_seconds=5.0,
|
|
clock=lambda: clock_value[0],
|
|
)
|
|
breaker.record_failure()
|
|
breaker.record_failure()
|
|
clock_value[0] = 6.0
|
|
breaker.allow_call()
|
|
breaker.record_success()
|
|
assert breaker.state == "closed"
|
|
assert breaker.allow_call() is True
|