BEAM and LiveView Roadmap

The current working assumption is that Wardwright’s primary implementation should move toward a BEAM architecture:

Elixir owns runtime plumbing: HTTP, LiveView, supervision, registries, GenServers, ETS ownership, sidecar/NIF boundaries, provider calls, telemetry, dynamic config, and operational dashboards.
Gleam owns correctness-heavy pure logic: policy/config data types, action/result ADTs, route arbitration, guard-loop state machines, cache eviction decisions, receipt classification, and projection generation where exhaustiveness materially reduces bugs.
Phoenix LiveView owns the first-party operator UI so policy authoring, simulation, receipts, and runtime state can be driven directly from the same supervised backend.
Phoenix PubSub is the early visibility bus between supervised runtime state and LiveView projections. Session/model processes should publish receipt events, policy transitions, queue health, and simulation updates so the UI and other nodes can observe behavior without taking ownership of the hot session state.

This choice is now firm enough that the old Go and Rust backend prototypes have been removed from the live tree. They remain useful historical evidence in git, but active implementation should happen on the BEAM unless a later spike shows a concrete reason to reverse course. The key difference from that earlier backend comparison is that Elixir and Gleam do not need to compete as whole backends: each pure function or runtime boundary can be assigned to the language that fits it best.

Boundary Rule

Default to Gleam when all of these are true:

the logic is pure or nearly pure
invalid states can be represented with typed variants instead of ad hoc maps
exhaustive pattern matching would catch real product mistakes
the Elixir/Gleam boundary can be expressed as a small stable input/output shape

Default to Elixir when any of these are true:

the code owns a process, supervisor, registry, socket, endpoint, ETS table, or sidecar
behavior is intentionally dynamic or operator-configured
the code needs mature Phoenix/Plug/LiveView/Ecto/Telemetry APIs
the code is mostly orchestration, IO, or lifecycle management

Runtime call overhead between Elixir and Gleam should not drive the decision. Both compile to BEAM modules, so a cross-language call is an ordinary function call. The real costs are build/tooling complexity, library maturity, data-shape translation, and duplicated logic across the boundary.
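The two default rules above can be read as a small decision function. The sketch below is written in Python only to keep it language-neutral and runnable; it is not Wardwright code, and every name in it (CodeTraits, default_language, the field names) is hypothetical. The rule does not say what happens when both lists match or neither does, so the sketch assumes those cases go to human review.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CodeTraits:
    """Hypothetical checklist for one function or module."""
    # Gleam criteria: all four must hold.
    pure_or_nearly_pure: bool = False
    typed_variants_replace_maps: bool = False
    exhaustiveness_catches_mistakes: bool = False
    small_stable_boundary: bool = False
    # Elixir criteria: any one is enough.
    owns_runtime_resource: bool = False
    dynamic_or_operator_configured: bool = False
    needs_phoenix_ecosystem_apis: bool = False
    mostly_orchestration_or_io: bool = False


def default_language(t: CodeTraits) -> str:
    gleam = all([
        t.pure_or_nearly_pure,
        t.typed_variants_replace_maps,
        t.exhaustiveness_catches_mistakes,
        t.small_stable_boundary,
    ])
    elixir = any([
        t.owns_runtime_resource,
        t.dynamic_or_operator_configured,
        t.needs_phoenix_ecosystem_apis,
        t.mostly_orchestration_or_io,
    ])
    # Assumption: overlap or no match is not covered by the rule,
    # so it is flagged for review instead of silently picking a side.
    if gleam and not elixir:
        return "gleam"
    if elixir and not gleam:
        return "elixir"
    return "review"
```

For example, a traits value with all four Gleam criteria set yields "gleam", while one that only sets owns_runtime_resource yields "elixir".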

Runtime Shape

The target process hierarchy is:

application supervisor
model registry and dynamic supervisor
one model runtime subtree per synthetic model/version
session registry and dynamic supervisor under each model runtime
one session runtime per caller/session/run
narrow workers for provider calls, sidecars, dirty NIF calls, alert queues, stream windows, and policy evaluation

Required runtime tests:

crash one session and prove sibling sessions continue
crash or restart one model runtime and prove other models continue
saturate or time out a sidecar/alert queue and prove unrelated failure domains do not inherit backpressure
publish model/session/receipt events over PubSub and prove LiveView-style subscribers see ordered visibility updates without mutating session state
run a dirty NIF policy evaluation and document scheduler isolation separately from killability
emit receipts with model id/version, session id, policy version, attempt id, and failure domain
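The last requirement names the identifiers every receipt should carry. A minimal sketch of that shape, in Python for neutrality, might look like the following. The field names mirror the list above; the Receipt class, the missing_receipt_fields helper, and the sample values are hypothetical and say nothing about the real receipt schema.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class Receipt:
    """Identifying fields listed in the runtime test requirements."""
    model_id: str
    model_version: str
    session_id: str
    policy_version: str
    attempt_id: str
    failure_domain: str


def missing_receipt_fields(receipt: Receipt) -> list:
    """Return the names of identifying fields that are empty."""
    return [f.name for f in fields(receipt) if not getattr(receipt, f.name)]


# Sample values are invented for illustration only.
complete = Receipt("model-a", "v3", "session-1", "policy-7", "attempt-2", "provider")
incomplete = Receipt("model-a", "v3", "session-1", "policy-7", "attempt-2", "")
```

A runtime test could assert that missing_receipt_fields returns an empty list for every emitted receipt.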

Cluster visibility should start as PubSub-backed projections, not distributed session mutation. A session should have one authoritative owner process tree at a time. Other nodes can subscribe to visibility topics, render near-real-time state, and consume receipt/event projections. Cross-node session handoff, distributed locking, or multi-node mutation should be treated as later explicit features with their own failure semantics. Phoenix PubSub is the application mechanism; actual multi-node delivery still requires an explicit node discovery and clustering configuration such as distributed Erlang/libcluster.

Sidecars remain attractive for hard killability, but they must be scored as backpressure and scaling risks: queue depth, single-worker serialization, protocol failures, cold starts, restart storms, pool sizing, and cross-session or cross-model saturation.
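One way to reason about cross-session or cross-model saturation is to give each failure domain its own bounded queue, so a full queue rejects work locally instead of slowing its neighbors. The sketch below illustrates only that idea, in Python; DomainQueues, the domain names, and the "accepted"/"rejected" results are hypothetical, and a real BEAM implementation would use processes and mailboxes rather than an in-memory dictionary.

```python
from collections import deque


class DomainQueues:
    """One bounded queue per failure domain (hypothetical sketch)."""

    def __init__(self, max_depth):
        self.max_depth = max_depth
        self._queues = {}

    def enqueue(self, domain, item):
        queue = self._queues.setdefault(domain, deque())
        if len(queue) >= self.max_depth:
            # Saturation is reported for this domain only.
            return "rejected"
        queue.append(item)
        return "accepted"

    def depth(self, domain):
        return len(self._queues.get(domain, ()))


queues = DomainQueues(max_depth=2)
results_a = [queues.enqueue("sidecar:model-a", n) for n in range(3)]
result_b = queues.enqueue("sidecar:model-b", 0)
```

Here the third enqueue for model-a is rejected while model-b is still accepted, which is the property the saturation test above is meant to prove.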

State machines appear in several layers and should not be conflated:

policy state-machine artifacts are user-facing governance data
pure transition selectors can live in Gleam when exhaustiveness helps
Elixir runtime processes own supervision, timers, cancellation, PubSub, ETS, and provider IO
long-lived or highly eventful machines may compile to or be hosted by gen_statem when process lifecycle semantics matter

The older gen_fsm mental model is useful vocabulary, but the implementation spike should evaluate modern gen_statem and ordinary GenServer-plus-pure-core options. Users should not need to author raw callbacks for the default path. If an expert mode later allows code-backed machines, the code must still expose a transition graph, declared effects, simulation hooks, timeout behavior, and receipt trace spans.
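To make "pure transition selector" and "expose a transition graph" concrete, here is a minimal sketch in Python in which the machine is plain data and the step function has no side effects. The states, events, and effect names are invented for illustration, and the handling of unknown events (stay in place and return a reject_event effect) is an assumption, not a documented Wardwright behavior.

```python
# Hypothetical guard-loop machine: (state, event) -> (next_state, effects).
TRANSITIONS = {
    ("idle", "request"): ("evaluating", ["start_timer"]),
    ("evaluating", "pass"): ("idle", ["emit_receipt"]),
    ("evaluating", "violation"): ("retrying", ["emit_receipt", "rewrite_prompt"]),
    ("evaluating", "timeout"): ("failed", ["emit_receipt"]),
    ("retrying", "request"): ("evaluating", ["start_timer"]),
}


def step(state, event):
    """Pure transition selector: no IO, timers, or process state."""
    if (state, event) not in TRANSITIONS:
        # Assumption: undeclared events leave the state unchanged.
        return state, ["reject_event"]
    next_state, effects = TRANSITIONS[(state, event)]
    return next_state, list(effects)


def graph_edges():
    """Transition graph as (source, event, target) triples for rendering."""
    return sorted((src, event, dst) for (src, event), (dst, _) in TRANSITIONS.items())


def declared_effects():
    """Every effect a declared transition can request."""
    return sorted({e for _, effects in TRANSITIONS.values() for e in effects})
```

Because the graph and effects are derived from the same table the selector uses, a simulation view and the runtime cannot drift apart. A hosting process (GenServer or gen_statem in the real system) would own the timers and IO that the returned effects describe.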

LiveView Direction

The removed TypeScript prototype was useful for shape discovery, but the next operator UI should be built in LiveView unless a workflow proves it needs a client-heavy canvas app.

Initial LiveView surfaces:

synthetic model catalog and version switcher
policy projection workbench
simulation runner with trace overlay
receipt explorer and diff view
runtime dashboard for model/session trees, queue depth, restarts, and policy failures
advanced policy editor with a deterministic artifact preview

LiveView pages should subscribe to PubSub topics for the model, session, receipt, policy artifact, and simulation scope they are rendering. The server projection remains authoritative: PubSub messages should be small invalidation or event records that cause the LiveView to update from supervised state, durable receipts, or cached projections.
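The invalidation pattern can be sketched without Phoenix. In the Python sketch below, a single owner holds session state, publishes small event records, and a view re-reads a copied projection when an event arrives. Bus, SessionOwner, View, the topic format, and the event fields are all hypothetical, and delivery is synchronous here, whereas Phoenix PubSub delivers messages asynchronously to subscriber processes.

```python
import copy


class Bus:
    """Tiny in-process stand-in for a pub/sub bus."""

    def __init__(self):
        self._subscribers = {}

    def subscribe(self, topic, callback):
        self._subscribers.setdefault(topic, []).append(callback)

    def publish(self, topic, event):
        for callback in self._subscribers.get(topic, []):
            callback(event)


class SessionOwner:
    """Single authoritative owner of session state."""

    def __init__(self, session_id, bus):
        self.session_id = session_id
        self._bus = bus
        self._state = {"receipts": [], "seq": 0}

    def record_receipt(self, receipt_id):
        self._state["seq"] += 1
        self._state["receipts"].append(receipt_id)
        # The event is a small invalidation record, not the state itself.
        event = {"kind": "receipt_added", "seq": self._state["seq"]}
        self._bus.publish("session:" + self.session_id, event)

    def projection(self):
        # Copy so readers cannot mutate the owner's state.
        return copy.deepcopy(self._state)


class View:
    """Subscriber that re-renders from the owner's projection."""

    def __init__(self, owner, bus):
        self._owner = owner
        self.seen = []
        self.rendered = None
        bus.subscribe("session:" + owner.session_id, self.on_event)

    def on_event(self, event):
        self.seen.append(event["seq"])
        self.rendered = self._owner.projection()
```

Keeping events small means a slow or late subscriber can always recover by re-reading the projection rather than replaying payloads.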

The UI must render stable backend projections rather than engine-specific implementation details. The policy artifact and compiled plan remain the authority; projection and simulation are review aids.

Library Shortlist

Use Phoenix and LiveView primitives first. LiveView provides server-rendered interactive UI, async cancellation, hooks, and server-to-client events for the small amount of client-side behavior needed by graph widgets.

Area	Candidate	Use
Base LiveView UI	SaladUI or Petal Components	Try one small page before adopting broadly. SaladUI is shadcn-inspired with accessible components and charts; Petal is mature HEEx/Tailwind with optional LiveView.JS/Alpine behavior.
Accessible component kit	Fluxon UI	Evaluate if its component set fits dashboards better than SaladUI/Petal.
Interactive policy graph	LiveFlow	Spike for node graphs. It is very young, so treat it as experimental and keep a fallback path.
Custom graph/canvas	LiveView hook plus Cytoscape, D3, Mermaid, or custom SVG	Use only for graph interactions LiveView components cannot express cleanly. Keep the graph data shape server-owned.
Operations dashboard	Phoenix LiveDashboard plus custom pages	Use for VM/process/telemetry inspiration and possibly embed internal metrics pages.

Near-Term Spikes

Projection Contract Merge: Review and merge the policy projection FE/BE contract work. The contract should describe projection nodes, confidence, effects, conflicts, simulation traces, and receipt previews without assuming a client runtime.
LiveView Projection Workbench: Keep the current LiveView projection prototype focused on three modes: phase map, effect matrix, and trace overlay. Add server-side tests for route behavior and projection shape before adding a UI component library.
Gleam Decision Core: Initial Gleam decision modules now live under app/src/wardwright and are called from the live Elixir path through wrapper modules. They currently own structured-output guard-loop arbitration, recent-history threshold classification, and alert enqueue/backpressure classification. Next candidates are TTSR action arbitration, state-machine transition selection, and deterministic cache eviction. For now, each Gleam core should keep an Elixir equivalent behind WARDWRIGHT_POLICY_CORE=elixir|gleam|compare so the project can measure whether Gleam’s type-safety benefit remains worth the extra boundary as Elixir’s set-theoretic type system matures.
Runtime Isolation Demo: Build model/session dynamic supervisors in the primary Elixir backend and expose a small LiveView or admin endpoint that shows child trees, restarts, queue depth, PubSub visibility topics, and failure-domain receipts.
Dune vs Starlark Sandbox Spike: Dune should be evaluated separately from Gleam. Gleam is a typed core language; Dune is an Elixir sandbox candidate. Compare Dune with Starlark on sandbox strength, timeout/reduction limits, source review, visualization, ergonomics, and sidecar/NIF/backpressure tradeoffs. Track executable findings in Sandbox Language Evaluation.
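The WARDWRIGHT_POLICY_CORE switch from the Gleam Decision Core spike can be pictured as a small dispatcher. The Python sketch below shows the idea only: the two classifier functions are hypothetical stand-ins for the Elixir and Gleam cores, the threshold logic is invented, and two behaviors are assumptions the roadmap does not state, namely that an unset variable falls back to elixir and that compare mode returns the Elixir result while logging mismatches.

```python
import os


def classify_elixir_standin(recent_failures, threshold):
    """Stand-in for the Elixir equivalent (hypothetical logic)."""
    return "over_threshold" if recent_failures >= threshold else "ok"


def classify_gleam_standin(recent_failures, threshold):
    """Stand-in for the Gleam core (hypothetical logic)."""
    return "over_threshold" if recent_failures >= threshold else "ok"


CORES = {"elixir": classify_elixir_standin, "gleam": classify_gleam_standin}


def classify(recent_failures, threshold, env=None, mismatches=None):
    env = os.environ if env is None else env
    # Assumption: default to the Elixir core when the variable is unset.
    mode = env.get("WARDWRIGHT_POLICY_CORE", "elixir")
    if mode in CORES:
        return CORES[mode](recent_failures, threshold)
    if mode != "compare":
        raise ValueError("unknown WARDWRIGHT_POLICY_CORE: " + mode)
    from_elixir = CORES["elixir"](recent_failures, threshold)
    from_gleam = CORES["gleam"](recent_failures, threshold)
    if from_elixir != from_gleam and mismatches is not None:
        mismatches.append({
            "input": (recent_failures, threshold),
            "elixir": from_elixir,
            "gleam": from_gleam,
        })
    # Assumption: compare mode serves the Elixir result.
    return from_elixir
```

Running in compare mode against real traffic or recorded receipts gives a mismatch log, which is one measurable input to the question of whether the extra boundary stays worthwhile.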