I’m testing a different approach: treat the LLM as a compiler that emits a typed contract, and treat the runtime as a deterministic interpreter of that contract. This gives us something ML desperately needs: reproducibility and replayability for agent behavior.
Here’s the architecture I’m validating with the MVP:
Reducers don’t coordinate workflows — orchestrators do
I’ve separated the two concerns entirely (sketches after each list):
Reducers:
- Use finite state machines embedded in contracts
- Manage deterministic state transitions
- Can trigger effects when transitions fire
- Enable replay and auditability
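To make the split concrete, here is a minimal Python sketch of an FSM reducer driven by a contract’s transition table. The `Transition` and `FSMReducer` names and field shapes are mine, for illustration only; this is not the draft ONEX spec.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Transition:
    source: str                      # state the machine must be in
    event: str                       # event that fires the transition
    target: str                      # state the machine moves to
    effects: tuple[str, ...] = ()    # effect names emitted when it fires

@dataclass
class FSMReducer:
    transitions: dict                # (state, event) -> Transition

    @classmethod
    def from_contract(cls, transitions: list[dict]) -> "FSMReducer":
        # Build the lookup table straight from the contract payload.
        table = {}
        for t in transitions:
            tr = Transition(t["source"], t["event"], t["target"],
                            tuple(t.get("effects", ())))
            table[(tr.source, tr.event)] = tr
        return cls(table)

    def reduce(self, state: str, event: str) -> tuple[str, tuple[str, ...]]:
        # Pure function: the same (state, event) always yields the same
        # result, which is what makes replay and audit possible.
        tr = self.transitions.get((state, event))
        if tr is None:
            return state, ()            # unknown event: explicit no-op
        return tr.target, tr.effects    # effects are declared, not executed here
```

Note the reducer never runs an effect; it only declares which effects a transition triggers. Execution is the orchestrator’s job.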
Orchestrators:
- Coordinate workflows
- Handle branching, sequencing, fan-out, retries
- Never directly touch state
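And the orchestrator side, assuming the `FSMReducer` sketch above: sequencing, fan-out, and retries live here, while the only state transitions happen through the reducer’s pure `reduce` call.

```python
import time

class Orchestrator:
    def __init__(self, reducer, effect_handlers, max_retries=3):
        self.reducer = reducer                   # deterministic transitions
        self.effect_handlers = effect_handlers   # effect name -> callable
        self.max_retries = max_retries

    def run(self, state: str, events: list[str]) -> str:
        for event in events:                     # sequencing lives here
            state, effects = self.reducer.reduce(state, event)
            for effect in effects:               # fan-out lives here
                self._dispatch(effect)
        return state                             # state only ever came from the reducer

    def _dispatch(self, effect: str) -> None:
        for attempt in range(self.max_retries):  # retries live here, not in the reducer
            try:
                self.effect_handlers[effect]()
                return
            except Exception:
                time.sleep(2 ** attempt)         # simple exponential backoff
        raise RuntimeError(f"effect {effect!r} failed after {self.max_retries} attempts")

# Illustrative usage with a one-transition contract:
reducer = FSMReducer.from_contract([
    {"source": "idle", "event": "start", "target": "running", "effects": ["notify"]},
])
final = Orchestrator(reducer, {"notify": lambda: print("started")}).run("idle", ["start"])
```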
LLMs as Compilers, not CPUs
Instead of letting an LLM “wing it” inside a long-running loop, the LLM generates a contract.
Because contracts are typed (Pydantic/JSON/YAML-schema backed), the validation loop forces the LLM to converge on a correct structure.
Once the contract is valid, the runtime executes it deterministically. No hallucinated control flow. No implicit state.
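Roughly what that compile loop looks like with Pydantic v2. The `llm.generate` call and the contract fields are placeholders for discussion, not the real schema:

```python
from pydantic import BaseModel, ValidationError

class TransitionSpec(BaseModel):
    source: str
    event: str
    target: str
    effects: list[str] = []

class NodeContract(BaseModel):
    name: str
    initial_state: str
    transitions: list[TransitionSpec]

def compile_contract(llm, task: str, max_attempts: int = 5) -> NodeContract:
    """Ask the LLM for a contract; feed validation errors back until it
    converges on a structure the runtime can execute deterministically."""
    prompt = f"Emit a JSON NodeContract for: {task}"
    for _ in range(max_attempts):
        raw = llm.generate(prompt)   # hypothetical LLM call returning JSON text
        try:
            return NodeContract.model_validate_json(raw)
        except ValidationError as err:
            # The typed schema does the steering: errors go back verbatim.
            prompt = f"{prompt}\nYour last output failed validation:\n{err}\nFix it."
    raise RuntimeError("LLM did not converge on a valid contract")
```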
Deployment = Publish a Contract
Nodes are declarative. The runtime subscribes to an event bus. If you publish a valid contract:
- The runtime materializes the node
- No rebuilds
- No dependency hell
- No long-running agent loops
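As a sketch of what publish-to-materialize could look like, assuming a Redis pub/sub bus and the `NodeContract` and `FSMReducer` sketches above (the channel name and payload shape are illustrative):

```python
import redis

def publish_contract(contract: "NodeContract") -> None:
    bus = redis.Redis()                                   # local broker
    bus.publish("contracts", contract.model_dump_json())  # runtime subscribes here

def runtime_loop() -> None:
    bus = redis.Redis()
    sub = bus.pubsub()
    sub.subscribe("contracts")
    for message in sub.listen():
        if message["type"] != "message":
            continue
        contract = NodeContract.model_validate_json(message["data"])
        # Materialize the node: build the reducer straight from the contract,
        # with no rebuild or redeploy step in between.
        reducer = FSMReducer.from_contract(
            [t.model_dump() for t in contract.transitions]
        )
```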
Why do this?
Most “agent frameworks” today are just hand-written orchestrators glued to a chat model. They all fail in the same way: nondeterministic logic hidden behind async glue.
A contract-driven runtime with FSM reducers and explicit orchestrators fixes that.
I’m especially interested in ML-focused critique:
- Does a deterministic contract layer actually solve the reproducibility problem for agent pipelines?
- Is this a useful abstraction for building benchmarkable systems?
- What failure modes am I not accounting for?
Happy to provide architectural diagrams or the draft ONEX protocol if useful for discussion.