Codex-Temporal Design: Session Integration

Problem

Codex's Session is a monolithic "god object" that owns everything needed to run the agent in a single process: conversation state, model client, tool execution, MCP connections, auth, analytics, sandbox, file watching, network proxy, agent lifecycle, and more.

The Temporal harness only needs the state management portion. Session::new_minimal() constructs a full Session with most service fields stubbed to defaults/no-ops. This works and is the chosen approach — it avoids splitting Session into separate types, which would require moving ~20 methods, updating all callers, and touching 16+ files for marginal benefit.

Approach: Single Session with Stubs

Rather than extracting a SessionCore base type, we keep a single Session struct and use Session::new_minimal() to construct it for the Temporal workflow. The stub fields carry no meaningful runtime cost and the constructor is straightforward.

Session (used by both codex CLI and Temporal workflow)
├── conversation_id
├── state (Mutex<SessionState>)        ← conversation history, token info
├── features
├── active_turn
├── event_sink (dyn EventSink)         ← ChannelEventSink (codex) / BufferEventSink (Temporal)
├── storage (dyn StorageBackend)       ← RolloutFileStorage (codex) / InMemoryStorage (Temporal)
├── next_internal_sub_id
├── tx_event
├── agent_status
├── pending_mcp_server_refresh_config
├── js_repl
└── services: SessionServices
    ├── model_client                   ← stubbed in Temporal (unused — activities do model calls)
    ├── mcp_connection_manager         ← stubbed (default empty)
    ├── auth_manager                   ← stubbed (API key "harness")
    ├── otel_manager                   ← no-op (no telemetry in workflow)
    ├── ... (24 fields total)          ← all stubbed to defaults/no-ops
    └── show_raw_agent_reasoning       ← false
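The layout above can be sketched in a few lines of std-only Rust. This is a simplified illustration, not the actual codex types: the trait methods (`emit`, `persist`), the three service fields shown, the `record` helper, and the signature of `new_minimal` are all assumptions made for the example.

```rust
use std::sync::{Arc, Mutex};

// Hypothetical, simplified stand-ins for the traits named in the tree above.
trait EventSink: Send + Sync {
    fn emit(&self, event: String);
}

trait StorageBackend: Send + Sync {
    fn persist(&self, item: String);
}

/// Buffers events in memory so a caller can drain them later.
#[derive(Default)]
struct BufferEventSink {
    events: Mutex<Vec<String>>,
}

impl BufferEventSink {
    /// Returns all buffered events and leaves the buffer empty.
    fn drain(&self) -> Vec<String> {
        std::mem::take(&mut *self.events.lock().unwrap())
    }
}

impl EventSink for BufferEventSink {
    fn emit(&self, event: String) {
        self.events.lock().unwrap().push(event);
    }
}

/// Keeps persisted items in memory instead of writing rollout files.
#[derive(Default)]
struct InMemoryStorage {
    items: Mutex<Vec<String>>,
}

impl StorageBackend for InMemoryStorage {
    fn persist(&self, item: String) {
        self.items.lock().unwrap().push(item);
    }
}

/// Only three illustrative service fields are modelled here.
#[derive(Default)]
struct SessionServices {
    model_client: Option<String>,   // None: activities perform model calls
    mcp_servers: Vec<String>,       // empty: no MCP connections in the workflow
    show_raw_agent_reasoning: bool, // false
}

struct Session {
    history: Mutex<Vec<String>>,
    event_sink: Arc<dyn EventSink>,
    storage: Arc<dyn StorageBackend>,
    services: SessionServices,
}

impl Session {
    /// Assumed shape of the minimal constructor: inject the two trait
    /// objects, default everything else.
    fn new_minimal(event_sink: Arc<dyn EventSink>, storage: Arc<dyn StorageBackend>) -> Self {
        Session {
            history: Mutex::new(Vec::new()),
            event_sink,
            storage,
            services: SessionServices::default(),
        }
    }

    /// Appends to history, persists the item, and emits an event.
    fn record(&self, item: &str) {
        self.history.lock().unwrap().push(item.to_string());
        self.storage.persist(item.to_string());
        self.event_sink.emit(format!("recorded: {item}"));
    }
}
```

The point of the sketch is that the state-management path (`record`) only touches the injected trait objects, so the defaulted `services` fields never need to be real.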

Why this works

event_sink and storage are injected via traits — Temporal provides its own implementations.
EntropyProviders are scoped per-turn via task-local ENTROPY — Temporal injects deterministic implementations.
ModelStreamer and ToolCallHandler are generic parameters on try_run_sampling_request — Temporal provides its own activity-backed implementations.
The 24 stubbed service fields are never accessed in the Temporal workflow path. They add a few KB of dead weight but cause no correctness issues.
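The per-turn entropy scoping can be illustrated with a std-only sketch. It uses `thread_local!` as a stand-in for the task-local `ENTROPY`, and the `RandomSource` trait, `SeededRandom`, `with_entropy`, `random_u64`, and `run_turn` names are hypothetical. The generator is SplitMix64, chosen here only because it is small and deterministic.

```rust
use std::cell::RefCell;

/// Hypothetical randomness trait; the real EntropyProviders API may differ.
trait RandomSource {
    fn next_u64(&mut self) -> u64;
}

/// Deterministic generator (SplitMix64): the same seed yields the same sequence.
struct SeededRandom {
    state: u64,
}

impl RandomSource for SeededRandom {
    fn next_u64(&mut self) -> u64 {
        self.state = self.state.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.state;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    }
}

thread_local! {
    // Thread-local stand-in for the task-local ENTROPY described above.
    static ENTROPY: RefCell<Option<Box<dyn RandomSource>>> = RefCell::new(None);
}

/// Runs `f` with `source` installed, then restores the previous source.
/// (The previous source is not restored if `f` panics.)
fn with_entropy<R>(source: Box<dyn RandomSource>, f: impl FnOnce() -> R) -> R {
    let previous = ENTROPY.with(|e| e.borrow_mut().replace(source));
    let result = f();
    ENTROPY.with(|e| *e.borrow_mut() = previous);
    result
}

/// Draws from the scoped source; panics if none is installed.
fn random_u64() -> u64 {
    ENTROPY.with(|e| {
        e.borrow_mut()
            .as_mut()
            .expect("no entropy source installed for this turn")
            .next_u64()
    })
}

/// One "turn": every random draw inside the scope comes from the seeded source.
fn run_turn(seed: u64) -> Vec<u64> {
    with_entropy(Box::new(SeededRandom { state: seed }), || {
        (0..3).map(|_| random_u64()).collect()
    })
}
```

Because code inside the turn calls `random_u64()` rather than a system RNG, replaying the turn with the same seed reproduces the same values, which is the property workflow replay depends on.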

Why not SessionCore

We explored extracting a SessionCore struct (containing only the fields try_run_sampling_request needs) and having Session wrap it via Deref. This required:

Moving ~20 methods from impl Session to impl SessionCore
Updating every call site that accesses services.otel_manager (16+ files, 40+ references)
Changing try_run_sampling_request signature from Arc<Session> to Arc<SessionCore>
Updating all plan-mode helpers, HandleOutputCtx, drain_in_flight, etc.

The refactoring touched too many files for too little benefit. The single-Session approach with stubs is simpler, works today, and can be revisited if the stub overhead ever becomes a real problem.

How It Maps to Temporal

Temporal decomposes Session's responsibilities across its own primitives. Session::new_minimal() runs in the workflow; SessionServices functionality is replaced by activities:

┌─────────────────────────────────────────────────────────┐
│ Temporal Workflow (deterministic)                       │
│                                                         │
│  Session (via new_minimal)                              │
│  ├── conversation history    (workflow state)           │
│  ├── event sink              (BufferEventSink)          │
│  ├── storage                 (InMemoryStorage)          │
│  ├── config                  (Config::for_harness)      │
│  └── services                (all stubbed)              │
│                                                         │
│  try_run_sampling_request(sess, streamer, handler)      │
│       │                          │            │         │
│       │                   ┌──────┘     ┌──────┘         │
│       ▼                   ▼            ▼                │
│  Session             ModelStreamer  ToolCallHandler      │
│  (state + stubs)     (trait)       (trait)               │
└───────────────────────┬───────────────┬─────────────────┘
                        │               │
                  ┌─────▼─────┐   ┌─────▼─────┐
                  │ Activity  │   │ Activity  │
                  │model_call │   │ tool_exec │
                  │           │   │           │
                  │ModelClient│   │shell/patch│
                  │(codex)    │   │/read/grep │
                  └───────────┘   └───────────┘

What runs where

| Responsibility | Codex (single process) | Temporal |
| --- | --- | --- |
| Conversation history | `Session.state` | `Session::new_minimal()` in workflow |
| Prompt building | `Session` methods | Same `Session` methods in workflow |
| Model calls | `Session.services.model_client` | `model_call` activity (uses `ModelClient`) |
| Tool execution | `Session.services` (shell, sandbox) | `tool_exec` activity (subprocess/file I/O) |
| Event delivery | `Session.event_sink` (channel) | `BufferEventSink` + query polling |
| Persistence | `Session.services.rollout` (files) | Temporal workflow history (built-in) |
| Fault tolerance | None (process dies = state lost) | Workflow replay (automatic) |
| Auth | `Session.services.auth_manager` | Worker env var (`OPENAI_API_KEY`) |
| Entropy | System random + clock | Deterministic `TemporalRandomSource` / `TemporalClock` |

Codex-Core Extension Points (already done)

No SessionCore extraction needed. The following traits/abstractions are already in place:

EventSink trait — pluggable event delivery (ChannelEventSink / BufferEventSink)
StorageBackend trait — pluggable rollout persistence (RolloutFileStorage / InMemoryStorage)
ModelStreamer trait — pluggable model calls (generic on try_run_sampling_request)
ToolCallHandler trait — pluggable tool execution (generic on try_run_sampling_request)
EntropyProviders — pluggable random/clock via task-local ENTROPY
AgentSession trait — pluggable agent backend for TUI
Session::new_minimal() — zero-service constructor for external harnesses
Config::for_harness() — zero-I/O config constructor
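To show why making `ModelStreamer` and `ToolCallHandler` generic parameters is enough to swap the backend, here is a synchronous, std-only sketch. The trait method signatures, the `ModelOutput` enum, and the two test doubles are assumptions for illustration. The real `try_run_sampling_request` takes an `Arc<Session>`, whereas this version takes a plain history vector to stay self-contained.

```rust
/// Hypothetical, synchronous simplifications of the traits listed above.
trait ModelStreamer {
    fn stream(&mut self, prompt: &str) -> Vec<ModelOutput>;
}

trait ToolCallHandler {
    fn handle(&mut self, name: &str, args: &str) -> String;
}

#[derive(Debug, Clone, PartialEq)]
enum ModelOutput {
    Text(String),
    ToolCall { name: String, args: String },
}

/// Generic over both traits, so the loop body is identical whether the
/// implementations call a model client directly or schedule activities.
fn try_run_sampling_request<M: ModelStreamer, T: ToolCallHandler>(
    history: &mut Vec<String>,
    streamer: &mut M,
    handler: &mut T,
) {
    let prompt = history.join("\n");
    for output in streamer.stream(&prompt) {
        match output {
            ModelOutput::Text(text) => history.push(format!("assistant: {text}")),
            ModelOutput::ToolCall { name, args } => {
                let result = handler.handle(&name, &args);
                history.push(format!("tool {name}: {result}"));
            }
        }
    }
}

/// Test double standing in for an activity-backed streamer.
struct ScriptedStreamer {
    script: Vec<ModelOutput>,
}

impl ModelStreamer for ScriptedStreamer {
    fn stream(&mut self, _prompt: &str) -> Vec<ModelOutput> {
        std::mem::take(&mut self.script)
    }
}

/// Test double standing in for an activity-backed tool handler.
struct EchoHandler;

impl ToolCallHandler for EchoHandler {
    fn handle(&mut self, name: &str, args: &str) -> String {
        format!("{name}({args})")
    }
}
```

In the Temporal harness the two implementations would schedule the `model_call` and `tool_exec` activities instead of answering locally; the sampling loop itself does not change.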

Codex-Temporal Changes (next steps)

Workflow

Uses Session::new_minimal() (already done)
Approval decisions via ExecApprovalRequest signal + wait_condition
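The approval flow can be modelled as a small piece of workflow state: the workflow registers a pending request, a signal handler records the decision, and the wait condition polls a predicate. The sketch below shows only that state machine. The `ApprovalState` type, its method names, and the string call IDs are hypothetical, and the Temporal SDK wiring for signals and wait conditions is deliberately left out.

```rust
use std::collections::HashMap;

#[derive(Debug, Clone, Copy, PartialEq)]
enum ApprovalDecision {
    Approved,
    Denied,
}

/// Workflow-side bookkeeping for outstanding approval requests.
#[derive(Default)]
struct ApprovalState {
    // call id -> decision (None while the request is still pending)
    decisions: HashMap<String, Option<ApprovalDecision>>,
}

impl ApprovalState {
    /// Called by the workflow before it blocks: marks `call_id` as pending.
    fn request(&mut self, call_id: &str) {
        self.decisions.insert(call_id.to_string(), None);
    }

    /// Body of the signal handler. Returns true if the decision was recorded.
    /// Unknown or already-decided requests are ignored, so a duplicate
    /// signal cannot overwrite the first decision.
    fn on_signal(&mut self, call_id: &str, decision: ApprovalDecision) -> bool {
        match self.decisions.get_mut(call_id) {
            Some(slot) if slot.is_none() => {
                *slot = Some(decision);
                true
            }
            _ => false,
        }
    }

    /// Predicate for the wait condition: Some(..) once a decision has arrived.
    fn decision(&self, call_id: &str) -> Option<ApprovalDecision> {
        self.decisions.get(call_id).copied().flatten()
    }
}
```

The workflow would block until `decision(call_id)` returns `Some`, then either proceed with the tool call or report the denial back to the model.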

Activities

model_call — uses codex ModelClient / ModelClientSession (done)
tool_exec — add handlers for apply_patch, read_file, list_dir, grep_files
Each tool handler is straightforward I/O (roughly 10 to 150 lines)
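As a rough sketch of what the simpler handlers could look like, the following std-only dispatcher covers `read_file`, `list_dir`, and `grep_files`. The `ToolRequest` enum and the output formats are assumptions; `grep_files` here is a non-recursive substring search, which may be narrower than the real tool, and `apply_patch` is omitted.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Hypothetical request shape for the tool_exec activity.
enum ToolRequest {
    ReadFile { path: String },
    ListDir { path: String },
    GrepFiles { dir: String, needle: String },
}

/// Dispatches one tool call to its handler and returns the textual result.
fn tool_exec(request: &ToolRequest) -> io::Result<String> {
    match request {
        ToolRequest::ReadFile { path } => fs::read_to_string(path),
        ToolRequest::ListDir { path } => list_dir(Path::new(path)),
        ToolRequest::GrepFiles { dir, needle } => grep_files(Path::new(dir), needle),
    }
}

/// Lists entry names in `dir`, sorted, one per line.
fn list_dir(dir: &Path) -> io::Result<String> {
    let mut names = Vec::new();
    for entry in fs::read_dir(dir)? {
        names.push(entry?.file_name().to_string_lossy().into_owned());
    }
    names.sort();
    Ok(names.join("\n"))
}

/// Non-recursive substring search over the regular files directly in `dir`.
/// Each hit is formatted as `path:line_number:line`.
fn grep_files(dir: &Path, needle: &str) -> io::Result<String> {
    let mut hits = Vec::new();
    for entry in fs::read_dir(dir)? {
        let path = entry?.path();
        if !path.is_file() {
            continue;
        }
        // Skip files that cannot be read as UTF-8 text.
        let Ok(text) = fs::read_to_string(&path) else { continue };
        for (index, line) in text.lines().enumerate() {
            if line.contains(needle) {
                hits.push(format!("{}:{}:{}", path.display(), index + 1, line));
            }
        }
    }
    hits.sort();
    Ok(hits.join("\n"))
}
```

Sorting the output keeps results independent of directory iteration order, which makes the activity's result stable across runs on the same filesystem contents.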

Worker-Level State for Persistent Processes

Some codex features require long-lived processes that outlive individual activity calls: PTY sessions, JS REPL kernels, MCP server connections. Activities are stateless one-shot functions — they can't hold a subprocess or connection open between invocations.

The solution is worker-level state: the worker process owns persistent resources, and activities access them by reference.

┌──────────────────────────────────────────────────────┐
│ Worker Process (long-lived)                          │
│                                                      │
│  WorkerResources (shared across all activities)      │
│  ├── mcp_connections: HashMap<String, McpConnection> │
│  ├── pty_sessions: HashMap<String, PtySession>       │
│  ├── js_repl: Option<JsReplKernel>                   │
│  └── sandbox_config: SandboxConfig                   │
│                                                      │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐  │
│  │ Activity    │  │ Activity    │  │ Activity    │  │
│  │ tool_exec   │  │ model_call  │  │ tool_exec   │  │
│  │  accesses   │  │             │  │  accesses   │  │
│  │  PTY #3     │  │  ModelClient│  │  MCP "git"  │  │
│  └─────────────┘  └─────────────┘  └─────────────┘  │
└──────────────────────────────────────────────────────┘

How it works

Worker startup: The worker process creates WorkerResources — starts MCP servers, initializes sandbox config, prepares PTY/REPL infrastructure.
Activity access: Activities receive a reference to WorkerResources via the activity context. They look up or create persistent resources by key (e.g., workflow ID → PTY session).
Lifecycle: Resources are tied to the worker process lifetime. When the worker restarts, resources are re-created. The workflow is unaffected — it replays and activities re-establish connections.
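The look-up-or-create step can be sketched as a keyed map behind a mutex. This is a minimal illustration under stated assumptions: `PtySession` is a placeholder that only records commands, `pty_for` is a hypothetical helper, and the `tool_exec` function receives `WorkerResources` as a plain reference rather than through an SDK activity context.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

/// Hypothetical placeholder for a long-lived PTY; it only logs commands.
struct PtySession {
    id: usize,
    log: Mutex<Vec<String>>,
}

/// Owned by the worker process and shared by every activity invocation.
#[derive(Default)]
struct WorkerResources {
    pty_sessions: Mutex<HashMap<String, Arc<PtySession>>>,
}

impl WorkerResources {
    /// Returns the PTY for `workflow_id`, creating it on first use.
    fn pty_for(&self, workflow_id: &str) -> Arc<PtySession> {
        let mut sessions = self.pty_sessions.lock().unwrap();
        let next_id = sessions.len();
        sessions
            .entry(workflow_id.to_string())
            .or_insert_with(|| {
                Arc::new(PtySession { id: next_id, log: Mutex::new(Vec::new()) })
            })
            .clone()
    }
}

/// Stand-in for a tool_exec activity: one-shot call, persistent resource.
/// Returns how many commands this workflow's PTY has seen so far.
fn tool_exec(resources: &WorkerResources, workflow_id: &str, command: &str) -> usize {
    let pty = resources.pty_for(workflow_id);
    let mut log = pty.log.lock().unwrap();
    log.push(command.to_string());
    log.len()
}
```

Each call is stateless from the activity's point of view, yet two calls for the same workflow ID reach the same `PtySession`, which is what lets a terminal survive between invocations.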

What this enables

| Feature | Worker resource | Activity usage |
| --- | --- | --- |
| MCP tools | `McpConnection` per server | Activity sends request, gets response |
| JS REPL | `JsReplKernel` per workflow | Activity evaluates code, returns result |
| PTY / unified exec | `PtySession` per terminal | Activity writes command, reads output |
| Background terminals | Multiple `PtySession`s | `/ps` queries workflow state, execution via activities |
| Sandbox (bubblewrap) | `SandboxConfig` | Activity wraps subprocess in bubblewrap |
| Network proxy | `ManagedProxy` | Activity routes requests through proxy |

Trade-off: Sticky task routing

Worker-level state means a workflow's activities must run on the same worker that holds its resources. This requires sticky task routing — assigning each workflow to a specific worker via a workflow-specific task queue or session-based routing.

This breaks the "any worker can handle any task" assumption, but is the correct trade-off for a coding agent:

The worker runs on the machine with the code (filesystem access required)
Coding agents are inherently single-machine (you edit files on one machine)
The worker IS the machine — sticky routing is the natural model

For multi-machine deployments, each machine runs its own worker with its own task queue. The TUI connects to the workflow, which routes activities to the right worker.
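One way to picture the routing is a per-machine task queue name plus a workflow-to-queue assignment that never changes once made. The sketch below is an assumption-heavy illustration: the `codex-activities-{hostname}` naming scheme and the `ActivityRouter` type are hypothetical, and a real workflow would keep the assignment in workflow state rather than in a standalone struct.

```rust
use std::collections::HashMap;

/// Hypothetical naming scheme: one activity task queue per machine.
fn worker_task_queue(hostname: &str) -> String {
    format!("codex-activities-{hostname}")
}

/// Remembers which worker queue each workflow is pinned to.
#[derive(Default)]
struct ActivityRouter {
    assignments: HashMap<String, String>, // workflow id -> task queue
}

impl ActivityRouter {
    /// Pins a workflow to a machine's queue. The first assignment wins, so
    /// later activities keep landing on the worker that holds the resources.
    fn pin(&mut self, workflow_id: &str, hostname: &str) -> &str {
        self.assignments
            .entry(workflow_id.to_string())
            .or_insert_with(|| worker_task_queue(hostname))
            .as_str()
    }

    /// Queue to schedule this workflow's activities on, if it has been pinned.
    fn queue_for(&self, workflow_id: &str) -> Option<&str> {
        self.assignments.get(workflow_id).map(String::as_str)
    }
}
```

Making the pin immutable is the design choice that matters here: re-pinning a workflow to another machine would silently orphan its PTY sessions and working directory.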
