Run Receipt
2026-05-06-goal-weighted-2026-04-22_2026-05-06-frontier-v2
- Status
- published
- Window
- 2026-04-22 to 2026-05-06
- Signals
- 6
Revision Reason
Elevates Codex /goal from generic worker-native state to a first-class long-horizon autonomy signal.
Frontier Roll-Up: April 22-May 6, 2026
The last two weeks were not about one winning coding agent. They were about worker tools becoming fuller environments.
Codex added persisted goals, which is the most important signal in this window: worker-native objectives are beginning to survive beyond a single prompt or session. Around that, Codex also expanded permission profiles, plugin workflows, external session import, and multi-agent controls. Claude Code pushed cloud multi-agent review, session recaps, plugin distribution, hook behavior, MCP governance, and telemetry attribution. Gemini CLI hardened workspace trust and environment loading while experimenting with reviewable memory patches. Hermes introduced a background Curator for skill-library maintenance. Pi kept its minimal-harness posture while rapidly changing providers, transports, extensions, and terminal rendering.
Main Signals
Worker-Native Goals Are Emerging
Codex /goal is the strongest signal in this window. It is not just memory.
It is a worker-native objective register: a way for a coding worker to carry a
durable direction of travel across longer arcs of work.
That unlocks more serious long-horizon work, but it also creates a new authority question. A provider-native goal can guide a worker, but it should not silently become the operator's charter, mandate, or memory.
Bitter should receipt worker goals explicitly and reconcile them against
CHARTER.md, the run mandate, and the wake packet.
Signal: 2026-05-06-worker-native-goals
Worker-Native State Is Becoming a Memory Layer
Claude session recaps, Gemini Auto Memory, and Hermes Curator all point in the same direction: workers are learning how to carry context forward.
Bitter should leverage those surfaces, but receipt them as worker-native state: which recap, memory patch, skill report, or resume state governed the work, and what crossed back into operator-owned receipts and wake packets.
Signal: 2026-05-06-worker-native-memory
Authority Semantics Are Explicit, But Fragmented
Codex expanded permission profiles and sandbox metadata. Gemini added secure
.env loading, workspace trust, and shell allowlists. Claude's changelog moved
around plugins, hooks, MCP, telemetry, and permission prompts. Pi's provider
and extension layers changed quickly.
Bitter capability profiles should record the worker's actual trust and permission state: version, channel, env policy, sandbox/profile, plugin set, MCP surface, transport, and credential posture.
Signal: 2026-05-06-fragmented-authority-semantics
Verification Is Moving Into the Workers
Claude /ultrareview is the clearest signal: provider-native cloud fleets can
review branches and PRs. Codex multi-agent controls, Gemini subagent/eval work,
and Hermes Curator reports all rhyme with it.
Bitter should treat these as evidence producers, not final truth. The run contract still needs to say what evidence proves progress and what gets settled into memory.
Signal: 2026-05-06-worker-verification
Plugins, Extensions, and Skills Are the New Surface Area
Codex plugins, Claude plugins, Gemini extensions/MCP, Hermes skills, and Pi extension APIs are becoming the integration membrane.
Bitter adapters should record enabled plugin, extension, and skill surfaces. BitterLearn should not ingest worker-local skills or memories as durable Bitter memory without settlement.
Signal: 2026-05-06-plugin-extension-skill-surface
Worker Integrations Are Not Doctrine
Pi removed built-in Gemini CLI and Antigravity support while adding many new providers. Gemini's stable, preview, and nightly channels differ materially. Codex alpha and app-server surfaces move quickly.
The durable layer is the run contract: charter, mandate, authority, execution, evidence, judgment, memory, and next run.
Signal: 2026-05-06-worker-integrations-not-doctrine
What Operators Should Do
- Treat persistent worker goals as mission registers that must be reconciled against the operator's charter and run mandate.
- Treat worker-native state as useful but not authoritative.
- Record which goals, recaps, memories, plugins, skills, permission profiles, release channels, and transports were active during serious runs.
- Treat provider-native review as evidence, not final judgment.
What Bitter Should Do Next
- Define
worker_goalreceipt fields and settlement rules for Codex/goal. - Draft adapter receipt vocabulary for worker-native state, permissions, plugin surfaces, verification outputs, and release-channel metadata.
- Build small probes for Codex
/goal, Claude/ultrareview, Gemini workspace trust and memory patches, Hermes Curator output, and Pi session transport/extension metadata. - Create a worker capability matrix before deeper integration work.