Paperclip

open source commits · evidence floor: release note · updated 2026-06-03

Operator Stance · as of 2026-06-03

Use it for
Teams treating agents as labor instead of as a tool — roles, issues, budgets, and review gates as first-class objects you can govern. Operations centralizing secrets across multiple agent adapters in one vault layer.
Avoid it for
Solo developers with a single agent: Paperclip's model assumes there are several agents to coordinate. Anyone running v2026.511.0 or earlier with SSH adapters should upgrade — the prior version forwarded host env (including API keys) to remote targets.
Watch next
Whether Paperclip's review-gate state machine generalizes past its own workflow model, and how the hosted-Cursor adapter is calibrated against on-host execution.

Operator Read

Paperclip models agent work as a company: agents have roles, work items are issues, work happens in workspaces, and progress moves through a board. The bet is that multi-agent operations should look like operating a team (issues, budgets, reviewers, audit trails), not like running a chat session. The research question Paperclip raises: can agent labor be governed as operating state, with auditable credentials and enforced review gates, rather than through a dashboard built on top of an honor system?

Coordination and Adapter Surface

Configure Paperclip when you want one control plane in front of multiple coding agents. Adapters declare a runtime command spec that carries its own install recipe for remote provisioning — operators do not hand-write provisioning scripts per CLI. Remote execution targets reach the host through a scoped sandbox callback bridge with serialization against concurrent heartbeats and env sanitization at the boundary. The bridge is the only documented path; remote targets cannot reach arbitrary host state.

Sandbox providers are pluggable — E2B, Daytona, Cloudflare, exe.dev — and the cursor_cloud adapter routes work to Cursor's hosted-agent platform through @cursor/sdk, mapping Paperclip heartbeats to Cursor's durable-agent and per-run model with session reuse, streaming, and cancellation.

Governance Made Mechanical

The thesis Paperclip is testing: governance should be enforced, not documented. Agents cannot self-transition an issue to in_review by asserting it in output — the state change requires the configured review workflow. The shared principle with Hermes' Kanban gate is "no evidence, no state change," enforced at different layers: Paperclip at the issue state machine, Hermes at the multi-worker Kanban task.
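
A minimal sketch of the "no evidence, no state change" idea at the issue state machine, in Python. The names (Issue, Status, transition, via_review_workflow) are hypothetical and are not Paperclip's API. The exact enforcement criteria are an assumption, since the release notes do not spell them out.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    IN_PROGRESS = "in_progress"
    IN_REVIEW = "in_review"
    DONE = "done"


class TransitionDenied(Exception):
    """Raised when a state change is requested without the required path."""


@dataclass
class Issue:
    id: str
    status: Status = Status.IN_PROGRESS


def transition(issue: Issue, target: Status, *, via_review_workflow: bool = False) -> Issue:
    # Assumption: only the configured review workflow may move an issue into
    # in_review; an agent claiming "ready for review" in its output does not count.
    if target is Status.IN_REVIEW and not via_review_workflow:
        raise TransitionDenied(f"{issue.id}: in_review requires the review workflow")
    issue.status = target
    return issue
```

The point of the sketch is that the check lives in the transition function itself, so a caller cannot skip it by wording its output differently.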

Operators get budget surfacing as a control-plane primitive. Per-issue cost summaries roll up token and runtime spend; agents can be paused and resumed from the sidebar; budget-paused agents are surfaced explicitly and require a non-sidebar resume path — budget exhaustion is not silently ignored.
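
The pause and resume rule above can be sketched as follows. This is an illustration under stated assumptions: Agent, paused_reason, and the source labels ("sidebar", "budget_settings") are hypothetical names, not Paperclip identifiers.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Agent:
    name: str
    paused_reason: Optional[str] = None  # None (running), "manual", or "budget"


def pause(agent: Agent, reason: str) -> None:
    agent.paused_reason = reason


def resume(agent: Agent, *, source: str) -> bool:
    """Resume an agent; returns False if it was not paused."""
    if agent.paused_reason is None:
        return False
    # A budget pause must not be cleared by the quick sidebar toggle.
    if agent.paused_reason == "budget" and source == "sidebar":
        raise PermissionError(f"{agent.name} is budget-paused; resume via another path")
    agent.paused_reason = None
    return True
```

Recording why an agent is paused, rather than a bare paused flag, is what lets the two resume paths differ.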

Issues carry a standard / planning work mode through the full stack — database, validators, server, plugin protocol, heartbeats, board UI — and the mode is preserved through suggested follow-up issues. Routines keep an append-only revision log so operators can preview prior revisions, see structured change summaries, restore older definitions, and recover webhook secrets after restore.
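
One way to picture an append-only revision log is sketched below. RoutineHistory and Revision are hypothetical names, and modeling a restore as a new appended revision (rather than a rewind) is an assumption about how "append-only" and "restore" fit together.

```python
import copy
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class Revision:
    number: int
    definition: Dict[str, Any]
    summary: str


class RoutineHistory:
    def __init__(self, initial: Dict[str, Any]) -> None:
        self._revisions: List[Revision] = [Revision(1, copy.deepcopy(initial), "created")]

    @property
    def current(self) -> Revision:
        return self._revisions[-1]

    def revisions(self) -> List[Revision]:
        # Return a copy so callers cannot drop or reorder history.
        return list(self._revisions)

    def update(self, definition: Dict[str, Any], summary: str) -> Revision:
        rev = Revision(len(self._revisions) + 1, copy.deepcopy(definition), summary)
        self._revisions.append(rev)
        return rev

    def restore(self, number: int) -> Revision:
        if not 1 <= number <= len(self._revisions):
            raise ValueError(f"no revision {number}")
        old = self._revisions[number - 1]
        # Restoring appends a new revision; earlier entries are never rewritten.
        return self.update(old.definition, f"restored from revision {number}")
```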

Structural Governance Generalizes (v2026.517–v2026.525)

The 2026-05-13 → 2026-05-27 window extends the structural-not-asserted thesis from in_review issue transitions to three new surfaces: assignment, routine env secrets, and approved documents.

Scoped agent permissions with protected assignments (PR #6386, v2026.525.0) route issue and agent-assignment mutations through a real authorization service with protected-assignment enforcement. Assignment is no longer "agent declared, server believed." Plugin SDK and host APIs gain company-settings slots and policy/grant management. Blocked issues get retry-now affordances; an incremental principal-access compatibility backfill runs against pre-existing data.
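
A rough sketch of what routing assignment through an authorization check can look like. Everything here is hypothetical: the grant strings ("issue:assign", "issue:assign_protected"), the Principal and Issue types, and the two-step rule are illustrative assumptions, not the service added in PR #6386.

```python
from dataclasses import dataclass
from typing import FrozenSet


@dataclass(frozen=True)
class Principal:
    id: str
    grants: FrozenSet[str] = frozenset()


@dataclass
class Issue:
    id: str
    assignee: str
    protected: bool = False


class Forbidden(Exception):
    pass


def reassign(issue: Issue, new_assignee: str, actor: Principal) -> Issue:
    # The server decides from grants; the caller's own claim is never trusted.
    if "issue:assign" not in actor.grants:
        raise Forbidden(f"{actor.id} may not assign issues")
    # Protected assignments need an additional, narrower grant.
    if issue.protected and "issue:assign_protected" not in actor.grants:
        raise Forbidden(f"{actor.id} may not change a protected assignment")
    issue.assignee = new_assignee
    return issue
```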

Routine env secrets with documented precedence (PR #6212, v2026.525.0) make routine env flow through the runtime contract with persisted revisions and agent < project < routine precedence. Safe secret metadata surfaces in routine UI/history without exposing secret values in logs or secret_access_events. The precedence is named in release notes — it is meant to be an operator concept.
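
Reading "agent < project < routine" as lowest to highest precedence, the layering can be sketched as a simple ordered merge. The function name resolve_env and the origin map are hypothetical; the origin map only illustrates how metadata can be surfaced without exposing values.

```python
from typing import Dict, Mapping, Tuple

LAYERS = ("agent", "project", "routine")  # lowest to highest precedence


def resolve_env(
    agent: Mapping[str, str],
    project: Mapping[str, str],
    routine: Mapping[str, str],
) -> Tuple[Dict[str, str], Dict[str, str]]:
    resolved: Dict[str, str] = {}
    origin: Dict[str, str] = {}
    # Later layers overwrite earlier ones, so routine values win.
    for name, layer in zip(LAYERS, (agent, project, routine)):
        for key, value in layer.items():
            resolved[key] = value
            origin[key] = name  # safe to display: layer name only, no value
    return resolved, origin
```

For example, with TOKEN set at all three layers, the routine's value is the one the runtime receives, and the origin map reports "routine" for that key.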

Board-managed document locks (PR #6009, v2026.517.0) preserve approved snapshots, route agent writes to derived documents, expose lock state in UI and API, and record lock activity. Approved documents cannot be overwritten by an agent in-place; agent writes are diverted to a derived document.
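
The diversion of agent writes can be sketched as below. DocumentStore, agent_write, and the ".derived" naming are hypothetical; the sketch only models agent writes and does not assume anything about what the board itself may do to a locked document.

```python
from typing import Dict, List, Set, Tuple


class DocumentStore:
    def __init__(self) -> None:
        self.docs: Dict[str, str] = {}
        self.locked: Set[str] = set()
        self.activity: List[Tuple[str, str]] = []  # (event, doc_id)

    def approve(self, doc_id: str) -> None:
        if doc_id not in self.docs:
            raise KeyError(doc_id)
        self.locked.add(doc_id)
        self.activity.append(("locked", doc_id))

    def agent_write(self, doc_id: str, content: str) -> str:
        """Write as an agent; returns the id of the document actually written."""
        if doc_id in self.locked:
            # The approved snapshot stays intact; the write lands in a derived doc.
            derived_id = f"{doc_id}.derived"
            self.docs[derived_id] = content
            self.activity.append(("diverted", doc_id))
            return derived_id
        self.docs[doc_id] = content
        return doc_id
```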

Together: governance is enforced at the structural layer, not asserted by the agent. Assignment goes through authz; secrets layer with documented precedence; documents lock at approval.

Modal as a first-party sandbox plugin and ACPX-Claude settings deference. Modal joins E2B, Cloudflare, Daytona, and exe.dev as a first-party sandbox plugin (PR #6245), with cold-start-friendly probe timeouts. The ACPX-Claude adapter now resolves bare Claude model IDs, surfaces real diagnostic detail instead of opaque "Internal error", and respects user ~/.claude/settings.json permissions (PR #6590) — the control plane defers to the agent-owned permission file rather than owning permissions top-down. This composition pattern is the shape captured in proposed amendment-006 (composition findings).

Credential Trust Boundaries

Treat the SSH host-env isolation fix as a security advisory if you are below v2026.511.0. SSH remote execution prior to that fix forwarded host API keys, tokens, and paths to remote execution targets — the host environment was not a safe passthrough to remote workers. After the fix, env is stripped at the boundary.
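
To make "stripped at the boundary" concrete, here is a minimal allowlist sketch. It is an assumption that stripping works by allowlist; the actual fix may use a different rule. build_remote_env is a hypothetical helper, not a Paperclip function.

```python
from typing import Dict, Iterable, Mapping


def build_remote_env(host_env: Mapping[str, str], allowed: Iterable[str]) -> Dict[str, str]:
    # Start from nothing and copy only explicitly allowed names, so a new
    # host variable (for example an API key) is excluded by default.
    allowed_keys = set(allowed)
    return {k: v for k, v in host_env.items() if k in allowed_keys}
```

Usage would pass something like os.environ as host_env and hand only the result to the remote target. The design choice worth noting is default-deny: forgetting to list a variable fails closed instead of leaking it.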

Centralize credentials in the new provider-vault configuration when you have multiple agents needing the same secret. AWS Secrets Manager is the first remote-import backend; operators import credentials, preview changes before committing, track binding usage, record secret_access_events, and configure rotation guards. Rotation is tracked with fingerprints and timestamps per secret version.
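
Tracking rotation by fingerprint and timestamp, without storing the secret value in the history, might look like the sketch below. SecretVersion, fingerprint, and record_version are hypothetical names, and the truncated SHA-256 digest is an illustrative choice, not Paperclip's scheme.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List, Optional


@dataclass(frozen=True)
class SecretVersion:
    version: int
    fingerprint: str
    recorded_at: datetime


def fingerprint(value: str) -> str:
    # Short digest used to tell versions apart; the value itself is not kept.
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:16]


def record_version(
    history: List[SecretVersion], value: str, now: Optional[datetime] = None
) -> SecretVersion:
    entry = SecretVersion(
        version=len(history) + 1,
        fingerprint=fingerprint(value),
        recorded_at=now or datetime.now(timezone.utc),
    )
    history.append(entry)
    return entry
```

Comparing the latest fingerprint against a freshly imported value is enough to tell whether a rotation actually changed the secret. For low-entropy secrets a keyed hash would likely be a safer fingerprint than a plain digest.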

Deployment Reality

Paperclip is not a two-minute install. It requires Postgres, a running server, and a configured set of adapter environments. The cloud deployment path addresses this; local deployment assumes the operator already runs that kind of infrastructure. Treat Paperclip as a system you adopt deliberately, not a tool you bolt on.

Posture basis: 2026-05-07-paperclip-agent-company-control-plane, 2026-05-12-paperclip-secrets-vaults-and-cursor-cloud, 2026-05-27-paperclip-scoped-permissions-and-routine-env-secrets.

Open Questions

  • The principal-access compatibility backfill (PR #6386) suggests that pre-existing data lacks principal-access metadata. What was the pre-backfill governance baseline, and what should operators on older versions do to be safe? This may have the shape of a security advisory.
  • How does Paperclip's authz service compose with the agent-owned permission file (~/.claude/settings.json via ACPX-Claude)? Disjoint surfaces or composing layers? Resolution rule not in release notes. Linked to proposed amendment-006.
  • Modal cold-start probe timeout at 120s: sandbox-class characteristic or Modal-specific quirk? Affects timeout settings for other cold-start providers.
  • What constitutes a "real review path" for the in_review restriction? Does it require a human reviewer, a configured approval workflow, or just a non-agent state transition? The enforcement criteria are not documented outside the PR.
  • The secrets rotation guard is visible in the database schema. What triggers a rotation pull from AWS Secrets Manager -- polling, webhook, or manual import?
  • Planning mode carries a work_mode flag through the stack. Does the flag change agent behavior during execution (tool restrictions, output format), or is it purely a classification signal for the UI and workflow?
  • The cursor_cloud adapter maps Paperclip heartbeats to Cursor's durable-agent model. What is Cursor's durable-agent model, and how do its cancellation semantics compare to Paperclip's local agent pause/resume?
  • The plugin host surface now allows plugins to declare managed agents and routines. How does plugin-managed agent lifecycle interact with the control plane's heartbeat and recovery systems?

What To Watch Next

  • Whether the in_review enforcement criteria get documented in the main docs or remain implicit in the PR.
  • Secrets rotation automation: whether the AWS Secrets Manager integration gains an automatic pull path (webhook, scheduled import) vs. remaining manual.
  • The planning mode flag's behavioral impact: whether it affects agent execution or is purely a classification layer.
  • Budget governance expansion: whether per-issue cost summaries evolve into enforced budget caps (hard stops, budget-pause automation).
  • Whether multi-user access control (v2026.427.0, company memberships, invites) integrates with the review and approval workflow to enable role-based governance of agent authority.

Source contract: sources/paperclip.yml · https://paperclip.ing/

Profiles are maintained by the Bitter research loop.