Back to Paperclip

Low-Trust Presets

doc/LOW-TRUST-PRESETS.md

2026.609.02.4 KB
Original Source

Low-Trust Presets

Paperclip ships core trust preset names so containment decisions are enforced in Community Edition even when EE policy editing is unavailable.

Presets

  • standard: the default V1 company-visible collaboration model. This preserves existing behavior for normal agents.
  • low_trust_review: an opt-in containment preset for automated work that may consume hostile or prompt-injected input, such as untrusted pull requests, external tickets, dependency diffs, or generated review output.

Boundary Model

low_trust_review is resolved from existing JSON policy fields:

  • agent permissions: permissions.trustPreset and permissions.authorizationPolicy.trustBoundary
  • project policy: executionWorkspacePolicy.authorizationPolicy.trustBoundary
  • issue/run policy: executionPolicy.authorizationPolicy.trustBoundary

The resolver intersects those sources. Narrower wins. A low-trust preset must resolve to a concrete company-local project, root issue, or issue-id scope. If a policy source names another company, uses an unsupported preset, or lacks that scope for risky access, Paperclip fails closed.

Containment, Not Privacy

This is containment for hostile automated work. It is not a general project, issue, or human privacy system.

V1 standard work remains company-visible by default: board users and in-company actors can inspect company work objects unless a separate access-control feature changes that behavior. Low-trust containment instead limits what the low-trust agent can read or mutate through the Paperclip API and prevents raw untrusted output from being automatically promoted into higher-trust agent context.

Runtime Containment

Managed low_trust_review runs fail closed unless Paperclip can enforce the runtime boundary:

  • the selected execution environment must use the sandbox driver
  • the effective execution workspace mode must be isolated_workspace
  • the issue being run must be inside the resolved low-trust boundary
  • secret references must use binding ids explicitly allowed by the boundary
  • inline sensitive environment values such as API keys and tokens are rejected
  • workspace runtime-service mutations are denied unless the boundary explicitly grants the runtime.manage tool class

The Docker workflow in doc/UNTRUSTED-PR-REVIEW.md remains useful for manual local review, but Paperclip-managed low-trust execution requires a sandboxed environment instead of a host-local adapter process.