Paperclip

Playbook

April 1, 2026

Generation is mostly solved, whether it’s writing text or code, or creating images or music. Undoubtedly, models have gotten to a level that feels advanced enough for most users across a variety of use cases and industries. The next step, however, involves proactive actions and more agentic workflows. Similar to how enterprises operate, coordination now introduces a management problem. That’s the problem that Paperclip set out to solve. It treats agents less like chatbots and more like employees with managers, budgets, approvals, and an audit trail. (Paperclip)

Why Agent Teams Need Management

Founders and entrepreneurs (especially solopreneurs) don’t need another agent that writes decent copy or patches a file. They need a way to run a small army of agents without losing the plot. That’s Paperclip’s thesis: if agents are becoming workers, they need a company around them. Paperclip makes the company the unit of orchestration: who owns the task, what it costs, who approves risky actions, and what happened last time. (Paperclip)

Paperclip is the clearest open-source attempt so far to turn a pile of agents into something you can actually operate. It is strong where most agent stacks are thin: hierarchy, task ownership, budgets, approvals, and auditability. It is weaker in the places that decide trust at scale: auth edge cases, audit fidelity, security hardening, and mature multi-human governance. (Paperclip)

What Paperclip Is

Paperclip is a Node.js server and React UI that sits above agent runtimes. It acts as a control plane that can orchestrate local and remote agents through adapters for Claude Code, Codex, Gemini CLI, OpenCode, Cursor, OpenClaw, Hermes, generic shell processes, and HTTP webhooks. In its own architecture docs, the core design choice is explicit: Paperclip orchestrates agents; it does not replace their runtime. (Paperclip)

The product makes the most sense for technical founders, small product teams, agencies, and operators already coordinating several agents. The README is blunt about that audience, and the hosted product broadens the pitch by saying non-technical users can operate the dashboard while developers go deeper with adapters, APIs, and plugins. (GitHub)

Enterprise as a Product

Paperclip’s breakthroughs are structural, not model-scientific. It makes the company a first-order object. It uses heartbeats instead of pretending every agent should run continuously. It requires atomic task checkout once multiple agents can grab the same work. It keeps communication attached to issues and comments. It uses skills as modular markdown instructions with metadata-first loading, which keeps prompts smaller. Paperclip is ready for serious pilots in bounded, low-blast-radius workflows. It is not yet the product you’d trust to run unsandboxed production changes, payments, or other irreversible actions without human review.

The Core Idea

Paperclip’s central move is treating the company, not the agent, as the primary abstraction. A company has a goal, employees, org structure, and a task hierarchy, meaning one human operator should be able to run a small AI-native company end to end with visibility and control. That is a sharper idea than “multi-agent framework,” because it brings in the missing management layer: chain of command, budgets, approvals, and a board that can step in anywhere. (GitHub)

Rather than an enterprise SaaS, Paperclip is open-source, self-hosted software with no account required for local use. Local onboarding can start the API server with an embedded database and local storage, without external database setup. There is also a hosted commercial product that packages the same basic control-plane idea into a no-install service. (GitHub)

The system is company-scoped. You create a company, set a goal, create a CEO agent, build the org chart, set budgets, and launch heartbeats. The board sits above the org and can pause agents, terminate them, reassign tasks, and override budgets. The reporting structure is intentionally strict: every non-CEO agent reports to exactly 1 manager, forming a tree rather than a matrix org. (Paperclip)

Paperclip is not a chatbot, not a prompt manager, not a drag-and-drop workflow builder, not a single-agent tool, and not a code review product. The product stays focused on operating agents as a managed workforce rather than trying to absorb every adjacent tool category. (GitHub)

There is one more nuance that also matters. The open-source core is still centered on a single human board operator, and the README roadmap still lists multiple human users as future work. The hosted commercial product already markets enterprise features such as SSO, SAML, and role-based access control. Those 2 things can both be true, but they are not the same product maturity story. (GitHub)

Who It’s For

Paperclip is best for teams that already know a single good agent is not the problem. If the pain is coordination rather than generation, the product makes immediate sense. The target user is someone juggling many agents, wanting 24/7 operation, cost controls, and a workflow that feels closer to a task manager than a chat window.

The clearest fit today is the technical founder or very small team building a product faster than headcount should allow. A second good fit is an agency, research desk, or media operation with recurring work and structured handoffs. A third is an engineering team mixing coding agents with manager-style agents that assign, review, and escalate. Public usage evidence points in those directions. (Paperclip)

The weak fit is just as important. Paperclip is overkill for 1 strong agent and a human operator. It is also the wrong starting point if your real problem is model capability rather than coordination. Paperclip will not make a weak agent smart. It can make a fleet of decent agents more legible. (GitHub)

A Small Product Team

A high-signal first deployment is a 3 or 4 agent product org: CEO, CTO, engineer, maybe designer. The board sets 1 measurable goal, say “reduce onboarding drop-off 15 percent this month.” The CEO proposes the strategy. The board approves it. The CTO breaks that into issues. The engineer claims 1 task at a time, comments progress, and escalates blockers. A daily routine posts activation metrics into an issue. A weekly routine asks the CEO to reprioritize based on those numbers. Paperclip adds value here because the work tree, costs, approvals, and activity history stay in 1 system. GitHub, CI, and deployment still live elsewhere. (Paperclip)

A Content Or Research Desk

Paperclip also fits a small editorial or research desk: strategist, researcher, writer, editor. Company skills can hold house style, sourcing rules, and publishing checklists. Routines can create a daily market brief and a weekly editorial-planning task. Because comments are the primary communication channel and every task has a parent chain, research notes, drafts, approvals, and edits stay attached to the assignment rather than drifting across chats and documents. That is valuable when the work repeats often but still needs judgment. (Paperclip)

An Ops Or Monitoring Desk

A third use case is monitoring and incident prep. A routine can wake an ops agent every morning or on fixed intervals, check health signals through a process or HTTP adapter, and open or update tasks when something drifts. The board or a manager agent can then triage the issue and ask a coding agent to investigate. This is a clean use of Paperclip’s heartbeat model because it handles periodic attention without keeping every agent alive all day. I would not let the system take irreversible production actions on its own yet. (Paperclip)

How To Pilot It

If we were rolling Paperclip out today, we’d keep the first deployment narrow and boring.

Start with 1 company, 1 goal, and 3 agents. Paperclip’s own setup flow starts with a company goal, a CEO, the org chart, and budgets. Mirror that. Keep the first org shallow. (Paperclip)
Pick 1 runtime family for the first month. Mixed fleets are powerful, but they complicate auth, debugging, and cost analysis. Paperclip is adapter-agnostic, so you can mix later once the operating model is stable. (Paperclip)
Force all work through issues and comments. That is where checkout, status, audit history, and budget visibility live. Side chats erase most of Paperclip’s value. (Paperclip)
Turn budgets on immediately. Use low per-agent caps at first, and raise them only after a week of clean runs. Paperclip warns at 80 percent and auto-pauses at 100 percent. (Paperclip)
Treat local_trusted as local only. Move to authenticated mode for anything shared, and enable strict secret mode beyond local trusted. If you import a company, verify adapter configs before re-enabling heartbeat timers, because imported companies disable them by default. (Paperclip)
Keep human approval on anything expensive, destructive, or external-facing. Hires, production deploys, payments, customer emails, and data deletes should stay gated. That is consistent with the product’s approval model and with its current maturity. (Paperclip)

Final Take

Paperclip’s real contribution is that it makes organization the product. It treats agents like employees with managers, budgets, and approvals, rather than like isolated clever loops. Paperclip is already good enough to study seriously and to pilot in small, technical, low-blast-radius settings. It looks strongest for founder-led teams, agencies, and operators who already feel multi-agent sprawl. The idea is ahead of the maturity curve. The idea is also strong enough that the maturity curve is worth watching closely. (Paperclip)