Introduction

Building agents is easy. Running them in production is hard. Polos handles sandboxes, durability, approvals, triggers, and observability - so you can focus on writing the agent. No DAGs. No graph syntax. Just Python or TypeScript.

from polos import Agent, sandbox_tools, SandboxToolsConfig, DockerConfig

# Create a sandboxed environment - agents get exec, read, write,
# edit, glob, and grep tools automatically.
sandbox = sandbox_tools(SandboxToolsConfig(
    env="docker",
    docker=DockerConfig(image="node:20-slim", memory="2g"),
))

# Give the agent sandbox tools - it can now run commands,
# read/write files, and explore the codebase autonomously.
coding_agent = Agent(
    id="coding_agent",
    provider="anthropic",
    model="claude-sonnet-4-5",
    system_prompt="You are a coding assistant. The repo is at /workspace.",
    tools=sandbox,  # exec, read, write, edit, glob, grep
)

Polos vs. Typical Agent Frameworks

Challenge	Typical agent framework	With Polos
Sandboxing	None - DIY or run unsandboxed	Docker, E2B + built-in tools (exec, files, search)
Durability	Agent crashes, start over	Auto-retry, resume from exact step
Approvals	Build it yourself	Slack, UI, terminal - one tap
Slack	Build a bot from scratch	@mention an agent, get responses in-thread
Triggers	Glue code for every webhook	Built-in: HTTP, webhooks, cron, events
Observability	Grep through logs	Full tracing, every tool call, every decision
Cost	Re-run failed LLM calls from scratch	Durable execution, prompt caching, 60-80% savings

What You Get With Polos

Sandboxed Execution - Isolated Docker and E2B environments. Built-in tools: exec, read, write, edit, glob, grep, web_search. No tool code to write, no sandbox lifecycle to manage.
Human-in-the-Loop - Approval flows for any tool call. Reach your team via Slack, Discord, email. Paused agents consume zero compute.
Durable Workflows - Auto-retry, log-replay, concurrency control. Resume from the exact step that failed. 60-80% cost savings via prompt caching.
Slack Integration - @mention your bot to trigger any agent. Responses stream back to the thread. Tool approvals show up as Approve/Reject buttons in Slack.
Triggers - Webhook URLs, HTTP API, cron schedules, event-driven.
Observability - OpenTelemetry tracing for every step, tool call, and approval. Full execution history. Visual dashboard for monitoring and debugging.
Bring Your Stack - Any LLM via Vercel AI SDK and LiteLLM. CrewAI, LangGraph, and Mastra compatible. Python or TypeScript. Open source - run anywhere.

CLI and UI to monitor and run your agents

polos dev                    # Start server + worker with hot reload
polos run <agent>            # Start an interactive session with an agent
polos agent list             # List available agents
polos tool list              # List available tools
polos logs <agent>           # Stream logs from agent runs

Polos observability dashboard showing agent traces

See It in Action

Watch a coding agent built with Polos - sandboxed execution, tool approvals, and real-time observability.

What Can You Build?

Coding Agent - Clones repos, writes code, runs tests, and opens PRs - all inside a Docker sandbox with human approval before anything ships.
Data Analyst - Connects to data warehouses, executes SQL in sandboxed environments, builds charts, sends approval pages before sharing with stakeholders.
Research Agent - Crawls multiple sources, extracts findings, builds knowledge bases. Durable execution checkpoints after each source. Notifies via Slack or Discord.
Order Processing - Handles incoming orders, runs fraud checks, suspends for human review on flagged transactions. Exactly-once execution guarantees no double charges.
Scheduled Workflows - Cron-triggered agents that run nightly data syncs, generate reports, or clean up stale resources. Durable state means missed runs pick up where they left off.

Next Steps

Quickstart

Build and run a sandboxed coding agent in minutes

Core Concepts

Learn how Polos handles state and durability

Examples

See how to build HITL agents and multi-agent systems

Polos is 100% open source. Star us on GitHub if you find it useful, and join our Discord to connect with the community.

Introduction

Getting Started

Fundamentals

Agents

Workflows

Observability

Guides and Examples

Community

Welcome to Polos

Introduction

Polos vs. Typical Agent Frameworks

What You Get With Polos

CLI and UI to monitor and run your agents

See It in Action

What Can You Build?

Next Steps

Quickstart

Core Concepts

Examples

Introduction

Getting Started

Fundamentals

Agents

Workflows

Observability

Guides and Examples

Community

Documentation Index

​Introduction

​Polos vs. Typical Agent Frameworks

​What You Get With Polos

​CLI and UI to monitor and run your agents

​See It in Action

​What Can You Build?

​Next Steps

Quickstart

Core Concepts

Examples

Introduction

Polos vs. Typical Agent Frameworks

What You Get With Polos

CLI and UI to monitor and run your agents

See It in Action

What Can You Build?

Next Steps