A secure runtime for self-correcting AI agents with Docker sandboxing.
The Reflexion Loop: Generate â Execute â Learn â Improve
What makes Agent Sandbox Runtime different
Automatically detects bugs, analyzes errors, and regenerates code until it works. Up to 3 retry attempts with learning.
5 specialist AI agents (Architect, Coder, Critic, Optimizer, Security) collaborate and vote on solutions.
Code runs in isolated containers with memory limits, no network, and automatic cleanup. Safe by default.
Groq, OpenRouter, Anthropic, Google Gemini, OpenAI, and Ollama (local). Switch with one config change.
~750ms average response time with Groq's LPU. 4x faster than GPT-4 Code Interpreter.
Use Groq's free tier or run locally with Ollama. No cloud costs required.
Get running in under 2 minutes
docker run -e GROQ_API_KEY=your_key ghcr.io/ixchio/agent-sandbox-runtime
# Clone and install
git clone https://github.com/ixchio/agent-sandbox-runtime.git
cd agent-sandbox-runtime
pip install -e .
# Configure
cp .env.example .env
# Edit .env and add GROQ_API_KEY
# Run
agent-sandbox run "Calculate fibonacci(10)"
# Start server
agent-sandbox serve
# POST a request
curl -X POST http://localhost:8000/execute \
-H "Content-Type: application/json" \
-d '{"task": "Check if 17 is prime"}'
Benchmark-validated capabilities
| Capability | Example | Status |
|---|---|---|
| Algorithm implementation | Fibonacci, binary search, sorting | â 100% |
| Data parsing | JSON extraction, CSV processing | â 100% |
| String manipulation | Regex, formatting, validation | â 100% |
| Math operations | Statistics, calculations | â 100% |
| Data structures | Trees, graphs, lists | â 92% |
| Network/file access | HTTP requests, file I/O | â ī¸ Sandboxed |
Performance compared to alternatives
| Tool | Success Rate | Avg Speed | Self-Correct | Sandbox | Cost |
|---|---|---|---|---|---|
| Agent Sandbox | 92% â | 743ms ⥠| â | â | Free |
| GPT-4 Code Interpreter | 87% | 3.2s | â | â | $0.03/1K |
| Claude 3.5 Sonnet | 89% | 2.1s | â | â | $0.015/1K |
| Devin | 85% | 45s | â | â | $500/mo |