Security

Phantom runs as a persistent process with full computer access. This document covers the security model, authentication, permissions, and hardening.

Slack Permissions

Bot Token Scopes (all required)

These are configured automatically by the app manifest.

Scope	Purpose
`app_mentions:read`	Hear @Phantom mentions in channels
`channels:history`	Read messages in public channels
`channels:read`	See public channel list
`chat:write`	Send messages and replies
`chat:write.public`	Post to public channels without being invited
`groups:history`	Read messages in private channels
`im:history`	Read direct messages
`im:read`	See DM list
`im:write`	Start DM conversations
`reactions:read`	Track feedback reactions (thumbs up/down)
`reactions:write`	Add status reactions (eyes, brain, checkmark)

App Token Scopes

Scope	Purpose
`connections:write`	Socket Mode WebSocket connection

Why each scope is needed

app_mentions:read + channels:history - The core interaction model. Users @mention Phantom in a channel, and the bot reads the message context to respond. Without channels:history, the bot cannot see the conversation thread.
channels:read - Used to verify channel existence and list available channels. Not strictly required for basic operation but needed for the bot to discover which channels it has access to.
chat:write + chat:write.public - The bot must be able to send messages. chat:write covers channels the bot has joined. chat:write.public allows posting to any public channel by ID without needing to be invited first. Thread replies, progressive updates, and the intro message all require these scopes.
groups:history - Same as channels:history but for private channels. Without this, the bot silently fails to respond in private channels.
im:history + im:read + im:write - Direct message support. im:read lets the bot see its DM list, im:history lets it read DM content, and im:write lets it open new DM conversations (used for the DM-based onboarding option).
reactions:read - Phantom tracks thumbsup/thumbsdown/heart reactions as feedback signals. These feed into the self-evolution pipeline to improve the agent over time.
reactions:write - The status reaction state machine adds emoji reactions to user messages to show processing state (eyes -> brain -> wrench -> checkmark). Without this, the user has no visual indicator that the bot is working.

Environment Variables

Variable	Required	Purpose
`ANTHROPIC_API_KEY`	Yes	Claude Opus 4.7 API access
`SLACK_BOT_TOKEN`	For Slack	Bot user OAuth token (`xoxb-`)
`SLACK_APP_TOKEN`	For Slack	App-level token for Socket Mode (`xapp-`)
`SLACK_CHANNEL_ID`	For Slack	Default channel for intro message on first start
`TELEGRAM_BOT_TOKEN`	For Telegram	Telegram bot token from @BotFather
`OWNER_EMAIL`	For web chat	Email address for magic link login to `/chat`
`RESEND_API_KEY`	For email login	Resend API key for sending magic link emails
`PORT`	No (default 3100)	HTTP server port

Web Chat Authentication

The web chat at /chat uses cookie-based sessions with magic link login:

On first visit, the user enters their email address
Phantom sends a one-time magic link via Resend (or prints a bootstrap token to stdout)
Clicking the link sets a session cookie (30-day expiry, HttpOnly, SameSite=Lax)
The cookie is validated on every chat API request and SSE connection

On first run without Slack, Phantom triggers the login flow automatically for OWNER_EMAIL. The magic link token is single-use and expires after 30 minutes.

Web Push (VAPID)

Web Push notifications use VAPID (Voluntary Application Server Identification). The VAPID key pair is generated on first use and stored in SQLite (the kv_store table). Private keys never leave the server. Push subscriptions are per-browser and stored in SQLite.

File Attachment Security

Uploads to /chat/sessions/:id/attachments are validated server-side:

Type allowlist: images (JPEG, PNG, GIF, WebP), PDF, and text/code files only
Size limits: images 10 MB, PDFs 32 MB, text 1 MB, total request 40 MB
Per-request cap: 10 files maximum
Filename sanitization: path separators, special characters, and leading dots are stripped
Preview Content-Disposition: attachment previews use inline disposition so browsers render them rather than downloading

MCP Authentication

All MCP requests require a Bearer token. Tokens are hashed with SHA-256 before storage. The raw token is shown once during creation and never stored.

# Create tokens
bun run phantom token create --client claude-code --scope operator
bun run phantom token create --client dashboard --scope read

# Tokens are stored as hashes in config/mcp.yaml

Scopes

Scope	Access
`read`	Query status, memory, config, metrics, history, list dynamic tools
`operator`	read + ask questions, create tasks
`admin`	operator + register/unregister dynamic tools

Admin scope is required for dynamic tool registration because dynamic tools can execute arbitrary code.

Rate Limiting

Token bucket rate limiter per client. Default: 60 requests/minute, burst of 10. Configure in config/mcp.yaml.

Webhook Authentication

The webhook channel uses HMAC-SHA256 signature verification:

Request body is signed with a shared secret
Timestamp freshness check (5-minute window) prevents replay attacks
Timing-safe comparison prevents timing attacks

Agent Safety Hooks

The agent runtime includes safety hooks:

Dangerous Command Blocker - blocks destructive commands (rm -rf /, mkfs, dd to device, docker system prune, git push --force, etc.) before execution. This is defense-in-depth, not a security boundary. The real safety layers are the constitution, LLM judges, and owner access control.
File Tracker - tracks all files the agent reads and writes for audit

Constitution

The self-evolution engine has 8 immutable principles in phantom-config/constitution.md that cannot be modified by the evolution process:

Never exfiltrate data
Never modify its own safety hooks
Always respect user corrections
Always maintain audit trail
(and 4 more)

These are enforced by the Constitution Gate during every evolution cycle. The constitution checker rejects any config delta that would violate these principles.

5-Gate Validation

Every evolution change passes through 5 gates:

Constitution Gate - does it violate immutable principles?
Regression Gate - does it break golden test cases?
Size Gate - is any config file over 200 lines?
Drift Gate - has semantic distance from the original drifted too far?
Safety Gate - does it touch protected patterns?

When LLM judges are enabled, triple-judge voting with minority veto is used for safety-critical gates (safety and constitution). A single judge objection blocks the change.

Secrets Management

API keys are provided via environment variables, never stored in config files
config/channels.yaml uses ${ENV_VAR} substitution so tokens stay in the environment
MCP tokens are stored as SHA-256 hashes, not plaintext
On Specter VMs, secrets are injected via cloud-init and deleted after boot
phantom init writes tokens to .env.local (gitignored) when provided interactively, but skips .env.local when tokens come from environment variables (avoids duplicating secrets)

Network Security (Specter VMs)

When deployed via Specter:

Dual firewall (Hetzner Cloud Firewall + ufw), ports 22/80/443 only
Caddy provides automatic TLS via Let's Encrypt
systemd hardening: NoNewPrivileges, ProtectSystem=strict, ProtectHome=read-only, PrivateTmp
fail2ban for SSH brute-force protection
The agent process runs as the specter user, not root
Memory limits: 2GB max, 1.5GB high watermark, 256 max tasks

Audit Logging

Every MCP interaction is logged in SQLite:

Client name, method, tool name
Input summary (truncated to 200 chars)
Duration, cost, success/error status
Timestamp

View recent entries via phantom_history MCP tool.

Dynamic Tool Security

Dynamic tools (registered at runtime by the agent) execute code in isolated subprocesses:

Only admin-scoped clients can register tools
Two handler types: shell (bash commands) and script (bun scripts on disk)
The inline handler type (which used new Function(), equivalent to eval) has been removed as a security P0 fix
Subprocesses run with a sanitized environment containing only PATH, HOME, LANG, TERM, and TOOL_INPUT. API keys, tokens, and other secrets are never passed to dynamic tool subprocesses.
Bun script handlers use --env-file= to prevent automatic loading of .env files
Tool input is passed via the TOOL_INPUT environment variable (JSON string)

Avatar Upload

The Settings > Identity card accepts PNG, JPEG, and WebP images up to 2 MB, stored at data/identity/avatar.<ext> with a companion avatar.meta.json. The upload path is locked down across several layers:

Zero server-side image decoding. Bun writes bytes verbatim; the image is only ever decoded inside the browser's sandboxed renderer. This eliminates the "malformed JPEG crashes Bun" class of attack.
MIME allowlist plus magic-byte sniff. image/png, image/jpeg, image/webp only. SVG is rejected at MIME AND by inspecting the first bytes, which catches SVG (or any other format) renamed to .png and submitted with a forged MIME.
Extension derived from the validated MIME. The operator's filename is never used in any filesystem path, so path traversal via a crafted filename is impossible.
2 MB cap enforced at content-length AND at read. The client-side Content-Length header is treated as a hint; the server re-checks after reading so a lying or missing header cannot bypass the cap.
Atomic tmp + rename. Both the image file and its meta JSON are written to *.tmp first and then renamed, so a mid-write failure leaves the previous avatar intact.
Auth posture. POST + DELETE require the cookie session (owner). GET /ui/avatar and the scope-friendly mirror GET /chat/icon are public because the landing page renders before login.

The avatar is operator-visual state, not configuration; it is not subject to the phantom.yaml audit log.

Webhook Callback URL Validation

Webhook callback URLs are validated before use to prevent SSRF attacks:

Private IP ranges are blocked (10.x, 172.16-31.x, 192.168.x, 127.x)
Cloud metadata endpoints are blocked (169.254.169.254, metadata.google.internal)
Localhost and 0.0.0.0 are blocked
Only HTTP and HTTPS protocols are allowed

Layered Security Model

Phantom's security is defense-in-depth, with multiple independent layers:

Owner access control - Only the configured owner can talk to the agent (Slack user ID filtering)
Constitution - 8 immutable behavioral principles the agent cannot override
LLM safety judges - Independent Sonnet judges review evolution changes, triple-judge with minority veto. Sonnet is the default cross-model judge since main runs on Opus; operators may opt into Opus judges explicitly.
Dangerous command blocker - Regex patterns catch obvious destructive commands (not a security boundary)
Subprocess isolation - Dynamic tools run in clean environments without secrets
MCP authentication - Bearer tokens with scoped access (read/operator/admin)
Network firewalls - Hetzner Cloud Firewall + ufw, ports 22/80/443 only
Audit logging - All MCP interactions logged to SQLite

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security

Slack Permissions

Bot Token Scopes (all required)

App Token Scopes

Why each scope is needed

Environment Variables

Web Chat Authentication

Web Push (VAPID)

File Attachment Security

MCP Authentication

Scopes

Rate Limiting

Webhook Authentication

Agent Safety Hooks

Constitution

5-Gate Validation

Secrets Management

Network Security (Specter VMs)

Audit Logging

Dynamic Tool Security

Avatar Upload

Webhook Callback URL Validation

Layered Security Model

FilesExpand file tree

security.md

Latest commit

History

security.md

File metadata and controls

Security

Slack Permissions

Bot Token Scopes (all required)

App Token Scopes

Why each scope is needed

Environment Variables

Web Chat Authentication

Web Push (VAPID)

File Attachment Security

MCP Authentication

Scopes

Rate Limiting

Webhook Authentication

Agent Safety Hooks

Constitution

5-Gate Validation

Secrets Management

Network Security (Specter VMs)

Audit Logging

Dynamic Tool Security

Avatar Upload

Webhook Callback URL Validation

Layered Security Model