
feat(server): add MCP_ALLOWED_HOSTS env var for DNS rebinding allowlist #2

Open

tricamtech wants to merge 1 commit into thebackpackdevorg:main from tricamtech:feat/mcp-allowed-hosts

Conversation

@tricamtech

Problem

When the MCP server binds to 127.0.0.1/localhost (the recommended posture when fronting with a reverse proxy or tunnel), FastMCP automatically enables its DNS rebinding protection middleware with only the localhost wildcards in the allow list — see mcp/server/fastmcp/server.py:178-181 in the upstream mcp SDK:

if transport_security is None and host in ("127.0.0.1", "localhost", "::1"):
    transport_security = TransportSecuritySettings(
        enable_dns_rebinding_protection=True,
        allowed_hosts=["127.0.0.1:*", "localhost:*", "[::1]:*"],
    )

This is the right default for direct local clients, but it breaks the exact deployment pattern this project's README recommends in the "Remote deployment" section:

To access the vault from other machines, expose it through a reverse proxy. The server supports an optional OAuth layer for authentication when deployed remotely.

…with a Cloudflare Tunnel + Access example. With FastMCP's defaults, every request that arrives via the tunnel carries Host: vault.example.com and gets rejected:

HTTP/1.1 421 Misdirected Request
2026-04-08 ... mcp.server.transport_security WARNING Invalid Host header: vault.example.com

There's currently no way to tell vault-mcp-server about additional acceptable Host header values — the FastMCP transport_security argument is hardcoded to None, and the only escape hatches are to disable DNS rebinding protection entirely (losing the protection) or to bind to a non-localhost address.

Reproduction

VAULT_PATH=/some/markdown/dir SERVER_HOST=127.0.0.1 SERVER_PORT=8091 \
  python -m vault_mcp.server &

# Front it with anything that rewrites Host (cloudflared tunnel, nginx, traefik, etc.)
# Then from outside:
curl -X POST https://vault.example.com/mcp \
  -H 'Content-Type: application/json' \
  -H 'Accept: application/json, text/event-stream' \
  -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{...}}'
# → HTTP 421, "Invalid Host header"

Fix

Add a simple MCP_ALLOWED_HOSTS env var (comma-separated). When set, it gets merged with the localhost wildcards FastMCP would have injected anyway, and passed through as an explicit transport_security argument:

ALLOWED_HOSTS = [
    h.strip() for h in os.environ.get("MCP_ALLOWED_HOSTS", "").split(",") if h.strip()
]

if ALLOWED_HOSTS:
    from mcp.server.transport_security import TransportSecuritySettings
    _allowed = ["127.0.0.1:*", "localhost:*", "[::1]:*", *ALLOWED_HOSTS]
    _mcp_kwargs["transport_security"] = TransportSecuritySettings(
        enable_dns_rebinding_protection=True,
        allowed_hosts=_allowed,
    )
    logger.info("DNS rebinding allowed_hosts: %s", _allowed)

- Empty / unset preserves the existing behavior (FastMCP constructs its default)
- Wildcards like vault.example.com:* are honored by the underlying middleware
- Local stdio/HTTP testing isn't affected because the localhost defaults are still in the merged list

Verification

After the patch on the same setup as above:

MCP_ALLOWED_HOSTS=vault.example.com,vault.example.com:* \
  VAULT_PATH=... SERVER_HOST=127.0.0.1 SERVER_PORT=8091 \
  python -m vault_mcp.server
# Logs: DNS rebinding allowed_hosts: ['127.0.0.1:*', 'localhost:*', '[::1]:*', 'vault.example.com', 'vault.example.com:*']

curl -X POST https://vault.example.com/mcp [...]
# → HTTP 200 + valid initialize response

Notes

Found while deploying vault-mcp-server behind a Cloudflare Tunnel for cross-device access from claude.ai web. Really enjoying the section-level chunk retrieval design — it's noticeably more token-efficient than competing markdown MCP servers that dump whole files. Thanks for the project.

Sister PR to #1 (the vault_reindex is_ready guard).

When the MCP server binds to 127.0.0.1/localhost FastMCP auto-enables
its DNS rebinding protection with only `127.0.0.1:*`, `localhost:*`,
and `[::1]:*` in the allowlist (mcp/server/fastmcp/server.py:178-181
in the upstream mcp SDK). This is the right default for direct local
use, but it breaks any deployment where you front the server with a
reverse proxy or tunnel that rewrites the Host header — every request
hits the server with `Host: vault.example.com` and FastMCP returns
HTTP 421 "Misdirected Request" with a `transport_security WARNING
Invalid Host header` log line.

This is exactly the deployment shape the README's "Remote deployment"
section recommends:

  > To access the vault from other machines, expose it through a
  > reverse proxy. The server supports an optional OAuth layer for
  > authentication when deployed remotely.

…with a Cloudflare Tunnel example. Without this change that pattern
hits the 421 wall on first request.

Add a simple env var: `MCP_ALLOWED_HOSTS` is comma-separated and gets
merged with the localhost defaults. Set it to your reverse-proxied
hostname(s) and you're done. Empty string preserves the existing
behavior (let FastMCP do its thing). Wildcards like
`vault.example.com:*` are honored by the underlying middleware.

Reproduction:

  VAULT_PATH=/some/markdown/dir SERVER_HOST=127.0.0.1 SERVER_PORT=8091 \
    python -m vault_mcp.server &
  curl --resolve vault.example.com:443:<your CF edge IP> \
    -X POST https://vault.example.com/mcp \
    -H 'Content-Type: application/json' \
    -H 'Accept: application/json, text/event-stream' \
    -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{...}}'
  # → HTTP 421, "Invalid Host header: vault.example.com"

After patch:

  MCP_ALLOWED_HOSTS=vault.example.com,vault.example.com:* \
    VAULT_PATH=... SERVER_HOST=127.0.0.1 SERVER_PORT=8091 \
    python -m vault_mcp.server
  # → HTTP 200 + valid initialize response

Hit while deploying vault-mcp behind a Cloudflare Tunnel for cross-
device access from claude.ai web. Working great with section-level
chunk retrieval — really enjoying the design.
tricamtech pushed a commit to tricamtech/vault-mcp-server that referenced this pull request Apr 8, 2026
The OAuth approval form's PIN input has `maxlength="10"` hardcoded
in `_render_approve_page` (auth.py:321). When `OAUTH_PIN` is set to
anything longer than 10 characters, the browser silently truncates
the user's input — they paste the full PIN, the form submits the
truncated version, and the server returns the generic "Wrong PIN"
error with no indication that the truncation happened. The user
ends up rotating PINs in confusion.

Hit while configuring OAuth for a deployment with a 16-char hex PIN
generated via `secrets.token_hex(8)`. The "Wrong PIN" error sent me
hunting through env-var loading, systemd reload, and process env
inspection before I noticed the maxlength constraint in the rendered
HTML.

Fix:

1. Compute `pin_maxlen = max(len(provider.pin), 16)` so the field
   always accommodates the actual configured PIN, with a floor of 16
   so there's headroom without hardcoding another magic number.

2. Widen the CSS so a longer PIN renders visibly: replace fixed
   `width: 120px` with `min-width: 120px; max-width: 280px` and
   reduce `letter-spacing` from 4px to 2px so 16 chars fits inside
   the wider field.

Short default PINs are unaffected (a 4-char PIN just gets the bumped-up
16-char input box, which still looks fine).
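
A minimal sketch of that change, assuming the approval page is built from
an f-string inside `_render_approve_page` and the provider exposes the
configured PIN as `provider.pin` (the real template has more markup):

  pin_maxlen = max(len(provider.pin), 16)  # never narrower than the configured PIN
  pin_input = (
      f'<input type="text" name="pin" maxlength="{pin_maxlen}" '
      f'style="min-width: 120px; max-width: 280px; letter-spacing: 2px;">'
  )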

Reproduction:

  OAUTH_PIN=$(python3 -c 'import secrets; print(secrets.token_hex(8))') \
  OAUTH_ISSUER_URL=https://example.com VAULT_PATH=... \
    python -m vault_mcp.server
  # Browser: navigate the OAuth flow, type/paste the 16-char PIN
  # → "Wrong PIN" error every time, no clue why.

After patch:

  Same setup, paste the 16-char PIN → approved → token issued.

Found while filing thebackpackdevorg#1 (vault_reindex guard) and thebackpackdevorg#2 (MCP_ALLOWED_HOSTS).
tricamtech pushed a commit to tricamtech/vault-mcp-server that referenced this pull request Apr 8, 2026
vault-mcp's incremental indexing currently only runs:

- at server startup (full mtime scan via index_all)
- when its own write tools modify a file (vault_write/vault_edit
  call indexer.index_file inline)

External changes — git pull, manual edits in Obsidian, sync scripts,
another process editing the markdown — are invisible until the next
server restart. For deployments where the vault directory is shared
with anything other than vault-mcp itself (which is most of them),
the index goes stale fast.

Add a background asyncio task that uses watchfiles (already a
transitive dep via uvicorn) to subscribe to filesystem events under
VAULT_PATH and react in real time:

- added or modified .md → indexer.index_file(rel_path)
- deleted .md          → indexer._delete_files_chunks([rel_path])
- non-.md events       → ignored
- watcher crashes      → logged, server keeps running (auto-reindex
                         silently disabled until restart)

The watcher starts immediately after the initial index_all completes
(at the end of indexer.start()), so the server is fully ready before
the watcher starts emitting events. Per-file errors are caught and
logged so a single bad chunk doesn't take down the watcher.
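
A rough sketch of that watcher task, assuming the indexer methods named
above (the actual integration with the server lifecycle may differ):

  import logging
  from pathlib import Path

  from watchfiles import Change, awatch

  logger = logging.getLogger(__name__)

  async def watch_vault(vault_path: Path, indexer) -> None:
      """Re-index markdown files as they change on disk, until cancelled."""
      try:
          async for changes in awatch(vault_path):
              for change, raw_path in changes:
                  path = Path(raw_path)
                  if path.suffix != ".md":
                      continue  # non-.md events are ignored
                  rel_path = str(path.relative_to(vault_path))
                  try:
                      if change in (Change.added, Change.modified):
                          indexer.index_file(rel_path)
                      elif change == Change.deleted:
                          indexer._delete_files_chunks([rel_path])
                  except Exception:
                      logger.exception("Watcher: failed to handle %s", rel_path)
      except Exception:
          logger.exception("Watcher crashed; auto-reindex disabled until restart")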

Configuration:

- New env var VAULT_WATCH (default: true). Set VAULT_WATCH=false to
  disable the watcher entirely if you want the old behavior.
- New constructor arg VaultIndexer(..., watch_files=True) so library
  users can opt out without env vars.
- watchfiles added as an explicit dependency in pyproject.toml. It
  was already pulled in transitively via uvicorn, but pinning it
  explicitly means the import won't break if uvicorn drops it.
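
For example (the import path and constructor arguments here are
assumptions for illustration; only the watch_files flag comes from this
change):

  from vault_mcp.indexer import VaultIndexer  # import path assumed

  indexer = VaultIndexer(vault_path="/srv/vault", watch_files=False)  # opt out of the watcher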

Verification — full add/modify/delete cycle on a live server, no
restart between steps:

  $ mkdir /tmp/vault && echo '# initial' > /tmp/vault/a.md
  $ VAULT_PATH=/tmp/vault SERVER_HOST=127.0.0.1 SERVER_PORT=8794 \
      python -m vault_mcp.server &
  ... Filesystem watcher started on /tmp/vault

  # Add
  $ echo '# new file with Shadowfax the horse' > /tmp/vault/b.md
  ... Watcher: re-indexing b.md (added)
  $ vault_search "Shadowfax" → returns b.md, score 0.752

  # Modify
  $ echo '# revised, mentions Wisteria Lane' > /tmp/vault/a.md
  ... Watcher: re-indexing a.md (modified)
  $ vault_search "Wisteria" → returns a.md with new content, 0.771
  $ vault_search "old content phrase" → no longer matches a.md

  # Delete
  $ rm /tmp/vault/b.md
  ... Watcher: removing b.md
  $ vault_search "Shadowfax" → only weak fallback match, b.md gone

Found while deploying vault-mcp behind a Cloudflare Tunnel for
cross-device access. The vault is also git-synced from a separate
machine, so the "restart to pick up new content" UX would have been
painful. Sister to thebackpackdevorg#1 (vault_reindex guard), thebackpackdevorg#2 (MCP_ALLOWED_HOSTS),
and thebackpackdevorg#3 (PIN field maxlength).
tricamtech pushed a commit to tricamtech/vault-mcp-server that referenced this pull request Apr 8, 2026
The SimpleOAuthProvider stores everything in in-memory dicts:

  self._clients: dict[str, OAuthClientInformationFull] = {}
  self._access_tokens: dict[str, AccessToken] = {}
  self._refresh_tokens: dict[str, RefreshToken] = {}
  self._auth_codes: dict[str, AuthorizationCode] = {}
  self._pending: dict[str, dict] = {}

Restart wipes all of them. For deployments where the only consumer is
a single Claude Desktop instance running on the same machine that's
restarted manually, this is fine — that's the use case the docstring
explicitly calls out:

  Stores clients, authorization codes, and tokens in memory.
  Tokens survive until server restart (acceptable for personal use).

But for the deployment shape the README's "Remote deployment" section
recommends — vault-mcp running as a long-lived systemd service behind
a reverse proxy, with claude.ai web / Claude desktop / multiple Claude
Code clients all connected through dynamically-registered OAuth — a
restart silently invalidates every client's stored bearer token. The
next request from each client returns 401, and the user has to walk
through the PIN approval flow again per-client. Even worse: if the
restart happens with an approval page open in a tab, the new process
has no _pending entry for the request_id, so the user gets "Invalid
or expired authorization request" with no clue that the entire
state was wiped, not just their pending request.

I hit this several times today running the public instance behind a
Cloudflare Tunnel for cross-device access. Every PIN rotation, every
config change requiring `systemctl restart`, every accidental crash
required re-authing in claude.ai's connector dialog.

## Fix

Add optional disk persistence for the long-lived state:

- New constructor arg: `state_path: Path | None = None`
- New env var:        `OAUTH_STATE_PATH`
- When set, the provider loads `_clients`, `_access_tokens`, and
  `_refresh_tokens` from a JSON file on init, and writes them back
  after every mutation.

What is persisted:
- Registered DCR clients (`_clients`)
- Issued access tokens (`_access_tokens`)
- Issued refresh tokens (`_refresh_tokens`)

What is NOT persisted (intentionally):
- The per-process `cf_bypass_token` — regenerated each start so it
  can never be exfiltrated from a stale state file. The save filter
  `if k != self._cf_token` excludes it explicitly.
- `_auth_codes` — short-lived (5 minute expiry by design), no value
  in persisting
- `_pending` — even shorter-lived (in-flight authorization requests),
  no value in persisting; surviving across restart could even be
  confusing (user clicks Approve on a half-dead request_id)

Implementation details:

- Atomic writes via `tempfile.mkstemp` + `os.replace` so a crash
  mid-write can't corrupt the state file
- File mode 0600 (owner-only readable; effectively root-only when the
  service runs as root, like /root/.env)
- `asyncio.Lock` (`_save_lock`) serializes concurrent saves
- Actual file I/O dispatched to the default executor to avoid
  blocking the event loop
- Save errors are logged via `logger.exception` and swallowed —
  we never want a transient disk issue to block an auth flow
- Load errors fall back to empty state with a logged warning,
  so a corrupted state file doesn't take down the server
- Refresh-token rotation now correctly drops the OLD refresh token
  from the dict (it wasn't being cleaned up before, just orphaned;
  now it's removed and the save reflects that)
- `version: 1` field in the state file for future schema migrations
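
A condensed sketch of that save path (method names, the module-level
logger, and the pydantic-style model_dump serialization are illustrative
assumptions, not the actual implementation):

  import asyncio
  import json
  import logging
  import os
  import tempfile

  logger = logging.getLogger(__name__)

  async def _save_state(self) -> None:
      async with self._save_lock:  # serialize concurrent saves
          state = {
              "version": 1,
              "clients": {k: v.model_dump(mode="json") for k, v in self._clients.items()},
              "access_tokens": {
                  k: v.model_dump(mode="json")
                  for k, v in self._access_tokens.items()
                  if k != self._cf_token  # never persist the per-process bypass token
              },
              "refresh_tokens": {k: v.model_dump(mode="json") for k, v in self._refresh_tokens.items()},
          }
          try:
              loop = asyncio.get_running_loop()
              await loop.run_in_executor(None, self._write_atomic, state)
          except Exception:
              logger.exception("Failed to persist OAuth state")  # never block an auth flow on disk issues

  def _write_atomic(self, state: dict) -> None:
      fd, tmp = tempfile.mkstemp(dir=str(self.state_path.parent))
      with os.fdopen(fd, "w") as f:
          json.dump(state, f)
      os.chmod(tmp, 0o600)  # owner-only readable
      os.replace(tmp, self.state_path)  # atomic swap: a crash mid-write can't corrupt the file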

## Verification

End-to-end on a live server:

  $ OAUTH_STATE_PATH=/tmp/state.json OAUTH_PIN=test1234 ... \
      python -m vault_mcp.server &
  ... INFO OAuth provider initialized — state path: /tmp/state.json,
           loaded 0 clients / 0 access tokens / 0 refresh tokens

  $ curl -X POST http://127.0.0.1:8795/register \
      -d '{"client_name":"test","redirect_uris":["http://localhost/cb"]}'
  → client_id: 567926af-...

  $ ls -la /tmp/state.json
  -rw------- 1 root root 719 ... /tmp/state.json
  $ jq '.clients | keys' /tmp/state.json
  ["567926af-..."]

  # Kill and restart
  $ kill %1 && python -m vault_mcp.server &
  ... INFO OAuth provider initialized — state path: /tmp/state.json,
           loaded 1 clients / 0 access tokens / 0 refresh tokens
                  ^^^^^^^^^

  # Register a second client → both are now persisted
  $ curl -X POST http://127.0.0.1:8795/register ...
  $ jq '.clients | keys' /tmp/state.json
  ["567926af-...", "bff56b80-..."]

The same flow with real access/refresh tokens (after walking the
full Authorization Code grant via the browser) preserves token
validity across restart — the connecting client doesn't need to
re-auth after a service restart.

## Backwards compatibility

`state_path` defaults to `None`. Without `OAUTH_STATE_PATH` set in
the environment, behavior is byte-identical to before this commit:
all state in memory, wiped on restart, no disk I/O, no new files.
Existing deployments are unaffected.

Sister to thebackpackdevorg#1 (vault_reindex guard), thebackpackdevorg#2 (MCP_ALLOWED_HOSTS),
thebackpackdevorg#3 (PIN field maxlength), and thebackpackdevorg#4 (watchfiles auto-reindex).