fix(sandbox): make interactive connect resilient on stopped/resumed sandboxes by marc-vercel · Pull Request #215 · vercel/sandbox

marc-vercel · 2026-06-01T17:08:39Z

Problem

sandbox connect (and interactive sandbox exec) could hang indefinitely on Waiting for connection..., or fail in a confusing way, when run against a stopped sandbox that has to be resumed. It worked reliably against an already-running sandbox, which is why it only showed up intermittently after a stop/resume.

Several independent issues combined to produce this:

Real connection errors were swallowed. Once the connection handshake landed, the abort signal that stops the "did the command exit early?" check was also used to filter errors from attach(). So any failure that happened after the handshake (for example, the resumed session not yet exposing a route for the interactive port) was silently discarded instead of surfaced.
The spinner kept the process alive. The progress spinner's teardown called ora.clear(), which only erases the current frame but leaves its render interval running. That timer keeps Node's event loop alive, so on any early teardown the CLI would sit forever on the spinner instead of exiting.
Early server exits were opaque. When the in-sandbox interactive server exited before connecting, the CLI showed a generic "may have timed out" hint with no detail.
The in-sandbox server trusted a stale config. pty-tunnel-server decided whether a server was already running purely from a leftover config file and a liveness check on its recorded PID. Across a snapshot/resume that config is restored from the snapshot while the original process is gone, so a coincidentally-reused PID made it connect to a dead socket and exit.

Solution

Stop funneling attach() through the connection-established abort filter, so genuine connection failures propagate instead of being swallowed.
Always stop() the spinner on teardown (not just clear()), so a failure before the connection is established can no longer hang the process.
Include the in-sandbox server's stderr in the error when it exits before connecting, so the real cause is visible.
Have pty-tunnel-server health-check a server before reusing it, and remove any leftover config before spawning a new one, so a stale config restored from a snapshot can no longer cause a connection to a dead socket.

Together these turn the previous silent hang into either a working connection or a fast, legible error.

🤖 Generated with Claude Code

vercel · 2026-06-01T17:08:45Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
sandbox	Ready	Preview, Comment, Open in v0	Jun 1, 2026 7:55pm
sandbox-cli	Ready	Preview, Comment	Jun 1, 2026 7:55pm
sandbox-sdk	Ready	Preview, Comment	Jun 1, 2026 7:55pm
sandbox-sdk-ai-example	Ready	Preview, Comment	Jun 1, 2026 7:55pm
workflow-code-runner	Ready	Preview, Comment	Jun 1, 2026 7:55pm

…andboxes `sandbox connect` could hang on "Waiting for connection..." or fail when run against a stopped/resumed sandbox. Three independent issues: - The CLI swallowed real `attach()` failures: once the connection handshake landed, the same abort signal used to stop the premature-exit check also discarded any later `attach()` error, so failures were never surfaced. - The spinner's disposer called `ora.clear()` instead of `stop()`, leaving the render interval running and keeping the event loop (and the CLI) alive indefinitely on teardown. - When the interactive server exited early, the generic error hid the actual cause; we now include the server's stderr. - The in-sandbox server (pty-tunnel-server) trusted a leftover /tmp/vercel/interactive/config.json restored from a snapshot whenever its recorded PID happened to be alive, connecting to a dead socket. It now health-checks a reused server and removes the stale config before spawning a fresh one. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

vercel Bot deployed to Preview – workflow-code-runner June 1, 2026 17:09 View deployment

vercel Bot deployed to Preview – sandbox-sdk-ai-example June 1, 2026 17:09 View deployment

vercel Bot deployed to Preview – sandbox-sdk June 1, 2026 17:09 View deployment

vercel Bot deployed to Preview – sandbox-cli June 1, 2026 17:09 View deployment

vercel Bot deployed to Preview – sandbox June 1, 2026 17:09 View deployment

marc-vercel force-pushed the marc-vercel/fix-interactive-connect-resume branch from b04bc6b to beb73c4 Compare June 1, 2026 19:54

vercel Bot deployed to Preview – sandbox-sdk June 1, 2026 19:55 View deployment

vercel Bot deployed to Preview – sandbox-cli June 1, 2026 19:55 View deployment

vercel Bot deployed to Preview – workflow-code-runner June 1, 2026 19:55 View deployment

vercel Bot deployed to Preview – sandbox June 1, 2026 19:55 View deployment

vercel Bot deployed to Preview – sandbox-sdk-ai-example June 1, 2026 19:55 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(sandbox): make interactive connect resilient on stopped/resumed sandboxes#215

fix(sandbox): make interactive connect resilient on stopped/resumed sandboxes#215
marc-vercel wants to merge 1 commit into
mainfrom
marc-vercel/fix-interactive-connect-resume

marc-vercel commented Jun 1, 2026

Uh oh!

vercel Bot commented Jun 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

marc-vercel commented Jun 1, 2026

Problem

Solution

Uh oh!

vercel Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented Jun 1, 2026 •

edited

Loading