Rewrite Dockerfile into multi-stage build, minor updates to support that change by rtizzy · Pull Request #1362 · oraios/serena

rtizzy · 2026-04-16T15:30:30Z

The previous Docker setup is a bit confusing but I tried to infer what was intended.

Changes:

Ensures a proper ENTRYPOINT and CMD are so the MCP server starts successfully
Utilizes a multi-stage build so the version pushed to the registry is a slim image (~275 MB) that can have various LSPs and tooling layered on top as needed.
Includes minor updates to docs and ensures the push workflow leverages the proper target (prod)

The idea here is that someone who wishes to run this MCP inside of Docker can base off the pushed serena image and layer on their necessary dependencies.

Still a bit more work to do here and I'm currently experimenting a bit locally.

MischaPanch · 2026-04-16T16:50:23Z

Thanks, yes, it would be nice to improve this. We haven't put much attention to the docker topic or to optimizing the image. Pls ping me when you are done on your side, I'll then review.

Pinging @tirthpatel90 who is also working on improving Serena's docker issues

tirthpatel90 · 2026-04-17T15:35:02Z

Awesome work @rtizzy! Just a heads-up, I'm currently working on Dockerfile.maximal in #1339 specifically to bake in all the heavy language dependencies (like R, Julia, OCaml) to accelerate the CI pipelines. Looking forward to seeing both our Docker optimizations in the next release!

rtizzy · 2026-04-22T08:15:57Z

@tirthpatel90

Thanks. A bit of constructive criticism:

It may make sense to step back and look at the entire CI/CD pipeline instead of just the Dockerfile for a few reasons.

The current multi-OS test suite is running outside Docker to test how it works on that exact OS for someone running outside Docker, which is still the standard setup. At minimum, you will still need the Windows test to exist outside Docker.

I haven't had a chance to dig fully into the tests yet but if all the tests do not require the full tool chain (My bet is they don't), what likely makes sense is something like:

Segment the tests by toolchain
Parallelize the CI/CD workflows across both OS (if desired) and by toolchain.

That means instead of needing one fat image, you can have a base optimized image (such as the one I'm working on here), base further images off of that if desired, install only the necessary toolchain, and run many of the tests in parallel.

That optimization will work with or without Docker and should result in a significant speedup.

I intended to perform that work after finishing this PR but feel free to pick it up.

rtizzy · 2026-04-22T09:28:26Z

@MischaPanch

Hope you're well.

Feel free to give this a once over and let me know if there is anything I might have missed.

I did have a question to make sure I'm understanding something correctly:

I'm pretty sure the architecture is that Serena MCP can only have one project activated at a time. I.E A single MCP server can only have one project activated and in use.

Is that correct?

tirthpatel90 · 2026-04-22T12:34:55Z

Hi @rtizzy,

Thanks for the constructive feedback! You make an excellent architectural point. Segmenting the tests by toolchain and running them in parallel matrix jobs is definitely a cleaner, more scalable CI/CD practice compared to maintaining a monolithic image.

I would be happy to pivot and pick up the CI workflow segmentation to work alongside your base image.

@MischaPanch, what are your thoughts on this direction? If you prefer this parallelized approach over the Phase 1 maximal image, I can hold off on #1339 and start drafting the matrix workflows to run the segmented tests using rlzzy's optimized image. Let me know how you'd like to proceed!

MischaPanch · 2026-04-24T09:57:00Z

Hi @rtizzy . Thanks for the contribution and the discussion!

I'm pretty sure the architecture is that Serena MCP can only have one project activated at a time. I.E A single MCP server can only have one project activated and in use.

Yes, that's correct. In this PR you let the entrypoint use http mode for the MCP server - is that needed? Can't we use stdio and somehow forward the stream? It would be easier for the users

MischaPanch · 2026-04-24T10:00:27Z

@tirthpatel90 yes, if you are up to it, the parallelized approach would be far superior. Even though we would use more compute (due to multiple container startups) and the complexity of CI would become much larger, it would have the huge benefit of not having to wait 40 minutes each time.

A necessary requirement is that in the parallelized approach we don't have to change the GH workflows each time language support is added (although we should be able to do that when desired). Thus, it should be something like batch1, batch2, ... batchN, last_batch_executing_everything_else. Do you think that's feasible?

tirthpatel90 · 2026-04-24T13:26:12Z

Hi @MischaPanch,

Yes, that is absolutely feasible and a brilliant way to future-proof the CI!

To satisfy the requirement of not modifying the GH workflows for every new language, we can dynamically batch the test discovery. For instance, we can explicitly assign the heaviest toolchains (like C++, Go, Rust, Java) into their own isolated parallel matrix jobs (batch1, batch2, etc.). For the final batch (last_batch_executing_everything_else), we can programmatically run pytest on all remaining tests that were NOT explicitly included in the previous batches.

This ensures that any newly added language tests automatically fall into the "catch-all" bucket without requiring manual YAML updates, keeping the maintenance overhead strictly at zero.

Since we are officially pivoting to this superior architecture, I will close my maximal image PR (#1339) to keep the repo clean. I'll start drafting the matrix workflows to run the segmented batches using rtizzy's leaner image and will open a fresh PR for this soon.

I'll get to work on this!

rtizzy · 2026-04-27T10:24:53Z

Hi @rtizzy . Thanks for the contribution and the discussion!

I'm pretty sure the architecture is that Serena MCP can only have one project activated at a time. I.E A single MCP server can only have one project activated and in use.

Yes, that's correct. In this PR you let the entrypoint use http mode for the MCP server - is that needed? Can't we use stdio and somehow forward the stream? It would be easier for the users

@MischaPanch

Thanks for the details. Give me a bit and I'll test things out with that approach.

stdio seems to fit the use case of Serena in VSCode much much better and completely circumvents a problem I was reasoning about.

rtizzy · 2026-05-02T13:27:17Z

@MischaPanch

Alright, I've tested this out a bit.

I have a feeling more doc updates are prudent.

I was testing this against a Continue based setup in VSCode (With/without dev containers).

The most portable thing I could come up with that worked there was something like this:

name: Serena
version: 0.0.1
schema: v1
mcpServers:
  - name: serena
    command: docker
    cwd: ${{ secrets.HOST_REPO_PATH }} 
    type: stdio
    args:
    - run
    - --rm
    - -i
    - -v
    - ./:/workspace
    - serena-infra # This is a container built with the necessary deps for my own repo.

# .continue/.env in project 
HOST_REPO_PATH="ABSOLUTE_PATH_TO_REPO_ON_HOST"

If that should be documented in the Serena docs itself is another question as it's pretty tooling specific.

MischaPanch

Thanks @rtizzy . I tried to leave some comments but seems like github is having problems.

I didn't have time to do a deep dive yet, but some things seem strange to me, maybe you could briefly explain.

what's the role of target prod vs dev here?
why are there multiple entrypoints? And why are they stdio, shouldn't it be http in the end?
Why the override in compose-dev.yaml? It only differs in the target
Shouldn't the default policy be pull? Do you want the user to always build?
The ansible and opentofu examples are rather niche, how about using rust or java as example?

rtizzy · 2026-05-15T13:10:48Z

@MischaPanch

No problem.

what's the role of target prod vs dev here?

The prod target is what is eventually pushed into Github. It's what end users would use.

dev is intended for development of Serena itself.

why are there multiple entrypoints?

Not strictly required and can be consolidated into the base if desired.

And why are they stdio, shouldn't it be http in the end?

It was originally HTTP but you requested it to be changed to stdio in an earlier comment, which likely makes more sense considering the architecture of Serena.

Why the override in compose-dev.yaml? It only differs in the target

This likely makes more sense once you consider the two workflows:

Build and push for other users
Develop serena itself.

The compose-dev example inside the repo is for people working on Serena itself.

Shouldn't the default policy be pull?

No.

Do you want the user to always build?

Yes.

Here is the way this works:

The serena project pushes a base image to Dockerhub
An end user creates a Dockerfile and references the Dockerhub image with FROM
They then layer on any necessary dependencies and build this image
This new image is how they use the Serena MCP

A user can push that into their own registry but that use case is probably unlikely for most people.

The ansible and opentofu examples are rather niche, how about using rust or java as example?

That'd be out of scope for me.

The point of the example is to show "We are providing a base image, layer on what you need in a Dockerfile. Here is a best practice example."

rtizzy · 2026-05-15T13:13:41Z

As a side note:

Useful to know what the "happy path" is intended to be.

I'd imagine the vast majority of users will end up leveraging a stdio based setup.

The "Single project" approach of Serena isn't particularly well suited to HTTP usage.

rtizzy force-pushed the dockerfilesanity branch 2 times, most recently from d190159 to a95532b Compare April 22, 2026 10:06

This was referenced Apr 24, 2026

feature: add Dockerfile.maximal to optimize CI (#1252) #1339

Closed

ci: implement parallel matrix architecture for segmented testing #1421

Open

rtizzy added 16 commits May 2, 2026 15:27

Optimized multi-stage docker build with a few fixes

25ccc7d

Make sure prod target is used for build and push

bf77343

Remove build, comment out command

3575014

Remove unused var

b6a216d

Move serena user creation and config to base layer

b239f2d

Update example command

793007e

Add example of a basing a app dev image on the base image

8d1e95e

Update the CMD to automatically activate the the project

1da1c36

Add a Serena dev compose that includes the base compose

57f04b6

Fix command, set pull_police to always, ensure proper target of prod

70865da

Update the Docker documentation

80fb04c

Ensure .devcontainer targets dev

9c107b0

Fix typo

b95e1cf

Fix typo

3e4af83

Fix typo

026867e

One more typo fix for great justice

05ba20f

rtizzy added 2 commits May 2, 2026 15:27

Default command to stdio based setup

4a09b5f

Keeping this as HTTP for now

1c692c0

rtizzy force-pushed the dockerfilesanity branch from 39c788a to 1c692c0 Compare May 2, 2026 13:28

MischaPanch reviewed May 6, 2026

View reviewed changes

MischaPanch force-pushed the main branch 2 times, most recently from 420a0ba to 016ccbe Compare May 26, 2026 11:45

opcode81 force-pushed the main branch from 6d6303e to c57b7f2 Compare May 28, 2026 11:36

Uh oh!

Conversation

rtizzy commented Apr 16, 2026

Changes:

Uh oh!

MischaPanch commented Apr 16, 2026

Uh oh!

tirthpatel90 commented Apr 17, 2026

Uh oh!

rtizzy commented Apr 22, 2026

Uh oh!

rtizzy commented Apr 22, 2026

Uh oh!

tirthpatel90 commented Apr 22, 2026

Uh oh!

MischaPanch commented Apr 24, 2026

Uh oh!

MischaPanch commented Apr 24, 2026

Uh oh!

tirthpatel90 commented Apr 24, 2026

Uh oh!

rtizzy commented Apr 27, 2026

Uh oh!

rtizzy commented May 2, 2026

Uh oh!

MischaPanch left a comment

Choose a reason for hiding this comment

Uh oh!

rtizzy commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rtizzy commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rtizzy commented May 15, 2026 •

edited

Loading