Balena multiple services and docker "layer deduplication"?

luchko · January 16, 2023, 6:46pm

Hello,
I do not feel comfortable with advanced docker stuff, but i was wondering something : can balena achieve some optimization when docker-composed services share some layers?

As an explanantion, see example below:
simplified docker-compose :

  talker0:
    image: ros:noetic-ros-core-focal
    command: stdbuf -o L rostopic pub /chatter std_msgs/String "hello" -r 1

  listener0:
    image: ros:noetic-ros-core-focal
    command: stdbuf -o L rostopic echo /chatter
(...)
  talkerN:
    image: ros:noetic-ros-core-focal
    command: stdbuf -o L rostopic pub /chatter std_msgs/String "hello" -r 1

  listenerN:
    image: ros:noetic-ros-core-focal
    command: stdbuf -o L rostopic echo /chatter

leads to following logs :

┌──────────────────┬─────────────┬────────────┐
│ Service          │ Image Size  │ Build Time │
├──────────────────┼─────────────┼────────────┤
│ talker           │ 738.35 MB   │ < 1 second │
├──────────────────┼─────────────┼────────────┤
│ ... *N           │ 738.35 MB   │ < 1 second │
├──────────────────┼─────────────┼────────────┤
│ listener         │ 738.35 MB   │ < 1 second │
└──────────────────┴─────────────┴────────────┘

Empirical tests make me conclude that I won’t use N* 738Mb of diskspace.
Can someone point me out the name of that mechanism please, and potentially condition I need to met to allow such diskspace savings !?

pipex · January 16, 2023, 7:58pm

Hi there @luchko. By the nature of docker, when pulling images, assuming you are not squashing the images, the engine will pull the shared layers once, and only pull the different layers when they exist. For instance if you have 3 different services building from a node parent image, all these services will share the node layers and the extra data downloaded will be the distinct files in these services.

In the example you provided you are correct that since there is only one image, the engine will only pull the ros:noetic-ros-core-focal image once and just configure the services with the distinct commands.

You can also achieve the same when building from a local dockerfile by using the build and image properties together. For instance

service1:
   build: ./my-service
   image: my-service
   command: ./my-command hello
service2:
   build: ./my-service
   image: my-service
   command: ./my-command goodbye

In this case both services will share the same image which means that data will only be downloaded once.

Please let us know if this answers your question or if you need some more specific examples.

luchko · January 16, 2023, 8:00pm

Thanks, that answers my question perfectly.

Would it be possible to add a column with sthg like balena system df -v at the end of balena push ?
That would allow users to see “real diskspace usage” instead of “image size”.
Thanks!

pipex · January 16, 2023, 8:11pm

Glad to help. Regarding your suggestion I can certainly make a note of it, but I’m not entirely sure how feasible it is, since the builder reports on what’s been uploaded to the registry, but on the registry these images share disk space with other app images and there is no strict separation on what belongs to a single release.

If you want more control you can always do a balena build followed by a balena deploy and query the engine state in between to get an idea on how much disk space will be used.

Topic		Replies	Views
Sharing an image among services - Balena Engine not deleting old images on release update. balenaEngine docker	3	904	April 27, 2020
question about services shared by multiple applications openBalena	1	291	March 4, 2021
Optimized image reuse across multiple services? Product support docker	1	209	September 5, 2023
Avoiding downloading the same layers multiple times Product support docker	2	473	March 25, 2019
Doubts about services sharing an image balenaOS docker-compose	1	36	February 26, 2025

Balena multiple services and docker "layer deduplication"?

Related topics