Reduce Deployment Time?

I have a multi-container application, and one service (container) is a Python application with the Nvidia driver installed inside it.

The problem is its size: the image is ~3 GB, which can take more than 30 minutes to deploy over a slow network.

My questions are:

  • Any tips to reduce the image size?
  • Is it possible to deploy a smaller container instead and then pull the fat one when the network speed is fast enough?

Cheers,
Shane.

Hi,

Have you looked at preloading?
This can probably save you some deployment time by including the application in the initial image you write to the devices.
You will also want to look at delta updates.
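Deltas are controlled by a fleet configuration variable; the variable name and CLI invocation below are from memory, so double-check them against the delta updates docs (newer balenaOS releases may already have deltas on by default):

# Enable binary delta updates for the whole fleet (variable name and flag from memory; verify in the docs)
balena env add RESIN_SUPERVISOR_DELTA 1 --application MyApp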

Hi, @TJvV
Thanks for your reply!

Preloading won’t work because our customers don’t want to download a fat image file! lol

And delta updates - thanks for sharing that with me!

Cheers,
Shane.

Hi,

Can you maybe share your docker-compose (and Dockerfiles)?
That might give some insight into how to reduce the size.

I’m guessing one of the reasons your image is so big is that you need an SDK to build the Nvidia driver?
Multi-stage builds may help reduce the size of what you’re actually deploying by splitting up the process.

Sure!

FROM balenalib/%%BALENA_MACHINE_NAME%%-ubuntu:focal

ARG RESINOS_VERSION=2.73.1%2Brev2.prod
ARG YOCTO_VERSION=5.8.18
ARG NVIDIA_DRIVER_VERSION=465.31

ENV YOCTO_KERNEL=${YOCTO_VERSION}-yocto-standard
ENV NVIDIA_DRIVER_RUN=NVIDIA-Linux-x86_64-${NVIDIA_DRIVER_VERSION}.run
ENV DEBIAN_FRONTEND=noninteractive

# Install Nvidia Driver
RUN apt-get update && apt-get install -y wget gcc build-essential apt-utils dialog aufs-tools libc-dev iptables conntrack unzip libglu1-mesa-dev
RUN wget -nv https://files.balena-staging.com/images/%%BALENA_MACHINE_NAME%%/${RESINOS_VERSION}/kernel_modules_headers.tar.gz && \
    tar -xzf kernel_modules_headers.tar.gz && \
    mkdir -p /lib/modules/${YOCTO_KERNEL} && \
    cp -r kernel_modules_headers /lib/modules/${YOCTO_KERNEL}/build && \
    ln -s /lib64/ld-linux-x86-64.so.2 /lib/ld-linux-x86-64.so.2 && \
    wget -nv http://us.download.nvidia.com/XFree86/Linux-x86_64/${NVIDIA_DRIVER_VERSION}/${NVIDIA_DRIVER_RUN} && \
    chmod +x ./${NVIDIA_DRIVER_RUN} && \
    mkdir -p /nvidia && \
    mkdir -p /nvidia/driver && \
    ./${NVIDIA_DRIVER_RUN} \
        --kernel-install-path=/nvidia/driver \
        --ui=none \
        --no-drm \
        --no-x-check \
        --install-compat32-libs \
        --no-nouveau-check \
        --no-nvidia-modprobe \
        --no-rpms \
        --no-backup \
        --no-check-for-alternate-installs \
        --no-libglx-indirect \
        --no-install-libglvnd \
        --x-prefix=/tmp/null \
        --x-module-path=/tmp/null \
        --x-library-path=/tmp/null \
        --x-sysconfig-path=/tmp/null \
        --kernel-name=${YOCTO_KERNEL} && \
    rm -rf /tmp/* ${NVIDIA_DRIVER_RUN} kernel_modules_headers.tar.gz kernel_modules_headers

# Install docker.
RUN apt-get install -y apt-transport-https ca-certificates curl gnupg-agent software-properties-common \
    && curl -fsSL https://download.docker.com/linux/debian/gpg | apt-key add - \
    && add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable" \
    && apt-get update && apt-get install -y docker-ce

# Install Nvidia Container Toolkit
RUN distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
   && curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - \
   && curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
RUN apt-get update && apt-get install -y nvidia-docker2

# Installing AWS CLI V2
RUN curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip" && unzip -qq awscliv2.zip \
    && ./aws/install --bin-dir /usr/bin && rm -r ./aws && rm awscliv2.zip

# Install some other utilities
RUN apt-get install -y python3-pip python3-dev dbus dmidecode lshw hdparm smartmontools v4l-utils && pip3 install -U pip setuptools wheel

# Enable udevd so that plugged dynamic hardware devices show up in our container.
ENV UDEV=1

ENV INITSYSTEM on

# Set our working directory
WORKDIR /usr/app

# Install OpenVINO's HDDL driver for Mustang-V100-MX8
COPY ./hddl ./hddl
RUN apt-get update && apt-get install -y cmake libudev-dev libjson-c-dev && \
    wget -nv https://files.balena-staging.com/images/%%BALENA_MACHINE_NAME%%/${RESINOS_VERSION}/kernel_modules_headers.tar.gz && \
    tar -xzf kernel_modules_headers.tar.gz && rm kernel_modules_headers.tar.gz && \
    mkdir -p /usr/src/kernel && \
    mv kernel_modules_headers/* /usr/src/kernel/ && rm -r kernel_modules_headers && \
#    ln -s /lib64/ld-linux-x86-64.so.2 /lib/ld-linux-x86-64.so.2 && \
    cd /usr/app/hddl/drv_vsc && make

RUN apt-get install -y pciutils pkg-config && cd /usr/app/hddl/hddl-bsl/src && make && make install && cd .. && \
    mkdir build && cd build && cmake .. -DINSTALL_USB_RULES=TRUE && make && make install && cd ../../../

RUN apt-get clean && rm -rf /var/lib/apt/lists/* /var/tmp/*

Hi, here are a couple of optimizations right off the bat to make the Dockerfile cleaner and slimmer:

  1. Start using the install_packages command that comes packaged in every balena base image (there is a sketch of what this could look like after this list). install_packages is an installer script that:
  • Installs the named packages, skipping prompts etc.
  • Cleans up the apt metadata afterwards to keep the image small.
  • Retries if apt fails. Sometimes a package will fail to download due to a network issue, and this may fix that, which is particularly useful in an automated build pipeline.
  2. Next, is there any specific reason to install Docker inside the container?
  3. Why not install all OS dependencies in one step so that they are cached more easily?
  4. Why install Python yourself when you can use a balena base image that has Python pre-installed? Check out Docker Hub (you will find a similar base image available for your device type).
  5. Optionally, to build faster, you can use a beefier local build machine (balena ARM servers are already quite substantial): run balena build to build locally and then balena deploy to push that image (commands sketched below).
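To make points 1 and 3 concrete, here is a rough, untested sketch with the package list lifted straight from your Dockerfile (note that install_packages runs apt-get update itself and installs with --no-install-recommends, so double-check that nothing you rely on was only coming in as a recommended package):

# One layer for all OS dependencies; install_packages handles update, retries and apt cleanup
RUN install_packages \
    wget gcc build-essential apt-utils dialog aufs-tools libc-dev iptables conntrack unzip libglu1-mesa-dev \
    apt-transport-https ca-certificates curl gnupg-agent software-properties-common \
    cmake libudev-dev libjson-c-dev pciutils pkg-config \
    python3-pip python3-dev dbus dmidecode lshw hdparm smartmontools v4l-utils

And for point 5, the local build-and-deploy flow is roughly as follows (the app name is a placeholder and the exact flags can vary between CLI versions, so check balena help):

balena build --application MyApp
balena deploy MyApp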

With each round of optimizations, do post the metrics for the final image so that we know we are making progress. Another, harder optimization you could possibly do would be to use multi-stage builds (rough sketch below), but that very much depends on your use case.
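As a very rough, untested sketch of how that split could look (assuming the runtime stage only needs the compiled driver tree under /nvidia plus the runtime packages, and not gcc, the kernel headers or the .run installer; the python base image tag is a guess, so check Docker Hub for the exact name):

# Build stage: compiler, kernel headers and the NVIDIA .run installer only live here
FROM balenalib/%%BALENA_MACHINE_NAME%%-ubuntu:focal AS driver-build
RUN install_packages wget gcc build-essential libc-dev
# ...download kernel_modules_headers.tar.gz and run the NVIDIA installer into /nvidia/driver,
#    exactly as in your current Dockerfile...

# Runtime stage: start from a slimmer base and copy over only the build output
FROM balenalib/%%BALENA_MACHINE_NAME%%-ubuntu-python:3.8-focal
COPY --from=driver-build /nvidia /nvidia
# ...install only the runtime bits (nvidia-docker2, pciutils, your Python deps, etc.) here...

Everything that was pulled in only to compile the driver (build-essential, the headers, the installer itself) then stays out of the image you actually ship.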