I'm currently working on automating a task that involves copying files from a remote field data-logger device down to a local machine. Unfortunately, the Balena releases for these devices are managed by another group, so I have limited control over how the system is set up. Here's what I've been able to achieve so far:
Use Balena Tunnel to access the host OS.
Copy files from the desired container to the host OS.
Run SCP on my local machine to retrieve the files from the host OS (roughly as sketched below).
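For reference, the current workflow looks roughly like this. This is only a sketch: the device UUID, container name, and paths are placeholders, and I'm assuming the host OS engine is invoked as `balena-engine`.

```
# 1) Open a tunnel from local port 4321 to the device's host-OS SSH port (22222)
balena tunnel <device-uuid> -p 22222:4321

# 2) In another terminal, stage the file out of the container onto the
#    host OS data partition (placeholder container name and paths)
ssh -p 4321 root@127.0.0.1 \
  "balena-engine cp <container-name>:/data/video/clip.mkv /mnt/data/"

# 3) Pull the staged copy down through the tunnel, then clean it up
scp -P 4321 root@127.0.0.1:/mnt/data/clip.mkv .
ssh -p 4321 root@127.0.0.1 "rm /mnt/data/clip.mkv"
```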
The challenge is that some field files (e.g., video files) can be larger than 100GB, while the host OS only has about 30GB of storage available. That's a bottleneck for large transfers because I can't temporarily stage the files on the eMMC. I've struggled to SCP directly from the container to my local machine, but SCP through the tunnel to the host OS has worked without issue.
To work around this, I am exploring two potential solutions:
Use Balena Tunnel to access the container I need directly, so I can run SCP from the container straight to my local machine.
Use Balena Tunnel to access the host OS and have SCP copy files through the tunnel straight from the container, without temporarily storing them on the eMMC (sketched below).
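If option 2 is viable, the part I'd want to get right is avoiding any write to the eMMC. One way I can picture doing that (a sketch only, with placeholder UUID, container name, and file paths) is to have the host OS exec into the container and stream the file over the tunneled SSH session, so the data goes straight from the container to my local machine:

```
# Tunnel to the host-OS SSH port as before
balena tunnel <device-uuid> -p 22222:4321

# Stream one large file: `cat` inside the container writes to stdout,
# ssh carries it through the tunnel, and nothing is staged on the eMMC
ssh -p 4321 root@127.0.0.1 \
  "balena-engine exec <container-name> cat /data/video/recording.mkv" \
  > recording.mkv

# Or stream a whole directory as a tar archive and unpack it locally
ssh -p 4321 root@127.0.0.1 \
  "balena-engine exec <container-name> tar cf - -C /data/video ." | tar xf -
```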
Does anyone have experience with these approaches or suggestions on how to implement either solution? Any advice on setting up a direct file transfer pipeline through Balena Tunnel would be greatly appreciated.
Unfortunately, it looks like that directory only has data for storage devices mounted on the host. I thought this was on the right track with one of my units, but it turns out that one has been writing to the eMMC since it doesn't have an NVMe installed.
On the other units, the NVMe is mounted in another container. Is there any other way you know of that I might be able to access it, similar to this method?
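For what it's worth, this is roughly how I was imagining I could look for it from a host-OS shell. Container name and paths are placeholders, and it assumes the NVMe reaches that container as a bind mount or named volume:

```
# List running containers and find the one that owns the NVMe mount
balena-engine ps

# Show where each of that container's mounts lives on the host side
balena-engine inspect <storage-container-name> \
  --format '{{range .Mounts}}{{.Source}} -> {{.Destination}}{{"\n"}}{{end}}'

# If it shows up as a bind mount, the Source path should be readable
# directly from the host OS, e.g.:
ls <source-path-from-inspect>
```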
@atwoodc first of all, we strongly discourage using balena tunnel for this. The tunnel is designed to maintain stability and reliability for all the customers who need to interact with their remote devices, and we do not recommend sending huge amounts of data through it. In fact, we are thinking about putting limits in place.
We have been thinking about alternatives. Does the remote device run balena? One option would be to add an extra container that uploads the files to S3, or to use some kind of p2p system in case there is an issue with connectivity.
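As a very rough sketch of the uploader idea (assuming the AWS CLI is installed in that extra container, the video volume is shared with it, and the bucket name and paths are placeholders):

```
#!/bin/sh
# Sync recordings from the shared volume to S3 on demand.
# BALENA_DEVICE_UUID should be available in the service container,
# so each device gets its own prefix in the bucket.
aws s3 sync /data/video "s3://<your-bucket>/$BALENA_DEVICE_UUID/" \
  --exclude "*" --include "*.mkv"
```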
A colleague mentioned that there is a relay-based system that may be worth trying.
Hi @mpous, thanks for all those suggestions! I'll have to chat with my team, but I'm sure at least one of them will work well for what we need. I'll come back and mark this resolved if anything we try works.
Extra context: we are using the embedded version of the Pi 4 with balenaOS 4.0.23.
Use-case wise, we have a remote telematics fleet that also records video. There's infrastructure in place for sensor data and short (4-second) video clips. Currently, video is retrieved from our remote units via removable hard drives that a technician has to pull and upload from satellite offices, but we'd like to be able to recover video more often, to reduce that demand on technicians and have more of an "on-demand/as-needed" data extraction service available.
We're not sure how willing our hardware vendor will be to commit to big changes since our development contract is coming to a close, so a lightweight solution would be ideal.