We had some issues with 2.47 over the last two weeks, so we decided to upgrade to the latest version, which went well. After two days of running, however, the device showed VPN only.
Jul 25 10:04:03 neonlink balenad[10700]: time="2022-07-25T10:04:03.551074404Z" level=error msg="failed to load container" container=166502a7abc46afb161306c6c3541b43b1aa1be4e77203122e5b6e244838f683 error="open /var/lib/docker/containers/166502a7abc46afb161306c6c3541b43b1aa1be4e77203122e5b6e244838f683/config.v2.json: no such file or directory"
Jul 25 10:04:03 neonlink balenad[10700]: time="2022-07-25T10:04:03.556193760Z" level=error msg="failed to load container" container=31af5b098ff67e9635ec111c9ef967344885f4b1a145dc1938e4203958e09116 error="invalid character 'L' after object key"
Jul 25 10:04:04 neonlink balenad[10700]: time="2022-07-25T10:04:04.049928457Z" level=info msg="stopping event stream following graceful shutdown" error="<nil>" module=libcontainerd namespace=moby
Jul 25 10:04:04 neonlink balenad[10700]: time="2022-07-25T10:04:04.055346093Z" level=info msg="stopping healthcheck following graceful shutdown" module=libcontainerd
Jul 25 10:04:04 neonlink balenad[10700]: time="2022-07-25T10:04:04.055579947Z" level=info msg="stopping event stream following graceful shutdown" error="context canceled" module=libcontainerd namespace=plugins.moby
Jul 25 10:04:04 neonlink extract-balena-ca[10749]: [extract-balena-ca][INFO] The config.json file does not contain custom CA
Jul 25 10:04:05 neonlink balenad[10700]: failed to start daemon: Error initializing network controller: error obtaining controller instance: failed to get bridge network configurations from store: invalid character '\r' in string literal
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Main process exited, code=exited, status=1/FAILURE
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Failed with result 'exit-code'.
Jul 25 10:04:05 neonlink systemd[1]: Failed to start Balena Application Container Engine.
Jul 25 10:04:05 neonlink balena-supervisor[10767]: Cannot connect to the balenaEngine daemon at unix:///var/run/balena-engine.sock. Is the balenaEngine daemon running?
Jul 25 10:04:05 neonlink balenad[10768]: Cannot connect to the balenaEngine daemon at unix:///var/run/balena-engine.sock. Is the balenaEngine daemon running?
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 1706 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 1707 (balena-healthch) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 1708 (balena) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6534 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6615 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6692 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6761 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6830 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6901 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 6989 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7061 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7129 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7198 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7287 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7360 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7434 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7520 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7593 (exe) in control group while starting unit. Ignoring.
Jul 25 10:04:05 neonlink systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Jul 25 10:04:05 neonlink systemd[1]: balena.service: Found left-over process 7661 (exe) in control group while starting unit. Ignoring.
The diagnostics show this:
--- prefixing commands with 'date --utc --rfc-3339=ns ; /usr/bin/time -o /dev/stdout timeout --preserve-status --kill-after=20 -v 10 bash -c' ---
Cannot connect to the balenaEngine daemon at unix:///var/run/balena-engine.sock. Is the balenaEngine daemon running?
Cannot connect to the balenaEngine daemon at unix:///var/run/balena-engine.sock. Is the balenaEngine daemon running?
I have a few questions:
Can I refresh the entire OS somehow?
In the future, would it be possible to switch to an external drive as a backup?
Any more tips to prevent this?
I have already granted support access so you can take a look.
Regarding the balenaEngine issue, let's check the service status via systemctl status balena-engine. Also, please share the device UUID so we can take a look.
We also reproduced these errors with the 64-bit Raspberry Pi 4 images of version 2.98.*, immediately after flashing the images onto 3 devices. As far as I can tell, these balenaOS images aren’t working correctly. We reverted to the raspberrypi4-64 v2.95.8 images, which have worked fine so far. My guess is that the software versions in the v2.98 images are somehow not fully compatible.
Hi Bernhard, it’s unlikely that there is something fundamentally wrong with the balenaOS 2.98 image series, as they are actively being used across thousands of Raspberry Pi devices in balenaCloud fleets.
Have you tried creating an empty fleet and downloading the images from there? Those will not run any application and will prove that the images are fine. You can then try pushing an example app to see whether they can run applications. After that, you can move the devices to your final fleet so we can debug the specific problem.
Messages of the type invalid character 'L' have been associated in the past with errors in the compose file and/or corruption in downloaded images.
Martijn, your issue seems to have been caused by filesystem corruption. Even if you fix the errors, the engine storage is usually left in an inconsistent state. If you share the UUID and grant long-term support access to the device, we can take a look.
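For anyone who wants to see how far the corruption goes, here is a minimal sketch that lists JSON files that are unreadable or no longer parse, such as the config.v2.json files mentioned in the logs above. It assumes Python 3 is available, which may not be the case on the balenaOS host itself, so you would probably run it against a copy of the engine data directory on another machine. The function name find_corrupt_json is made up for this example and is not part of any balena tooling. The error text will also differ from the logs, since balenaEngine reports its own parser's messages.

```python
import json
from pathlib import Path


def find_corrupt_json(root):
    """Return (path, reason) pairs for *.json files under root that fail to parse.

    Hypothetical helper for illustration only; not part of balena tooling.
    """
    problems = []
    for path in sorted(Path(root).rglob("*.json")):
        try:
            # Read raw bytes first so unreadable files and invalid UTF-8
            # are reported instead of aborting the scan.
            raw = path.read_bytes()
            json.loads(raw.decode("utf-8"))
        except (OSError, UnicodeDecodeError, json.JSONDecodeError) as exc:
            problems.append((str(path), str(exc)))
    return problems
```

You could call it as find_corrupt_json("/path/to/copy/of/containers") and print each returned pair. A non-empty result only tells you which files are damaged. It does not repair them, and as noted above the engine storage may still be inconsistent afterwards.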