i mean the device downloads the image, then by the time it reaches 99% it starts downloading the new release all over again. i think i saw it do that like 6-7 times today.
balenaEngine is getting killed by the watchdog. When that happens during an application update, it usually means the SD card is not coping well with the increased load. We recommend SanDisk Extreme PRO cards in these cases, as in our experience they work well. Do you know if all the Pis are using the same SD card model?
yeah all the pis are using the same sd card model. might mean this card is about to fail, or could there be something else? all pis are using the same card/pi model/same application so the load should be pretty much identical on all of them.
Hello, the device seems to have recovered and is downloading the application now. As Trong mentioned, there seemed to be a lot of unused Docker layers. I tried some cleanup to remove the dangling ones specifically, but balenaEngine still could not start on the device.
I ended up removing the supervisor and application images, which finally allowed the balena service to start and begin downloading the new release correctly. Since the device had to re-download the supervisor anyway, I also took the liberty of updating it from 10.6.27 to 10.8.0. The latest version has a lot of crucial improvements in stability and error reporting, and we are taking every chance to update devices to it. Let me know if that is OK with you.
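For anyone wanting to see how many dangling images are on a device before cleaning up, here is a minimal Python sketch. It only lists image IDs and removes nothing. It assumes the engine binary is reachable as `balena-engine` and accepts the Docker-style `images --filter dangling=true --quiet` arguments; the helper names are made up for illustration, and these are not necessarily the commands that were run on this device.

```python
import subprocess
from typing import List

# Assumption: the engine CLI is on PATH under this name and is Docker-compatible.
ENGINE = "balena-engine"


def dangling_image_command(engine: str = ENGINE) -> List[str]:
    """Build the argument list that asks the engine for dangling image IDs."""
    return [engine, "images", "--filter", "dangling=true", "--quiet"]


def parse_image_ids(output: str) -> List[str]:
    """Turn the engine's one-ID-per-line output into a clean list of IDs."""
    return [line.strip() for line in output.splitlines() if line.strip()]


def list_dangling_images(engine: str = ENGINE) -> List[str]:
    """Run the engine and return the IDs of dangling images (read-only)."""
    result = subprocess.run(
        dangling_image_command(engine),
        capture_output=True,
        text=True,
        check=True,  # raise if the engine is missing or not running
    )
    return parse_image_ids(result.stdout)
```

Calling `list_dangling_images()` on the device should return a list of IDs, or raise if the engine is not running, which was the situation described above. Deciding what to remove is deliberately left out of the sketch.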
Hi there, we’ve taken a look at the device and it appears to be struggling with disk I/O. Running the device diagnostics shows rather high disk write latency (4 s to write 1 MB).
The best course of action at this point would be to replace the SD card in this device and run the health check again to verify disk write latency is reasonable.
For reference, we’ve had very few problems with SanDisk Extreme PRO cards.
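If you want a rough way to compare cards yourself, the Python sketch below times a 1 MiB write followed by an `fsync`. It is a stand-in for illustration, not the check the device diagnostics actually run, and the function name is made up. Point `directory` at a path that really lives on the SD card; a temp directory backed by RAM will look unrealistically fast.

```python
import os
import tempfile
import time


def measure_write_latency(directory: str, size_bytes: int = 1024 * 1024) -> float:
    """Return the seconds taken to write size_bytes to a temp file and fsync it."""
    payload = os.urandom(size_bytes)  # random data, so nothing compresses it away
    fd, path = tempfile.mkstemp(dir=directory)
    try:
        start = time.monotonic()
        with os.fdopen(fd, "wb") as f:
            f.write(payload)
            f.flush()
            os.fsync(f.fileno())  # force the data out to the storage device
        return time.monotonic() - start
    finally:
        os.remove(path)  # clean up the scratch file


if __name__ == "__main__":
    # Assumption: the current working directory is on the SD card.
    elapsed = measure_write_latency(os.getcwd())
    print(f"1 MiB write + fsync took {elapsed:.3f} s")
```

A single run is noisy, so it is worth repeating a few times and comparing against a known-good card rather than reading too much into one number.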
just wanted to give an update. we didn’t do anything to the machine and it just recovered. didn’t swap the sd card. it’s awkward. you guys have any idea what could have caused it to recover?
The update might have proceeded in just the right way for your SD card to handle it without triggering any other problems. On the next update, keep an eye out to see whether the issue reappears.