I have a multi-container system operating on a NUC, and am stress-testing it. When the containers fail, I see multiple errors in Health Check. I’m having difficulty sorting out what the actual root problem is.
Sometimes I see low memory, however I often get the same symptoms/container failure with plenty of memory.
I’ve read through the tutorials provided by Balena, but I think my Linux background is too sparse to help me resolve this. Can anyone point me in the right direction for diagnosing & resolving the root problem(s)?
Below is Health Check and attached Diagnostics run.
Thanks for any advice and please advise if this is incorrect use of this forum.
Sandy
Health Check
{“diagnose_version”:“4.21.3”,“checks”:[{“name”:“check_balenaOS”,“success”:true,“status”:“Supported balenaOS 2.x detected”},{“name”:“check_container_engine”,“success”:false,“status”:“Some container_engine issues detected: \ntest_container_engine_running_now Container engine balena is NOT running\ntest_container_engine_restarts Container engine balena has 2894 restarts and may be crashlooping (most recent start time: Mon 2021-10-18 15:26:22 UTC)\ntest_container_engine_responding Error querying container engine: Cannot connect to the balenaEngine daemon at unix:///var/run/balena-engine.sock. Is the balenaEngine daemon running?”},{“name”:“check_localdisk”,“success”:false,“status”:“Some localdisk issues detected: \ntest_data_partition_mounted Data partition not mounted read-write”},{“name”:“check_memory”,“success”:false,“status”:“Low memory: 2% (96MB) available, 3680MB/3776MB used”},{“name”:“check_networking”,“success”:true,“status”:“No networking issues detected”},{“name”:“check_os_rollback”,“success”:true,“status”:“No OS rollbacks detected”},{“name”:“check_service_restarts”,“success”:true,“status”:“No services are restarting unexpectedly”},{“name”:“check_supervisor”,“success”:false,“status”:“Supervisor is NOT running”},{“name”:“check_temperature”,“success”:true,“status”:“No temperature issues detected”},{“name”:“check_timesync”,“success”:true,“status”:“Time is synchronized”}]}
68e5f8598c35b231be3bdf337eb1916d_diagnostics_2021.10.18_15.30.52+0000.txt (975.3 KB)