Hi Balena Team,
I think this question is best asked in the forum so others may benefit from the answer.
I have a custom carrier board that uses Crucial P3 NVMe drives, and I have had 3 drive failures this year. When I run the command nvme smart-log /dev/nvme0n1 -H, I see that most of my power-downs are considered unsafe (37 unsafe shutdowns out of 39 power cycles). See the output below.
I have 2 questions:
- What is the process for safely shutting down a device from within a container?
- Can a container receive a signal when a shutdown has been triggered from the UI?
Smart Log for NVME device:nvme0n1 namespace-id:ffffffff
critical_warning : 0
Available Spare[0] : 0
Temp. Threshold[1] : 0
NVM subsystem Reliability[2] : 0
Read-only[3] : 0
Volatile mem. backup failed[4] : 0
Persistent Mem. RO[5] : 0
temperature : 46 C
available_spare : 100%
available_spare_threshold : 5%
percentage_used : 0%
endurance group critical warning summary: 0
data_units_read : 1265
data_units_written : 3759
host_read_commands : 104368
host_write_commands : 184706
controller_busy_time : 3
power_cycles : 39
power_on_hours : 33
unsafe_shutdowns : 37
media_errors : 0
num_err_log_entries : 9
Warning Temperature Time : 0
Critical Composite Temperature Time : 0
Temperature Sensor 1 : 46 C
Temperature Sensor 2 : 51 C
Temperature Sensor 8 : 46 C
Thermal Management T1 Trans Count : 0
Thermal Management T2 Trans Count : 0
Thermal Management T1 Total Time : 0
Thermal Management T2 Total Time : 0