How to troubleshoot reboots?

Ankan · September 2, 2021, 8:54pm

My custom board sometimes reboots. On bench I can see a stack dump in console but how do I troubleshoot less common reboots on devices in production mode on the field?

The board is a stm32mp1 based board and the problem can be in the M4 code or in A7 kernel/os/code.

Can an upgrade of balenaos help or an upgrade of kernal by upgrading yocto version from thud to dunfell?

pranavpeshwe · September 3, 2021, 5:19am

Hello Ankan,
Can you share some more details on what balena image you use on your custom board? The list of officially supported devices lives here: Single-board computers - Balena Documentation Also, if you are looking to resolve a specific kind of issue, then sharing details/logs about the same could help folks help you.

There is no easy way to debug random reboots in production. Enabling persistent logging will allow you to go through logs after the device comes up. Whether those logs allow you to debug your specific problem will depend on what exactly the problem is. I don’t think that simply upgrading balenaOS will automatically address your concern.

Hope that helps.

Thanks.

Ankan · September 3, 2021, 8:01am

I have compiled my own image with Yocto thud as there is no official support for my cpu.
I have enabled persistent logging, but can’t see anything in the log.
If the problem is in Linux I guess it should be shown in journal log, even if it is an watchdog timeout so maybe the problem lay in the M4 code.

dtischler · September 4, 2021, 1:46am

What exact device is this @Ankan, including model number? Is it an Avenger96, STM Discovery, Olimex, etc?

Ankan · September 5, 2021, 1:17pm

@dtischler it’s a QSMP-1570 based board.

dtischler · September 7, 2021, 7:33pm

Ok, that is not one I am familiar with…The best advice I can think of is to enable persistent logging so that you can go back and review the logs after a reboot (otherwise they are lost, as balenaOS default logging is to RAM), hook up and leave a serial UART console running so you can see what the kernel error is, and perhaps double check to see if the vendor provides a newer kernel version, in case there were in fact issues with their vendor branch.

Topic		Replies	Views
Raspberry Pis keep rebooting Product support raspberrypi3	58	2401	October 1, 2019
CM4 Won't Reboot Running balenaOS 2.94.4+rev1 Running On Custom Hardware balenaOS	3	556	April 15, 2022
Find reason for automatic reboots in Balena OS balenaOS	1	275	November 24, 2023
Trying to develop a Balena System with reliable rebooting balenaOS	10	506	September 28, 2021
Some basic questions about running balena on custom board balenaOS	9	725	May 31, 2023

How to troubleshoot reboots?

Related topics