Enabling hardware watchdog timer on the Raspberry Pi CM3

bartvanarnhem · February 9, 2021, 9:16pm

Hi all,

I found an article describing how to enable the Raspberry Pi hardware watchdog timer to have devices recover from any potential hardware or kernel lockups:

Other than a 2016 Balena post, "Keeping Your System Running with a Host OS Watchdog", and a few mentions on the Balena forum, I could not find any clear documentation.

My questions:

Is it possible to enable the hardware watchdog timer on recent versions of BalenaOS?
Is this in any way bad practice?

We are running a Compute Module 3 using the Raspberry Pi 3 32bit BalenaOS base image.

Thanks!

Bart

dfunckt · February 10, 2021, 11:10am

What is your use case? balenaOS already utilizes the hardware watchdog as part of a chain of health checks to ensure the device is always available and responsive. If you’re looking to restart your application when it becomes unresponsive, you might be better off looking into docker-compose file’s “healthcheck” directive that balenaOS also supports: Compose file version 2 reference | Docker Documentation

bartvanarnhem · February 10, 2021, 7:41pm

We are sporadically seeing issues with the camera module that we are using on the Pi, which look similar to the one posted here: Experiencing hung raspberry pi while using camera - Arducam. Sometimes this seems to lead to our devices losing connection.

I am already in contact with your colleagues in another support thread trying to get to the root cause, but I figured it would be good to see if there are any measures we can take to at least make the device recover if it gets into this state.

Good to know that balenaOS already utilizes the hardware watchdog. That answers my question. Thanks!

jkridner · March 24, 2021, 5:45pm

Does a bad “healthcheck” result in the watchdog not being “pet” and therefore result in a reboot?

Is this enabled on all boards with hardware watchdogs, like BeagleBone Black?

zwhitchcox · March 26, 2021, 6:44pm

Hi, Jason, you can find more about docker-compose health checks here, but basically, you run a script within your container, and if it returns an error or cannot execute, it restarts your container.

It can look something like this:

        healthcheck:
            test: [ "CMD", "pg_isready", "-q", "-d", "${DB_NAME}", "-U", "${DB_USER}" ]
            timeout: 45s
            interval: 10s
            retries: 10

jkridner · March 29, 2021, 6:14pm

I’d like for it to reset my board, not my container.

The point of a hardware watchdog is that if, for any reason, you don’t perform the action telling the hardware watchdog everything is fine on a regular periodic basis, the board resets.

Why are we resetting containers when the board needs to be reset?

Where is the hardware watchdog help?

zwhitchcox · March 30, 2021, 9:49pm

Jason,

Watchdog is already implemented and managed by BalenaOS. This utilizes the hardware watchdog to reset the board if the kernel is unresponsive for the given amount of time. Docker also has healthchecks that can restart your container if it fails a given test.

So, there are two separate processes ensuring your application/board are up and running.

Please let me know if anything is unclear.

jkridner · April 18, 2021, 5:16am

I want to reboot the system if the container healthchecks are bad, not just restart the container. Is that possible?

ab77 · April 30, 2021, 7:56pm

You could run a container with the balena socket mapped into it, which would then give you a docker API inside the container. You could write a script/app to periodically run a status check (i.e. docker ps) to check on the health of the running containers and if your conditions for a system reboot are met, issue a reboot from inside the container.
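
A minimal sketch of that approach in Python, for illustration only. It assumes a docker-compatible CLI is installed in the container and can reach the mapped engine socket, and that a container counts as bad when its `docker ps` status contains "(unhealthy)". The names `unhealthy_names`, `list_containers` and `watch` are made up for this example, and `reboot` is a hypothetical callable you would supply yourself; how the reboot is actually issued is not shown here.

```python
import json
import subprocess
import time


def unhealthy_names(ps_lines):
    """Return the names of containers whose status string reports '(unhealthy)'."""
    names = []
    for line in ps_lines:
        line = line.strip()
        if not line:
            continue
        entry = json.loads(line)
        if "(unhealthy)" in entry.get("Status", ""):
            names.append(entry.get("Names", "<unknown>"))
    return names


def list_containers():
    """Run `docker ps` with one JSON object per line and return those lines.

    Assumption: a docker-compatible CLI exists in this container and can
    talk to the engine socket mapped in from the host.
    """
    result = subprocess.run(
        ["docker", "ps", "--format", "{{json .}}"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.splitlines()


def watch(reboot, list_fn=list_containers, max_failures=3,
          interval_s=30, sleep=time.sleep):
    """Call reboot(bad) once max_failures consecutive checks find unhealthy containers.

    `reboot` is a hypothetical callable supplied by the caller.
    """
    failures = 0
    while True:
        bad = unhealthy_names(list_fn())
        # Reset the counter as soon as one check comes back clean.
        failures = failures + 1 if bad else 0
        if failures >= max_failures:
            reboot(bad)
            return bad
        sleep(interval_s)
```

Requiring several consecutive failures before rebooting is a design choice in this sketch, so that a container that is briefly unhealthy during startup or an update does not take the whole board down. You would start it with something like `watch(my_reboot)`, where `my_reboot` is your own function.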
