Recommendations for monitoring openBalena VPS?

I just discovered that our openBalena server had crashed. I was able to get it back up and running fairly quickly with a restart, but realized this would be a pretty big issue in production, as edge devices wouldn’t get updates! Does anyone have any recommendations for monitoring the services/containers on an openBalena instance?

I could write up a cron job that runs a Python script which attempts to log in and sends an email if it fails, but I don’t want to reinvent the wheel if something better (and preferably low-cost) is out there!

We have the same problem in our environment.

Let’s collaborate and write this simple monitor together!

I propose also adding push, e-mail, and SMS notifications.

The simplest way is to set up an external service such as Nodeping or equivalent that polls the API’s /ping endpoint and raises an alert if it doesn’t get a timely OK response. Most if not all open-balena services expose a similar endpoint.
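
If you want to try the same polling idea yourself before paying for a service, here is a minimal Python sketch. The URL `https://api.example.com/ping` is a placeholder for your own API host, and the names `is_healthy` and `main` are made up for illustration. It treats any HTTP 200 received within the timeout as healthy and does not inspect the response body.

```python
import http.client
import sys
import urllib.request


def is_healthy(url, timeout=5.0, opener=urllib.request.urlopen):
    """Return True if `url` answers with HTTP 200 within `timeout` seconds."""
    try:
        with opener(url, timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, ValueError, http.client.HTTPException):
        # OSError covers URLError, HTTPError, refused connections and timeouts;
        # ValueError covers a malformed URL.
        return False


def main(argv, check=is_healthy):
    """Check the URL in argv[1]; return a shell-style exit code (0 = healthy)."""
    # Placeholder default: replace with your own instance's /ping URL.
    url = argv[1] if len(argv) > 1 else "https://api.example.com/ping"
    if check(url):
        return 0
    print(f"ALERT: no timely OK from {url}", file=sys.stderr)
    return 1
```

To use it as a script, add `sys.exit(main(sys.argv))` at the bottom and run it from cron every few minutes. On most setups cron mails any output of a job to the address in `MAILTO`, provided mail delivery works on the host, so that may cover a basic e-mail alert. Keep in mind that a check running on the same VPS can't alert you when the whole VPS is down, so it should run from a different machine.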

I like the idea, but I’d prefer not to have to put a ton of time into building something at the moment. I think I’ll see what my boss thinks about Nodeping though!

Awesome, I’ll look into that, thanks!