I’m looking for a way to notify ops when our devices are not ok. I really like the device diagnostics feature, since it already covers a lot of of ground. to start with I’m thinking I make a batch job that triggers the diagnostics check every few hours for all of our online devices. And make it send slack messages for failed checks.
It seems to me that would give me a lot of operational insight for not a whole lot of effort, and with a minimal use of bandwidth. These are all things I like very much
Now I was wondering if I could use this as a basic infrastructure towards better monitoring overall. Can I add my own application specific device checks to the diagnostics? I found this https://github.com/balena-io/device-diagnostics and that looks quite easy to extend. But how would I go about getting those extensions on my devices? And would the API and dashboard automatically pick those new checks up?