Supervisor is unable to start service

bithell · July 3, 2019, 7:58pm

Python script initiated a reboot of the entire device using api (it may have done this multiple times)
Device rebooted
Supervisor was unable to initialise script again
I manually rebooted from cloud console
Device rebooted
Script started fine

Anyone seen this behaviour before?

samothx · July 3, 2019, 10:47pm

Hi @bithell, I find it hard to image how this behaviour would be triggered by the balena env. Could there be any state in the weatherStationCoreLink container that makes it fail to start ?
The way to analyse this would be to look at the device while it is in the failing state. Can you trigger the reboot and does it show that behaviour when you do ?

bithell · July 4, 2019, 11:15am

Hi @samothx - no when I trigger the reboot in the dashboard it starts fine

jviotti · July 4, 2019, 11:30am

Can you please enable support access and share the device url/uuid with us so we can take a closer look?

bithell · July 4, 2019, 11:48am

Sure thing its f453f0e67eb717e9b5930cd9452d2d1f

jviotti · July 4, 2019, 11:55am

Hey there,

Is the device on the problematic state right now? If not, can you make device get into the state where the supervisor fails to start the service? I’d be interesting to check the device logs as that happens

bithell · July 4, 2019, 12:07pm

It’s not in that state right now sorry, and not really sure how to get it there - it’s happened a few times today (and yesterday) but I don’t really have a sure fire way to get it there other than by initiating a restart pretty soon after the script starts

jviotti · July 4, 2019, 12:15pm

No worries! Please keep an eye on it and once it happens, leave it in that state and ping us here. We are always watching the forums so we’ll be able to jump into the device as soon as it happens again and get all the necessary information.

bithell · July 4, 2019, 12:22pm

@jviotti it’s just gone into it now

jviotti · July 4, 2019, 12:23pm

Awesome, I’m taking a look now

jviotti · July 4, 2019, 12:37pm

Hey @bithell,

I have a theory for what’s going on. It seems that the supervisor starts all the other containers before binding to the HTTP port, so I think that if your container is fast enough, the supervisor API might not be available yet. I’m double checking that this is indeed the case, and if so we can update it to ensure that the API is available before the other containers start.

For now, can you try updating your script to retry various times if the connection is refused, waiting a bit before each retry? I believe that the supervisor will eventually start responding

bithell · July 4, 2019, 1:03pm

Sure thing - I’ve set it to loop around waiting 10 minutes each time

bithell · July 14, 2019, 6:28pm

@jviotti - seems to be stuck in a state of trying to start the service but it can’t. Any chance you could take a look?

imrehg · July 15, 2019, 10:42am

Hey @bithell, can you remind us of the device UUID to check, and enable support access please? Thanks!

bithell · July 15, 2019, 12:38pm

Sure thing it’s f453f0e67eb717e9b5930cd9452d2d1f

CameronDiver · July 15, 2019, 1:02pm

Taking a look now.

CameronDiver · July 15, 2019, 1:04pm

Could you also make sure that support access is enabled, it does not seem to be currently.

bithell · July 15, 2019, 1:38pm

Apologies, done now @CameronDiver

CameronDiver · July 15, 2019, 1:57pm

Hey, it seems like the device still does not have support access enabled, could you check you enabled it on this device? https://dashboard.balena-cloud.com/devices/f453f0e67eb717e9b5930cd9452d2d1f/summary

bithell · July 15, 2019, 3:14pm

Apologies, I’ve tried again

Topic		Replies	Views
Balena Supervisor starting randomly even if container is running balenaOS	2	209	September 15, 2023
Two instances of supervisor trying to start balenaOS	12	503	September 15, 2022
Balena "Supervisor" process does not restart after a device is "Shut Down" from BalenaCloud Product support support	20	2965	September 8, 2020
Failed to connect to supervisor-api Product support	3	565	August 2, 2019
Supervisor API reboot causing device to stall offline requiring power cycle balenaOS api	0	157	November 23, 2023

Supervisor is unable to start service

Related topics