Marathon stops responding

Issue

  • We are unable to access Marathon endpoint (marathon.cje.example.com) and getting below error while trying to login.
Problem accessing /. Reason:
Could not determine the current leader
  • cje run list-applications is failing

Environment

Resolution

Connect to each of the controllers and restart the marathon service for each of them. From the bastion host, in the CJE project folder:

1.- dna servers to get the id of your controllers (3 in the following example)

$> dna servers
   ...
    controller-x
    controller-y
    controller-z
    ...

2.- For every controller, restart the marathon service. Ensure that the service is up and running before moving to the next one.

$>  dna connect controller-x
controller-x $> sudo service marathon stop
controller-x $> sudo service marathon start
controller-x $> exit

3.- If the previous step did not fix the issue, restart every controller (cje prepare controller-restart) one by one, check that the controller is up and running with dna connect controller-x.

Note: If the problem persists, please open a support ticket including a CJE Bundle, including all the controllers in the server tag.

Tested product/plugin versions

References

Have more questions?

0 Comments

Please sign in to leave a comment.