First cluster implementation #2441

svagner · 2020-01-02T14:20:30Z

Description

As solution for issue #2443

Same as #2345 with solving conflicts and some new api endpoints

Were added new api endpoints:
POST /api/cluster/recover_cluster
Api endpoint Is used to manually force a new configuration in order to recover from a loss of quorum where the current configuration cannot be restored, such as when several servers die at the same time. This works by reading all the current state for this server, creating a snapshot with the supplied configuration, and then truncating the Raft log. This is the only safe way to force a given configuration without actually altering the log to insert any new entries, which could cause conflicts with other servers with a different state.

WARNING! This operation implicitly commits all entries in the Raft log, so in general, this is an extremely unsafe operation. If you've lost your other servers and are performing a manual recovery, then you've also lost the commit information, so this is likely the best you can do, but you should be aware that calling this can cause Raft log entries that were in the process of being replicated but not yet be committed to be committed.

Example:

$ curl -s 127.0.0.1:8071/api/cluster/status | jq .
{
  "State": "Candidate",
  "Nodes": [
    {
      "Address": "127.0.0.1:10002",
      "State": "Follower"
    },
    {
      "Address": "127.0.0.1:10014",
      "State": "Follower"
    }
  ],
  "Stats": {
...
  }
}
$ curl -XPOST 127.0.0.1:8071/api/cluster/recover_cluster -d '{"members": [{"address": "127.0.0.1:10002"}]}'
{
  "State": "Leader",
  "Nodes": [
    {
      "Address": "127.0.0.1:10002",
      "State": "Leader"
    }
  ],
  "Stats": {
...
  }
}

POST /api/cluster/change_master - move leadership to another node in cluster
Example:

$ curl -s 127.0.0.1:8071/api/cluster/status | jq .
{
  "State": "Leader",
  "Nodes": [
    {
      "Address": "127.0.0.1:10002",
      "State": "Leader"
    },
    {
      "Address": "127.0.0.1:10014",
      "State": "Follower"
    }
  ],
  "Stats": {
    ...
  }
}
$ curl -XPOST 127.0.0.1:8072/api/cluster/change_master -d '{"id": "127.0.0.1:10014", "address": "127.0.0.1:10014"}'
{"status":"error","error":"cannot transfer leadership to itself"}
$ curl -XPOST 127.0.0.1:8072/api/cluster/change_master -d '{"id": "127.0.0.1:10002", "address": "127.0.0.1:10002"}'
{"status":"error","error":"node is not the leader"}
$ curl -XPOST 127.0.0.1:8071/api/cluster/change_master -d '{"id": "127.0.0.1:10014", "address": "127.0.0.1:10014"}'
{"status":"ok"}
$ curl 127.0.0.1:8071/api/cluster/status | jq .
{
  "State": "Follower",
  "Nodes": [
    {
      "Address": "127.0.0.1:10002",
      "State": "Follower"
    },
    {
      "Address": "127.0.0.1:10014",
      "State": "Leader"
    }
  ],
  "Stats": {
...
  }
}

Type of change

From the following, please check the options that are relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How has this been tested?

Checklist:

This contribution follows the project's code of conduct
This contribution follows the project's contributing guidelines
My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
Any dependent changes have been merged and published in downstream modules

muffix · 2020-01-02T15:20:39Z

Hi @svagner, thanks a lot for the contribution. 🙌 Is it worth creating an issue first to discuss the approach and keep this PR for the technical discussion?
Can I also ask you to please use the template that's generated when you open a PR and fill in the details? This makes it easier for us to review bigger changes and contributions like this one. That'd be much appreciated.

svagner · 2020-01-02T15:50:10Z

Hi @svagner, thanks a lot for the contribution. Is it worth creating an issue first to discuss the approach and keep this PR for the technical discussion?
Can I also ask you to please use the template that's generated when you open a PR and fill in the details? This makes it easier for us to review bigger changes and contributions like this one. That'd be much appreciated.

I'll create issue then. We already have a couple of them, but there was no movement about clustering. Also, there is some discussion in prev MR, but anyway it is not bad to have a separate issue for discussion

Cluster would only have one ‘leader’ at a time, all other nodes are followers (so this is an implementation of a model with with 1 master and multiple standby nodes). ‘Master’ node executes the checks and sends notifications, ‘follower’ nodes don’t do neither (they run with ‘no-checks’ and ‘quiet-mode’ options enabled). This also adds a new (optional) dependency raftdb to store state and perform leader election.

For now, we are looking to global variable that was initialized once we've started. If we want to have flexibility to restart scheduler (config api reload/clustering etc.) we should have it as time of scheduler's start

svagner · 2020-04-23T21:44:12Z

New implementation is in #2472

svagner force-pushed the raft_cluster_implementation branch 3 times, most recently from be36983 to 25386eb Compare January 2, 2020 15:46

svagner mentioned this pull request Jan 2, 2020

Feature request: Clustering support #2443

Closed

3 tasks

svagner added 6 commits March 3, 2020 15:57

Fix test config flag with cluster usage

1cd84e2

Add Cluster timeouts settings

d4205cc

Add possibility to define members in the separate memberlist file

2deeb29

Fix check unknowns while we restart scheduler

996c467

For now, we are looking to global variable that was initialized once we've started. If we want to have flexibility to restart scheduler (config api reload/clustering etc.) we should have it as time of scheduler's start

Rebase with current master version

d54b595

svagner force-pushed the raft_cluster_implementation branch from 72beb61 to d54b595 Compare March 3, 2020 15:02

svagner added 3 commits March 9, 2020 12:12

Update raft dependency to version 1.1.2

b5fe247

Fix unknowns after reload scheduler/start bosun after long downtime

228785a

Increase reload channel size for cluster

9ec1ce8

svagner mentioned this pull request Apr 23, 2020

Raft-based cluster support #2472

Closed

18 tasks

svagner closed this Apr 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

First cluster implementation #2441

First cluster implementation #2441

Uh oh!

svagner commented Jan 2, 2020 •

edited

Loading

Uh oh!

muffix commented Jan 2, 2020

Uh oh!

svagner commented Jan 2, 2020

Uh oh!

svagner commented Apr 23, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

First cluster implementation #2441

First cluster implementation #2441

Uh oh!

Conversation

svagner commented Jan 2, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

How has this been tested?

Checklist:

Uh oh!

muffix commented Jan 2, 2020

Uh oh!

svagner commented Jan 2, 2020

Uh oh!

svagner commented Apr 23, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

svagner commented Jan 2, 2020 •

edited

Loading