-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix: systemd: make corosync wait for sbd-start to complete #74
base: main
Are you sure you want to change the base?
Conversation
Are you sure that this is working? We had a lengthy discussion about similar topics some time ago on the clusterlabs-list. Currently sbd is being started as part-of corosync which makes it start/stop in parallel providing that there is never sbd without corosync. Do you have a scenario in mind where this behavior (assuming it is working as expected) Where I think we would need a similar pattern is when we look at pacemaker-remote. There it |
The sbd.service file in 1.4.0 release has:
Not sure why is corosync missing from Before list? |
Because corosync doesn't do anything critical that might lead to split-brain if running unwatched by sbd ... Not saying our current way to use systemd enforcement of the startup-sequence and -requirements is ideal. As already discussed in the thread I was referring to before a target something like ready-for-resource-manager sounds interesting. But again in my current world sbd & corosync would still be part of that target ... |
I see, but than the line |
Well sounds reasonable but iirc that still was needed for some part of the synchronized stopping/starting of corosync & sbd. But I might be wrong here - have to refresh / proof wrong my memory ;-) |
Don't know, but if sbd and corosync start in parallel than there is no real dependency between them because they can start in any order. Instead starting or stopping pacemaker will take sbd with it (and fail pacemaker if sbd start fails). |
Didn't have time to dive into that again yet. |
Can one of the admins verify this patch? |
As corosync is contributes to quorum-voting I guess it should be observed by sbd in some way. |
If sbd fails to start it prevents pacemaker from starting, but corosync
continues to start without errors. This generates a quorum vote for the
current node although the sbd and pacemaker components are not alive.