
feat: add support for TopologySpreadConstraints #546

Merged
1 commit merged into StarRocks:main on Jun 17, 2024

Conversation

davidspek (Contributor)

Description

This PR adds support for Pod Topology Spread Constraints, which are useful for distributing pods evenly across failure domains such as nodes or availability zones. In many situations this cannot be achieved with simple Affinity controls alone.
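For readers who have not used the feature before, here is a minimal, hypothetical sketch of what a topology spread constraint looks like in a pod spec (the field names come from the upstream Kubernetes API; the label is made up for the example, and the exact place where the StarRocks CRDs expose this may differ):

```yaml
# Illustrative only: spread matching pods evenly across nodes, tolerating a
# difference of at most one pod between any two nodes.
topologySpreadConstraints:
  - maxSkew: 1
    topologyKey: kubernetes.io/hostname   # one domain per node
    whenUnsatisfiable: ScheduleAnyway     # best effort; never block scheduling
    labelSelector:
      matchLabels:
        app: starrocks-fe                 # hypothetical label; match it to your pods
```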

Related Issue(s)

Please list any related issues and link them here.

Checklist

For the operator, please complete the following checklist:

  • run make generate to generate the code.
  • run golangci-lint run to check the code style.
  • run make test to run the unit tests.
  • run make manifests to update the CRD YAML files.

For the helm chart, please complete the following checklist:

  • Make sure you have updated the values.yaml file of the starrocks chart.
  • In the scripts directory, run bash create-parent-chart-values.sh to update the values.yaml file of the parent chart (kube-starrocks chart).

CLAassistant commented Jun 14, 2024

CLA assistant check
All committers have signed the CLA.

@yandongxiao self-requested a review June 17, 2024 06:57
@yandongxiao (Collaborator) left a comment

Thank you for your PR!

@@ -265,6 +265,7 @@ func Spec(spec v1.SpecInterface, container corev1.Container, volumes []corev1.Vo
 	ServiceAccountName:            spec.GetServiceAccount(),
 	TerminationGracePeriodSeconds: spec.GetTerminationGracePeriodSeconds(),
 	Affinity:                      spec.GetAffinity(),
+	TopologySpreadConstraints:     spec.GetTopologySpreadConstraints(),
Collaborator

Since this is a newer feature in Kubernetes: if the user is running an older version of k8s, such as 1.18, will it cause the deployment to fail?

Collaborator

I have verified it. An unknown field causes a strict decoding error:

# deployments.apps "kube-starrocks-operator" was not valid:
# * patch: Invalid value: "map[spec:map[aaaa:[]]]": strict decoding error: unknown field "spec.aaaa"

davidspek (Contributor Author)

As far as I can quickly find, TopologySpreadConstraints was promoted to beta in Kubernetes 1.18, which was released 4 years ago, or almost half the life of Kubernetes :). I think most projects have given up on supporting such old versions of Kubernetes, since it can prevent them from moving forward with features. It might be possible to enable this feature only after a Kubernetes version check, but I'm not familiar enough with the code to do that quickly, nor am I sure it's worth the effort of maintaining support for Kubernetes versions that people shouldn't have been using for years now.

Collaborator

https://kubernetes.io/blog/2020/05/introducing-podtopologyspread/
According to this blog post, topologySpreadConstraints was introduced in 1.18, which is the oldest version the StarRocks operator supports.

@yandongxiao (Collaborator)

Could you share the scenarios in which you use TopologySpreadConstraints?


davidspek commented Jun 17, 2024

You need TopologySpreadConstraints if you want to evenly distribute pods across availability zones in scenarios where you might have more pods than availability zones. This is especially important because with most cloud providers the persistent disks are bound to an availability zone, meaning that if the pods are not spread across all the availability zones the first time they are scheduled, you will not be able to correct that later (without serious manual effort). A pod anti-affinity rule can work, but only if the number of zones is equal to the number of pods you will ever want to schedule. The motivation section of the docs gives a more complete overview of why this feature was added and why it can be so important (especially when dealing with persistence and availability-zone-bound disks).
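As a concrete illustration of that scenario (field names from the standard Kubernetes pod spec; the label is hypothetical): with six replicas, three zones, and the constraint below, the scheduler places two pods per zone, and because whenUnsatisfiable is DoNotSchedule, a pod that would push one zone more than one pod ahead of another stays pending rather than being placed unevenly.

```yaml
# Sketch: keep replicas balanced across availability zones so that
# zone-bound persistent disks also end up evenly distributed.
topologySpreadConstraints:
  - maxSkew: 1                                # zones may differ by at most one pod
    topologyKey: topology.kubernetes.io/zone  # one domain per availability zone
    whenUnsatisfiable: DoNotSchedule          # hard requirement, not best effort
    labelSelector:
      matchLabels:
        app: starrocks-be                     # hypothetical label for the pods
```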

@yandongxiao merged commit 3017f55 into StarRocks:main Jun 17, 2024
6 checks passed
@Hatlassian

This is awesome, thanks for this!
