Remove sandbox and subclusters from vdb at the same time #959

roypaulin · 2024-10-14T18:39:36Z

This fixes the issue that happens when you remove a sandbox and all its subclusters at the same time from the vdb, the pods will be down but will never get deleted.
Now, as soon as unsandbox is done the operator will find the statefulset make it rejoin the main cluster where it will be removed.

cchen-vertica · 2024-10-14T21:00:08Z

pkg/iter/sc_finder_test.go

+		vdb.Spec.Subclusters = vdb.Spec.Subclusters[1:]
+		vdb.Spec.Subclusters = vdb.Spec.Subclusters[:2]
+		scNames = append(scNames[0:1], scNames[3])
+		verifySubclusters(ctx, vdb, scNames, []int32{0, 0}, sbName, FindNotInVdbAcrossSandboxes)


Why the size is set to {0, 0}?

Because when the operator constructs subclusters no longer in vdb from their statefulsets, it gives them size 0 as they will soon be removed.

cchen-vertica · 2024-10-14T21:23:24Z

api/v1/verticadb_webhook.go

-	// find subclusters that are sandboxed in old vdb but removed in new vdb
+// checkSandboxSubclustersRemoved checks subclusters that are sandboxed in old vdb but removed in new vdb
+func (v *VerticaDB) checkSandboxSubclustersRemoved(allErrs field.ErrorList, oldObj *VerticaDB, oldScIndexMap map[string]int,
+	oldScMap, newScMap map[string]*Subcluster, path *field.Path) field.ErrorList {
 	oldScInSandbox := oldObj.GenSubclusterSandboxMap()


We should modify this to: newScInSandbox := v.GenSubclusterSandboxMap(). Then in line 1647, verify the removed sc isn't in newScInSandbox. The check in line 1650(if v.GetSandbox(sb) == nil) is not accurate since the sc could be in a new sandbox of new vdb.

I see, I always add the impression that moving a subcluster from one sandbox to another would not work. Then I need to change my definition of zombie subcluster as it will also cover the case where sandbox still exists.

cchen-vertica

In your next PR: you can remove the timeout you added in the e2e tests.

Remove sandbox and subclusters from vdb

a5639d8

roypaulin requested review from cchen-vertica, fenic-fawkes, HaoYang0000, qindotguan and LiboYu2 as code owners October 14, 2024 18:39

cchen-vertica reviewed Oct 14, 2024

View reviewed changes

roypaulin added 7 commits October 15, 2024 14:36

Address comments

b70f751

Fix test error

18ac1fe

Merge remote-tracking branch 'origin/main' into roypaulin/terminate

69d52cc

Make e2e test more stable

9829b1e

Increase timeout in vdb-gen test

547b38c

Increase timeout

3c8088f

Increase vdb concurrency in leg-4

80e9552

roypaulin requested a review from cchen-vertica October 17, 2024 09:02

cchen-vertica approved these changes Oct 17, 2024

View reviewed changes

roypaulin merged commit 0870228 into main Oct 17, 2024
37 checks passed

roypaulin deleted the roypaulin/terminate branch October 17, 2024 16:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove sandbox and subclusters from vdb at the same time #959

Remove sandbox and subclusters from vdb at the same time #959

roypaulin commented Oct 14, 2024

cchen-vertica Oct 14, 2024

roypaulin Oct 15, 2024

cchen-vertica Oct 14, 2024

roypaulin Oct 15, 2024

cchen-vertica left a comment

Remove sandbox and subclusters from vdb at the same time #959

Remove sandbox and subclusters from vdb at the same time #959

Conversation

roypaulin commented Oct 14, 2024

cchen-vertica Oct 14, 2024

Choose a reason for hiding this comment

roypaulin Oct 15, 2024

Choose a reason for hiding this comment

cchen-vertica Oct 14, 2024

Choose a reason for hiding this comment

roypaulin Oct 15, 2024

Choose a reason for hiding this comment

cchen-vertica left a comment

Choose a reason for hiding this comment