Active-passive to active-active domain migration support #7071

taylanisikdemir · 2025-07-16T22:55:47Z

What changed?

Updated replication simulation to support active-passive to active-active domain update operation
Updated domain handler to support this case
- Carry over domain's failover version to the corresponding entry in ActiveClusterbyRegion map
- Increment top level failover version to indicate a cluster change/failover event occurred. Domain update propagation needs this.
Update failover history entries to record 3 types of events:
- active-passive domain failover
- active-active domain failover
- active-passive to active-active migration

Why?
Avoid requiring creating new domains to leverage active-active mode by supporting migration

How did you test it?

Unit tests for domain handler
active-passive to active-active simulation: Validates domain is functioning as active-active afterwards
active-active regional failover: Validates that we can do regional failover for active-active domains. e.g. region0-> region1. Workflows previously active on the cluster in region0 are resumed on the cluster in region1

Shaddoll · 2025-07-17T21:14:44Z

common/domain/handler.go

+				// we increment failover version so top level failoverVersion is updated and domain data is replicated.
+				failoverVersion = d.clusterMetadata.GetNextFailoverVersion(
+					replicationConfig.ActiveClusterName,
+					failoverVersion+1,


GetNextFailoverVersion(0) returns 0. I need it to actually increase.

common/domain/handler.go

Shaddoll · 2025-07-17T21:58:43Z

docs/design/active-active/active-active.md

+When an active-passive domain is migrated to active-active,
+- The domain record will have the `ActiveClusters` field set
+- The existing `ActiveClusterName` field will be left as is.
+- The failover version of the domain will be incremented. Even though this is not used for task versions for active-active domains, it's incremented to indicate there was a change in replication config.


I'm kinda afraid that this can cause confusion to people. For active-active domain and active-passive domains, this field will have different semantics. Should we deprecate this field for active-active domain and use a different field?

We are gonna hide it in UI & CLI response so shouldn't cause confusion unless DB is inspected. However it's not a bad idea to not use this field for active-active domains at all. That requires further changes in domain replication such as this condition. I'll leave that as TODO(active-active) and we can evaluate later

Shaddoll · 2025-07-17T22:10:06Z

common/domain/handler.go

+				// top level failover version is not used for task versions for active-active domains but we still increment it
+				// to indicate there was a change in replication config
+				failoverVersion = d.clusterMetadata.GetNextFailoverVersion(
+					d.clusterMetadata.GetCurrentClusterName(),


Why do we use current cluster here and active cluster above?

Active-active domain doesn't have a current cluster (unless it's migrated) but we still need to increase the failover version due to the reason I mentioned in other comment.

…gration

taylanisikdemir added 2 commits July 16, 2025 15:46

Active-passive to active-active domain migration support

b086f31

fixes

7e0dfa8

taylanisikdemir changed the title ~~WIP: Active-passive to active-active domain migration support~~ Active-passive to active-active domain migration support Jul 17, 2025

taylanisikdemir marked this pull request as ready for review July 17, 2025 18:20

taylanisikdemir requested review from Shaddoll, neil-xie, davidporter-id-au, Groxx, shijiesheng, jakobht, 3vilhamster, sankari165, dkrotx and demirkayaender as code owners July 17, 2025 18:20

Shaddoll reviewed Jul 17, 2025

View reviewed changes

remove debug logs

e3da7c0

Shaddoll approved these changes Jul 17, 2025

View reviewed changes

change the simulation scenario to validate ongoing workflow during mi…

3987016

…gration

taylanisikdemir merged commit 11c594a into cadence-workflow:master Jul 18, 2025
25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Active-passive to active-active domain migration support #7071

Active-passive to active-active domain migration support #7071

Uh oh!

taylanisikdemir commented Jul 16, 2025 •

edited

Loading

Uh oh!

Shaddoll Jul 17, 2025

Uh oh!

taylanisikdemir Jul 17, 2025

Uh oh!

Uh oh!

Shaddoll Jul 17, 2025

Uh oh!

taylanisikdemir Jul 17, 2025

Uh oh!

Shaddoll Jul 17, 2025

Uh oh!

taylanisikdemir Jul 17, 2025

Uh oh!

Uh oh!

Uh oh!

Active-passive to active-active domain migration support #7071

Active-passive to active-active domain migration support #7071

Uh oh!

Conversation

taylanisikdemir commented Jul 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Shaddoll Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

taylanisikdemir Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Shaddoll Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

taylanisikdemir Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

Shaddoll Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

taylanisikdemir Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

taylanisikdemir commented Jul 16, 2025 •

edited

Loading