Upsert memo support #858

maciejdudko · 2025-05-08T20:28:19Z

What was changed

Added workflow.upsert_memo function to support memo upsert functionality.

Why?

Feature request: temporalio/features#119

Checklist

Closes [Feature Request] Upsert memo support #190
How was this tested:

Modified test_workflow_memo to also test upsert operation.

Any docs updates needed?

CLAassistant · 2025-05-08T20:28:25Z

All committers have signed the CLA.

temporalio/workflow.py

cretz · 2025-05-09T15:58:47Z

temporalio/worker/_workflow_instance.py

@@ -218,8 +218,11 @@ def __init__(self, det: WorkflowInstanceDetails) -> None:
        self._current_history_length = 0
        self._current_history_size = 0
        self._continue_as_new_suggested = False
+        self._raw_memo: Optional[


Is _raw_memo needed here if it's already available in info? Also, we need to make sure we update the info's raw memo info on upsert. Note, we need to make sure we update the same dict, not recreate it (which you can't really anyways because it's a frozen data class).

Oh, I thought _info is read-only, that's why I made the other one. I've updated PR to operate on _info.raw_memo directly.

👍 While the top level fields are read-only, we do mutate a couple of maps in there (memo and search attributes)

cretz · 2025-05-09T15:59:41Z

temporalio/worker/_workflow_instance.py

+        # Clearing cached value, will be regenerated on next workflow_memo() call.
+        self._untyped_converted_memo = None


We should update the existing dict with the updates IMO. A user may have assigned it to a variable. It also doesn't need to go back through the payload conversion process because you can just use the objects directly that you were given by the user. We should also document on memo() that the result can be mutated with the results of upsert.

I don't like an idea of handing over references to mutable state. I'd imagine most code would expect it to not change. Handing over a read-only copy feels safer and less surprising. Additionally, if the user ever modified the returned dict, our internal state would get desynchronized with what's in the command buffer and that smells like all kinds of trouble.

I deliberately round-trip the memo value for two reasons: to keep the result of the untyped memo read consistent regardless of how the memo was obtained (from server vs. from recent upsert), and to allow deserialization with different type hints than the original value (we already offer that ability in current SDK version, somebody somewhere probably depends on it).

I don't like an idea of handing over references to mutable state. I'd imagine most code would expect it to not change. Handing over a read-only copy feels safer and less surprising. Additionally, if the user ever modified the returned dict, our internal state would get desynchronized with what's in the command buffer and that smells like all kinds of trouble.

But we already do this with search attributes and this happens in other SDKs. I think it's actually more surprising to users when multiple instances of a workflow's memo are independent. Sure in some languages where immutable collections are more common it can make more sense to copy-on-write, but not here (and again we don't with other mutable collections like search attributes). We do have a type that tells people the map is an immutable view of it, so they shouldn't mutate, but we can't stop them.

I deliberately round-trip the memo value for two reasons: to keep the result of the untyped memo read consistent regardless of how the memo was obtained (from server vs. from recent upsert), and to allow deserialization with different type hints than the original value (we already offer that ability in current SDK version, somebody somewhere probably depends on it).

I think we can keep the behavior of lazily mutating the typed map only if it had been accessed before, but instead let it continue to be lazily created from raw (which we also update here) if it hasn't been accessed. All of the existing functionality for individual memo values by type hint is retained because it works off of raw. Only this one workflow_memo method does this deserialization with default type hints (so often dicts and arrays and such) and that should continue to work.

workflow.memo() should return the same object every time for clarity and consistency.

cretz · 2025-05-12T19:30:29Z

temporalio/worker/_workflow_instance.py

+                self._untyped_converted_memo[k] = self._payload_converter.from_payload(
+                    v
+                )


Suggested change

self._untyped_converted_memo[k] = self._payload_converter.from_payload(

v

)

self._untyped_converted_memo[k] = updates[k]

I don't think you should go back through the converter, it's fine if we use the value as the user gave it to us

After discussion, to have the same value whether they have called memo() before or not, we do need to re-convert here. Hopefully people don't use memo() and they use type-hint-based ones instead. Feel free to add comment mentioning that we re-convert because we don't want the addition of memo() earlier in the workflow to affect the this value.

Fixes temporalio#190

maciejdudko requested a review from a team as a code owner May 8, 2025 20:28

cretz reviewed May 8, 2025

View reviewed changes

temporalio/workflow.py Show resolved Hide resolved

cretz reviewed May 9, 2025

View reviewed changes

maciejdudko force-pushed the upsert-memo branch from 793e88f to 4168fba Compare May 9, 2025 21:19

cretz reviewed May 12, 2025

View reviewed changes

cretz approved these changes May 12, 2025

View reviewed changes

maciejdudko added 4 commits May 12, 2025 16:31

Upsert memo support

2c18c52

Fixes temporalio#190

Made workflow_upsert_memo operate on info.raw_memo. Linter fixes.

77ccecc

Changed null encoding to fix behavior on x86.

7a823ea

Changed untyped memo to reuse same dict instance.

ca22fa1

maciejdudko force-pushed the upsert-memo branch from 7ae926c to ca22fa1 Compare May 12, 2025 20:31

maciejdudko merged commit 257f143 into temporalio:main May 12, 2025
14 checks passed

maciejdudko deleted the upsert-memo branch May 12, 2025 21:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Upsert memo support #858

Upsert memo support #858

Uh oh!

maciejdudko commented May 8, 2025

Uh oh!

CLAassistant commented May 8, 2025 •

edited

Loading

Uh oh!

Uh oh!

cretz May 9, 2025

Uh oh!

maciejdudko May 9, 2025

Uh oh!

cretz May 12, 2025

Uh oh!

cretz May 9, 2025

Uh oh!

maciejdudko May 9, 2025

Uh oh!

cretz May 12, 2025 •

edited

Loading

Uh oh!

cretz May 12, 2025

Uh oh!

cretz May 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

		# Clearing cached value, will be regenerated on next workflow_memo() call.
		self._untyped_converted_memo = None

Upsert memo support #858

Upsert memo support #858

Uh oh!

Conversation

maciejdudko commented May 8, 2025

What was changed

Why?

Checklist

Uh oh!

CLAassistant commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

cretz May 9, 2025

Choose a reason for hiding this comment

Uh oh!

maciejdudko May 9, 2025

Choose a reason for hiding this comment

Uh oh!

cretz May 12, 2025

Choose a reason for hiding this comment

Uh oh!

cretz May 9, 2025

Choose a reason for hiding this comment

Uh oh!

maciejdudko May 9, 2025

Choose a reason for hiding this comment

Uh oh!

cretz May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cretz May 12, 2025

Choose a reason for hiding this comment

Uh oh!

cretz May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

CLAassistant commented May 8, 2025 •

edited

Loading

cretz May 12, 2025 •

edited

Loading

cretz May 12, 2025 •

edited

Loading