feat(spans): Distribute span payload keys across Redis cluster by lvthanh03 · Pull Request #110593 · getsentry/sentry

lvthanh03 · 2026-03-12T21:56:59Z

Spread span payload sets across Redis cluster nodes to avoid concentrating large traces on a single node.

Instead of merging all payloads under {project_id:trace_id}, write them to {project_id:trace_id:span_id} so they shard across nodes. A member-keys tracking set (span-buf:mk) indexes which distributed keys belong to each segment.
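The sharding effect described above comes from Redis Cluster hash tags: only the text between the first `{` and the following `}` is hashed to choose a slot. The sketch below illustrates that rule under stated assumptions. The `span-buf:s:` prefix and the helper names `merged_key` and `distributed_key` are hypothetical and are not taken from this PR; only the two hash-tag formats come from the description.

```python
import binascii

CLUSTER_SLOTS = 16384  # number of hash slots in a Redis Cluster


def hash_tag(key: bytes) -> bytes:
    """Return the part of the key that Redis Cluster hashes.

    If the key contains a non-empty {...} section, only that section is
    hashed; otherwise the whole key is hashed.
    """
    start = key.find(b"{")
    if start != -1:
        end = key.find(b"}", start + 1)
        if end != -1 and end != start + 1:
            return key[start + 1 : end]
    return key


def key_slot(key: bytes) -> int:
    # Redis Cluster uses CRC16 (XMODEM variant) modulo 16384.
    # binascii.crc_hqx with an initial value of 0 computes that CRC.
    return binascii.crc_hqx(hash_tag(key), 0) % CLUSTER_SLOTS


def merged_key(project_id: int, trace_id: str) -> bytes:
    # Hypothetical key layout: every span of a trace shares one hash tag,
    # so all payloads for the trace land on the same node.
    return f"span-buf:s:{{{project_id}:{trace_id}}}".encode("ascii")


def distributed_key(project_id: int, trace_id: str, span_id: str) -> bytes:
    # Hypothetical key layout: the span id is part of the hash tag,
    # so different spans of one trace can map to different slots.
    return f"span-buf:s:{{{project_id}:{trace_id}:{span_id}}}".encode("ascii")


if __name__ == "__main__":
    print(key_slot(merged_key(1, "abc")))
    for span_id in ("s1", "s2", "s3"):
        print(span_id, key_slot(distributed_key(1, "abc", span_id)))
```

Because the merged format hashes only `project_id:trace_id`, every payload of a trace maps to one slot; including `span_id` in the tag removes that constraint, which is why a separate index such as the member-keys set is then needed to find the keys again.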

Three-phase rollout (similar to the ZSET to SET change):

Phase 1 (write-distributed-payloads->set to True): Dual-write to both key formats, read from merged set keys.
Phase 2 (read-distributed-payloads->set to True): Dual-write continues, flusher reads from distributed keys.
Phase 3 (write-merged-payloads->set to False): Stop writing merged payloads.
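The three phases can be read as combinations of three boolean options. The following is a minimal sketch of that reading, not the PR's implementation: the `PayloadRollout` class and the helper functions are hypothetical, and the default values (distributed writes and reads off, merged writes on) are an assumption about the pre-rollout state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PayloadRollout:
    # Assumed defaults: the state before Phase 1 starts.
    write_distributed: bool = False
    read_distributed: bool = False
    write_merged: bool = True


def write_targets(flags: PayloadRollout) -> list[str]:
    """Key formats that receive payload writes under these flags."""
    targets = []
    if flags.write_merged:
        targets.append("merged")
    if flags.write_distributed:
        targets.append("distributed")
    return targets


def read_source(flags: PayloadRollout) -> str:
    """Key format the flusher reads from under these flags."""
    return "distributed" if flags.read_distributed else "merged"


def is_consistent(flags: PayloadRollout) -> bool:
    # Reading from a format that is not being written would lose data.
    return read_source(flags) in write_targets(flags)


PHASE_1 = PayloadRollout(write_distributed=True)
PHASE_2 = PayloadRollout(write_distributed=True, read_distributed=True)
PHASE_3 = PayloadRollout(write_distributed=True, read_distributed=True, write_merged=False)

if __name__ == "__main__":
    for name, flags in [("phase 1", PHASE_1), ("phase 2", PHASE_2), ("phase 3", PHASE_3)]:
        print(name, write_targets(flags), read_source(flags), is_consistent(flags))
```

The ordering matters in this sketch: each phase only flips one flag, and every intermediate state keeps the read source among the write targets.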

Spread span payload sets across Redis cluster nodes to eliminate expensive colocated set merges (SMEMBERS+SADD) that block a single node for large segments.

Instead of colocating all payload keys under {project_id:trace_id}, write them to {project_id:trace_id:span_id} so they shard across nodes. A member-keys tracking set (span-buf:mk) indexes which distributed keys belong to each segment.

Three-phase rollout:

Phase 1 (distribute-payload-keys): Dual-write to both key formats, read from colocated.
Phase 2 (distribute-payload-keys-read): Dual-write continues, flusher reads from distributed keys.
Phase 3 (distribute-payload-keys-stop-colocated): Stop colocated writes.

lvthanh03 · 2026-03-13T14:21:35Z

src/sentry/spans/buffer.py

                project_id_bytes, _, _ = parse_segment_key(key)
-                project_id = int(project_id_bytes)
+                project_id_int = int(project_id_bytes)
                try:
-                    project = Project.objects.get_from_cache(id=project_id)
+                    project = Project.objects.get_from_cache(id=project_id_int)
                except Project.DoesNotExist:
                    logger.warning(
                        "Project does not exist for segment with dropped spans",
-                        extra={"project_id": project_id},
+                        extra={"project_id": project_id_int},
                    )
                else:
                    track_outcome(
                        org_id=project.organization_id,
-                        project_id=project_id,
+                        project_id=project_id_int,


Renaming because project_id is already declared above with type bytes, so redeclaring it as int would cause mypy errors.
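The mypy issue mentioned here can be reproduced in isolation. This is a small sketch, assuming a segment key of the form `project_id:trace_id:span_id`; the helper `split_key` is a hypothetical stand-in and not the real `parse_segment_key`.

```python
def split_key(key: bytes) -> tuple[bytes, bytes, bytes]:
    # Hypothetical stand-in: assumes exactly three colon-separated parts.
    project_id, trace_id, span_id = key.split(b":")
    return project_id, trace_id, span_id


def project_id_for(key: bytes) -> int:
    # The first binding fixes the variable's type as bytes for mypy.
    project_id, _, _ = split_key(key)

    # Rebinding the same name to an int would be rejected by mypy:
    #   project_id = int(project_id)
    #   error: Incompatible types in assignment
    #          (expression has type "int", variable has type "bytes")

    # Using a distinct name keeps both types visible and passes type checking.
    project_id_int = int(project_id)
    return project_id_int


if __name__ == "__main__":
    print(project_id_for(b"42:abc:def"))
```

The code runs either way at runtime; the rename only matters to the type checker.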

Why was it bytes?

src/sentry/spans/buffer.py

evanh

I think this all makes sense. Nice job!

fpacifici · 2026-03-16T23:50:28Z

src/sentry/spans/buffer.py

+                scan_key_to_segment[key] = key
+                cursors[key] = 0
+
+        self._distributed_payload_keys_map = {}


Why did you make _distributed_payload_keys_map an attribute of the class if it is only supposed to be managed within this method or within a single flush iteration?
Having it as a mutable attribute is a liability, as it can easily cause race conditions that we could otherwise prevent. There is no guarantee that process_spans and flush_segments will only ever be called from the same thread.

If you need it only for one iteration of the flush method, please make it a local variable. You can pass it back from load_segment_data.
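One way to follow this suggestion is to have the loading step return the map alongside the payloads, so the caller holds it as a local variable. The sketch below assumes in-memory dictionaries in place of Redis; the `LoadedSegments` type, the function signatures, and `flush_once` are hypothetical and do not reflect the actual code in buffer.py.

```python
from typing import NamedTuple


class LoadedSegments(NamedTuple):
    payloads: dict[bytes, list[bytes]]          # segment key -> span payloads
    distributed_keys: dict[bytes, list[bytes]]  # segment key -> keys that were read


def load_segment_data(
    segment_keys: list[bytes],
    member_keys: dict[bytes, list[bytes]],
    store: dict[bytes, list[bytes]],
) -> LoadedSegments:
    """Read payloads and return the key map instead of storing it on self."""
    payloads: dict[bytes, list[bytes]] = {}
    distributed_keys: dict[bytes, list[bytes]] = {}
    for segment_key in segment_keys:
        keys = list(member_keys.get(segment_key, []))
        distributed_keys[segment_key] = keys
        payloads[segment_key] = [p for k in keys for p in store.get(k, [])]
    return LoadedSegments(payloads, distributed_keys)


def flush_once(
    segment_keys: list[bytes],
    member_keys: dict[bytes, list[bytes]],
    store: dict[bytes, list[bytes]],
) -> dict[bytes, list[bytes]]:
    # The map lives only for this call, so concurrent callers cannot
    # overwrite each other's view of which keys need cleanup.
    loaded = load_segment_data(segment_keys, member_keys, store)
    for keys in loaded.distributed_keys.values():
        for key in keys:
            store.pop(key, None)
    return loaded.payloads
```

With no shared mutable attribute, two threads flushing different segments each clean up only the keys they loaded themselves.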

fpacifici · 2026-03-16T23:51:00Z

src/sentry/spans/buffer.py

                project_id_bytes, _, _ = parse_segment_key(key)
-                project_id = int(project_id_bytes)
+                project_id_int = int(project_id_bytes)
                try:
-                    project = Project.objects.get_from_cache(id=project_id)
+                    project = Project.objects.get_from_cache(id=project_id_int)
                except Project.DoesNotExist:
                    logger.warning(
                        "Project does not exist for segment with dropped spans",
-                        extra={"project_id": project_id},
+                        extra={"project_id": project_id_int},
                    )
                else:
                    track_outcome(
                        org_id=project.organization_id,
-                        project_id=project_id,
+                        project_id=project_id_int,


Why was it bytes?

fpacifici · 2026-03-17T00:00:53Z

src/sentry/spans/buffer.py

+        read_distributed_payloads = options.get("spans.buffer.read-distributed-payloads")
+        write_distributed_payloads = options.get("spans.buffer.write-distributed-payloads")


When you add multiple ways to perform an operation inside a very complex method, the extra branches make it even more complex and harder to follow.
A safer way to proceed is to break the method into several. Define a common interface for the two options and use the main method only to switch between one implementation and the other.
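The refactoring suggested here could look roughly like the following. This is only a sketch of the general pattern, assuming the two variants differ in how they build the payload key; the `PayloadWriter` protocol, the two classes, and `select_writers` are hypothetical names, not code from the PR.

```python
from typing import Protocol


class PayloadWriter(Protocol):
    """Common interface shared by both key layouts."""

    def payload_key(self, project_id: int, trace_id: str, span_id: str) -> str: ...


class MergedPayloadWriter:
    def payload_key(self, project_id: int, trace_id: str, span_id: str) -> str:
        # All spans of a trace share one hash tag.
        return f"{{{project_id}:{trace_id}}}"


class DistributedPayloadWriter:
    def payload_key(self, project_id: int, trace_id: str, span_id: str) -> str:
        # The span id is part of the hash tag.
        return f"{{{project_id}:{trace_id}:{span_id}}}"


def select_writers(write_merged: bool, write_distributed: bool) -> list[PayloadWriter]:
    """The only place that looks at the options; everything else uses the interface."""
    writers: list[PayloadWriter] = []
    if write_merged:
        writers.append(MergedPayloadWriter())
    if write_distributed:
        writers.append(DistributedPayloadWriter())
    return writers


def keys_for_span(
    writers: list[PayloadWriter], project_id: int, trace_id: str, span_id: str
) -> list[str]:
    # The main path has no option checks; it just iterates the selected writers.
    return [w.payload_key(project_id, trace_id, span_id) for w in writers]
```

In this shape the option lookups happen once, in `select_writers`, and the rest of the method body stays free of per-option branches.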

fpacifici · 2026-03-17T00:01:24Z

src/sentry/spans/buffer.py

+
+            for key, sub_span_ids in zip(segment_keys, mk_results):
+                project_id, trace_id, _ = parse_segment_key(key)
+                pat = f"{project_id.decode('ascii')}:{trace_id.decode('ascii')}"


fpacifici · 2026-03-17T00:01:52Z

src/sentry/spans/buffer.py

+                project_id, trace_id, _ = parse_segment_key(key)
+                pat = f"{project_id.decode('ascii')}:{trace_id.decode('ascii')}"
+                distributed_keys: list[bytes] = []
+                for sub_span_id in sub_span_ids:


What is a sub_span_id?

github-actions bot added the Scope: Backend label Mar 12, 2026

vercel bot deployed to Preview March 12, 2026 21:59 View deployment

vercel bot deployed to Preview March 12, 2026 22:08 View deployment

lvthanh03 force-pushed the tony/distribute-spans-payloads branch from 5249883 to 7b770e9 Compare March 12, 2026 22:10

vercel bot deployed to Preview March 12, 2026 22:13 View deployment

cleanup distributed keys in separate fn

b38e2dc

vercel bot deployed to Preview March 12, 2026 22:19 View deployment

lvthanh03 added 2 commits March 13, 2026 09:56

fix: typing

2183cbf

Merge branch 'master' into tony/distribute-spans-payloads

4d39d4e

vercel bot deployed to Preview March 13, 2026 14:00 View deployment

Fix options

a7b52ea

lvthanh03 commented Mar 13, 2026

vercel bot deployed to Preview March 13, 2026 14:22 View deployment

lvthanh03 added 3 commits March 13, 2026 10:31

WIP

7cf7ebc

fix: docs

e4c21af

more fixes

eac5b26

vercel bot deployed to Preview March 13, 2026 14:38 View deployment

var renames

4494dc0

vercel bot deployed to Preview March 13, 2026 15:17 View deployment

lvthanh03 marked this pull request as ready for review March 13, 2026 15:21

lvthanh03 requested review from a team as code owners March 13, 2026 15:21

sentry bot reviewed Mar 13, 2026

evanh approved these changes Mar 13, 2026

lvthanh03 merged commit f0d5daf into master Mar 13, 2026
61 checks passed

lvthanh03 deleted the tony/distribute-spans-payloads branch March 13, 2026 17:07

sentry-release-bot bot mentioned this pull request Mar 15, 2026

publish: getsentry/sentry@26.3.0 getsentry/publish#7450

Closed

3 tasks

fpacifici reviewed Mar 17, 2026

lvthanh03 merged 9 commits into master from tony/distribute-spans-payloads

3 participants
