ref(seer): Add random 50% rollout for context engine in start_run by Mihir-Mavalankar · Pull Request #110574 · getsentry/sentry

Mihir-Mavalankar · 2026-03-12T20:15:56Z

Gate context engine enablement behind a random coin flip for runs in sentry org with the feature flag.
contuinue_run always just checks the feature flag since in seer for continue runs we have the condition current_state.is_context_engine_enabled and self.request.is_context_engine_enabled:. So if the start run sets the context flag to True this condition is always true other wise false.
Currently set to 0. Will set to 0.5 in options automator.
Options automator PR needs to be merged first though: https://github.com/getsentry/sentry-options-automator/pull/6797

JoshFerge

what's the idea behind the random rollout?

shruthilayaj

why did we remove self.actor from the flag check?

Mihir-Mavalankar · 2026-03-12T20:23:35Z

what's the idea behind the random rollout?

My idea is that users in sentry should use explorer as is without knowing if that particular chat had context engine or not. This way they don't bias the results we collect in any way. For example asking harder questions if they know context engine is on.
From your suggestion in the meeting, Shruthi and I decided that we will make a frontend feature flag toggle just for our team. So just our team can run experiments and manually toggle it on and off. That PR is coming soon but needs more work cuz of the frontend component.

src/sentry/seer/explorer/client.py

Mihir-Mavalankar · 2026-03-12T20:25:16Z

why did we remove self.actor from the flag check?

Since the feature flag check is only org bound now, we can skip actor. Keeping it in won't break anything but is not needed.

shruthilayaj · 2026-03-12T20:30:45Z

why did we remove self.actor from the flag check?

Since the feature flag check is only org bound now, we can skip actor. Keeping it in won't break anything but is not needed.

I've disabled the flag for myself when testing, but I guess it's fine if we have the override 🤷‍♀️

src/sentry/seer/explorer/client.py

Gate context engine enablement behind a configurable rollout rate in start_run for orgs with the feature flag. The rate is controlled by the seer.explorer.context-engine-rollout option (default 0.0). continue_run always passes True since Seer ANDs it with the persisted value from start_run. Co-Authored-By: Claude Sonnet 4 <noreply@example.com>

JoshFerge · 2026-03-12T21:02:25Z

My idea is that users in sentry should use explorer as is without knowing if that particular chat had context engine or not. This way they don't bias the results we collect in any way. For example asking harder questions if they know context engine is on.

do we have enough internal usage to create statistically significant findings from this? why can't we just have evals for this instead?

Mihir-Mavalankar · 2026-03-12T21:21:41Z

My idea is that users in sentry should use explorer as is without knowing if that particular chat had context engine or not. This way they don't bias the results we collect in any way. For example asking harder questions if they know context engine is on.

do we have enough internal usage to create statistically significant findings from this? why can't we just have evals for this instead?

We do have evals some evals here just for the context engine. These are the ones Shruthi has added and we do plan to add more. Evals have their limitations too though and I think they are mostly to just catch glaring regressions.
While just sample size of just sentry org is small it still more than the eval dataset size. I am also hoping to roll this to the early adopter orgs (with the random rollout) and I think then we might have sizable enough dataset.

Mihir-Mavalankar requested a review from shruthilayaj March 12, 2026 20:15

Mihir-Mavalankar self-assigned this Mar 12, 2026

Mihir-Mavalankar requested a review from a team as a code owner March 12, 2026 20:15

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Mar 12, 2026

vercel bot deployed to Preview March 12, 2026 20:17 View deployment

JoshFerge reviewed Mar 12, 2026

View reviewed changes

shruthilayaj reviewed Mar 12, 2026

View reviewed changes

src/sentry/seer/explorer/client.py Outdated Show resolved Hide resolved

Mihir-Mavalankar force-pushed the mihir/ref/context-engine-random-rollout branch from c7bb90e to 4c5fa09 Compare March 12, 2026 20:27

vercel bot deployed to Preview March 12, 2026 20:29 View deployment

shruthilayaj reviewed Mar 12, 2026

View reviewed changes

src/sentry/seer/explorer/client.py Outdated Show resolved Hide resolved

shruthilayaj approved these changes Mar 12, 2026

View reviewed changes

Mihir-Mavalankar force-pushed the mihir/ref/context-engine-random-rollout branch from 4c5fa09 to b227f96 Compare March 12, 2026 20:40

vercel bot deployed to Preview March 12, 2026 20:43 View deployment

Mihir-Mavalankar enabled auto-merge (squash) March 12, 2026 20:57

Mihir-Mavalankar merged commit 9c2ca2c into master Mar 12, 2026
59 checks passed

Mihir-Mavalankar deleted the mihir/ref/context-engine-random-rollout branch March 12, 2026 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ref(seer): Add random 50% rollout for context engine in start_run#110574

ref(seer): Add random 50% rollout for context engine in start_run#110574
Mihir-Mavalankar merged 1 commit intomasterfrom
mihir/ref/context-engine-random-rollout

Mihir-Mavalankar commented Mar 12, 2026 •

edited

Loading

Uh oh!

JoshFerge left a comment

Uh oh!

shruthilayaj left a comment

Uh oh!

Mihir-Mavalankar commented Mar 12, 2026

Uh oh!

Uh oh!

Mihir-Mavalankar commented Mar 12, 2026

Uh oh!

shruthilayaj commented Mar 12, 2026

Uh oh!

Uh oh!

JoshFerge commented Mar 12, 2026

Uh oh!

Uh oh!

Mihir-Mavalankar commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

Mihir-Mavalankar commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JoshFerge left a comment

Choose a reason for hiding this comment

Uh oh!

shruthilayaj left a comment

Choose a reason for hiding this comment

Uh oh!

Mihir-Mavalankar commented Mar 12, 2026

Uh oh!

Uh oh!

Mihir-Mavalankar commented Mar 12, 2026

Uh oh!

shruthilayaj commented Mar 12, 2026

Uh oh!

Uh oh!

JoshFerge commented Mar 12, 2026

Uh oh!

Uh oh!

Mihir-Mavalankar commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Mihir-Mavalankar commented Mar 12, 2026 •

edited

Loading