
Remove serialization lock in checksum synchronization.#72928

Merged
CyrusNajmabadi merged 7 commits into dotnet:main from CyrusNajmabadi:removeLock
Apr 8, 2024

Conversation

@CyrusNajmabadi
Contributor

@CyrusNajmabadi CyrusNajmabadi commented Apr 8, 2024

I collected measurements, and this lock does not seem to help at all.

Some important pieces of information:

- With this lock, it takes 20s to synchronize all data. Without it, it takes 12s. That's a huge win.
- With this lock, we make 3750 sync calls. Without the lock, it drops to 3250. Another nice win.

This PR should be read a commit at a time.

One interesting fact about the sync info is just how many single-checksum syncs we perform. Prior to this change, we perform 1816 (so roughly 50% of the calls). With this change, it drops to 1269 (so roughly 40% of the calls).

However, in both cases the numbers seem quite high. During the initial sync, I would expect many more large syncs. I'd like to collect more data to see if it helps explain what's going on before/after.
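A quick sanity check of the single-checksum ratios quoted above (plain arithmetic over the numbers in this comment):

```python
# Sanity-check the "roughly 50%" / "roughly 40%" claims above.
before_total, before_single = 3750, 1816
after_total, after_single = 3250, 1269

print(f"before: {before_single / before_total:.0%} of sync calls are single-checksum")
print(f"after:  {after_single / after_total:.0%} of sync calls are single-checksum")
```

1816/3750 comes out to about 48%, and 1269/3250 to about 39%, matching the "roughly 50%" and "roughly 40%" figures.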

@CyrusNajmabadi CyrusNajmabadi requested a review from a team as a code owner April 8, 2024 01:55
@ghost ghost added Area-IDE untriaged Issues and PRs which have not yet been triaged by a lead labels Apr 8, 2024
@CyrusNajmabadi CyrusNajmabadi requested a review from ToddGrun April 8, 2024 02:15
@CyrusNajmabadi
Contributor Author

Plugging the data into Google Sheets, I'm slicing and dicing some of the information:

[image: per-type counts of single-checksum sync calls]

This is the count of calls where we ask for only one checksum of that particular type. The type is the strong type passed into the sync call for what we want back. The 'object' line is for calls that are asking for heterogeneous data.
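To illustrate the kind of slicing being done here, a hypothetical sketch (the real instrumentation lives in Roslyn's C# serialization code; the type names and counts below are made up for the example):

```python
from collections import Counter

# Hypothetical log of sync calls: (requested result type, number of checksums asked for).
sync_calls = [
    ("DocumentStateChecksums", 1),
    ("DocumentStateChecksums", 1),
    ("ProjectStateChecksums", 12),
    ("object", 1),   # heterogeneous request: mixed checksum kinds in one call
    ("object", 3),
]

# Bucket only the calls that asked for exactly one checksum, keyed by the requested type.
single_checksum_counts = Counter(kind for kind, count in sync_calls if count == 1)
print(single_checksum_counts)
```

The chart above is this same tally over the real call log: one row per requested type, counting only the size-1 sync calls.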

@CyrusNajmabadi
Contributor Author

CyrusNajmabadi commented Apr 8, 2024

Ok. I can make a trivial change to get us to:

[image: updated per-type counts after the change]

Looks like DocumentStateChecksums are where we can also get some nice benefits.

@ToddGrun
Contributor

ToddGrun commented Apr 8, 2024

> I can make a trivial change to get us to:

Is there still an incoming change into this PR?

@CyrusNajmabadi
Contributor Author

@ToddGrun I'm doing it in follow-ups.

@sharwell
Contributor

sharwell commented Apr 8, 2024

Are the improvements due to actual concurrency happening now, or are they simply due to eliminating lock overhead? If the improvements are concurrency, what is the parallelized entry point?

Would a batching work queue make any difference here?

@ToddGrun
Contributor

ToddGrun commented Apr 8, 2024

I don't understand what this comment means w.r.t. calls for a single checksum:

> The 'object' line is for calls that are asking for heterogeneous data.

@CyrusNajmabadi
Contributor Author

> I don't understand what this comment means w.r.t. calls for a single checksum

Responding offline.

@CyrusNajmabadi
Contributor Author

> Are the improvements due to actual concurrency happening now,

Actual concurrency. And there is ample opportunity to improve that even more; we can be heavily concurrent here. I just have to measure and make sure it's worthwhile.

@sharwell
Contributor

sharwell commented Apr 8, 2024

FWIW, historically I've found the greatest gains in cross-process communication come from sequencing calls such that outgoing call 2 waits only for the outgoing message of call 1 to finish, not for the response to call 1 to be received. It's common for implementations to wait for an entire round trip to complete before sending the next outgoing message, which degrades total throughput by much more than expected.
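The throughput effect being described can be sketched as follows (hypothetical names and latencies; this approximates pipelining by overlapping the round trips rather than modeling an ordered send channel):

```python
import time
from concurrent.futures import ThreadPoolExecutor

ONE_WAY_LATENCY = 0.1  # seconds; hypothetical network delay

def round_trip(msg):
    # Simulate sending a message and waiting for its response.
    time.sleep(2 * ONE_WAY_LATENCY)
    return f"ack:{msg}"

def serialized(messages):
    # Wait for each full round trip before sending the next message.
    start = time.monotonic()
    results = [round_trip(m) for m in messages]
    return results, time.monotonic() - start

def overlapped(messages):
    # Send the next message without waiting for the previous response.
    start = time.monotonic()
    with ThreadPoolExecutor(max_workers=len(messages)) as pool:
        results = list(pool.map(round_trip, messages))
    return results, time.monotonic() - start

msgs = [f"sync-{i}" for i in range(5)]
_, serialized_time = serialized(msgs)   # ~5 full round trips
_, overlapped_time = overlapped(msgs)   # ~1 round trip total
print(f"serialized: {serialized_time:.2f}s, overlapped: {overlapped_time:.2f}s")
```

With five calls and a 0.2s round trip, waiting for each response in turn costs roughly 1s, while not gating each send on the previous response brings the total close to a single round trip.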

@CyrusNajmabadi
Contributor Author

Sure, we can continue looking at this. I don't see why we can't make this much more concurrent anyway. Regardless, this is a pure win, so I'd like to take this while making more changes in follow-ups.

@sharwell
Contributor

sharwell commented Apr 8, 2024

I have no objections to the conceptual change here, but didn't review the change in detail w.r.t. allowing concurrency in the operations.

@CyrusNajmabadi
Contributor Author

CyrusNajmabadi commented Apr 8, 2024

> but didn't review the change in detail w.r.t. allowing concurrency in the operations.

OOP syncing is already concurrent (most of the time); we just had a lock around some batch operations, and that slows down a lot of normal IDE work. For example, while we're doing the 'full bg solution sync', we're updating OOP one project at a time. During this time, if a call comes in to run some feature on the current project, and we decide to fully sync that project, it's gated on the same serialization lock that the full-bg-solution-sync is using.
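The contention described here can be sketched like so (hypothetical durations, in Python; with a shared serialization lock, the small feature-triggered sync queues behind the long background sync, and without it they proceed independently):

```python
import threading
import time

serialization_lock = threading.Lock()

def sync(duration, use_lock):
    # Simulate pushing a project's data over to the OOP process.
    if use_lock:
        with serialization_lock:
            time.sleep(duration)
    else:
        time.sleep(duration)

def feature_sync_latency(use_lock):
    start = time.monotonic()
    # Long-running full background solution sync grabs the lock first...
    bg = threading.Thread(target=sync, args=(0.5, use_lock))
    bg.start()
    time.sleep(0.01)  # let the bg sync acquire the lock
    # ...then a small feature-triggered sync of the current project arrives.
    feature = threading.Thread(target=sync, args=(0.05, use_lock))
    feature.start()
    feature.join()
    elapsed = time.monotonic() - start  # time until the feature sync completes
    bg.join()
    return elapsed

with_lock = feature_sync_latency(True)
without_lock = feature_sync_latency(False)
print(f"with lock:    feature sync done after {with_lock:.2f}s")
print(f"without lock: feature sync done after {without_lock:.2f}s")
```

Under the lock, the 50ms feature sync can't finish until the 500ms background sync releases the lock; without it, the feature sync completes in roughly its own duration.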

@ToddGrun
Contributor

ToddGrun commented Apr 8, 2024

This change LGTM, but I'm not sure if Sam wants to dig in deeper.

Contributor

@ToddGrun ToddGrun left a comment


:shipit:

@CyrusNajmabadi
Contributor Author

I'm going to move forward. Def happy to get more feedback (esp. as more changes come here).

@CyrusNajmabadi CyrusNajmabadi merged commit 86d28ec into dotnet:main Apr 8, 2024
@CyrusNajmabadi CyrusNajmabadi deleted the removeLock branch April 8, 2024 20:32
@dotnet-policy-service dotnet-policy-service bot added this to the Next milestone Apr 8, 2024
@CyrusNajmabadi
Contributor Author

@jasonmalinowski For review when you get back.

@dibarbet dibarbet modified the milestones: Next, 17.11 P1 Apr 29, 2024