index: avoid unsafe buffer sharing in `Log_file` by craigfe · Pull Request #358 · mirage/index

craigfe · 2021-10-10T11:39:02Z

#355 introduced a small optimisation to re-use a "local" scratch buffer when decoding values from the log file.

This is actually unsafe: during the merge, the asynchronous merge thread and the main writer thread can attempt concurrent reads from the log file, causing contention over the scratch buffer. This can be observed by inserting `Thread.yield` calls inside the `Value.decode` implementation and then stress-testing the interface (e.g. by running the replay benchmarks).

This commit ensures that each call to a `Log_file` function gets its own scratch buffer, which makes concurrent access safe without introducing potential contention issues.
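To illustrate the hazard and the fix, here is a minimal single-threaded sketch. It is not the actual `Log_file` code: the record `t`, the fixed 8-byte entries and the functions `read_shared` / `read_fresh` are hypothetical, and the `yield` callback stands in for a point where another thread could be scheduled (as with an inserted `Thread.yield`).

```ocaml
(* Hypothetical model of a log file: fixed-width entries held in memory. *)
let entry_size = 8

type t = {
  data : string;       (* stands in for the on-disk log contents *)
  scratch_buf : bytes; (* one buffer shared by every read: the unsafe part *)
}

let v entries =
  { data = String.concat "" entries; scratch_buf = Bytes.create entry_size }

(* Unsafe: fills the shared buffer, then "yields" before decoding from it.
   Anything that runs during [yield] can overwrite the buffer. *)
let read_shared t ~(yield : unit -> unit) i =
  Bytes.blit_string t.data (i * entry_size) t.scratch_buf 0 entry_size;
  yield ();
  Bytes.to_string t.scratch_buf

(* Safe: every call allocates its own buffer, so nothing else can touch it. *)
let read_fresh t ~(yield : unit -> unit) i =
  let buf = Bytes.create entry_size in
  Bytes.blit_string t.data (i * entry_size) buf 0 entry_size;
  yield ();
  Bytes.to_string buf

let () =
  let t = v [ "aaaaaaaa"; "bbbbbbbb" ] in
  (* A second read of entry 1 is interleaved into the read of entry 0. *)
  let shared =
    read_shared t ~yield:(fun () -> ignore (read_shared t ~yield:ignore 1)) 0
  in
  let fresh =
    read_fresh t ~yield:(fun () -> ignore (read_fresh t ~yield:ignore 1)) 0
  in
  assert (shared = "bbbbbbbb"); (* entry 0 was clobbered by the other read *)
  assert (fresh = "aaaaaaaa")   (* entry 0 is returned intact *)
```

The per-call variant pays one small allocation per read, in exchange for not having to reason about which threads may be reading at the same time.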

Note: I initially tried to just wrap `t.scratch_buf` in a mutex (0a7b05c), but this caused a ~5% performance regression in the replay benchmarks due to contention over the lock. The alternative of just allocating on entry to `Log_file` has a relatively small cost.
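For comparison, the mutex-based approach could look roughly like the sketch below. This is an assumption about its general shape, not the contents of 0a7b05c: the `lock` field and the `with_scratch` helper are hypothetical names. On OCaml 4 the `Mutex` module requires linking the `threads` library.

```ocaml
(* Hypothetical sketch: a shared scratch buffer guarded by a mutex. *)
type t = { scratch_buf : bytes; lock : Mutex.t }

let v size = { scratch_buf = Bytes.create size; lock = Mutex.create () }

(* Run [f] with exclusive access to the shared buffer. Concurrent callers
   block on [Mutex.lock], which is where lock contention would show up. *)
let with_scratch t f =
  Mutex.lock t.lock;
  Fun.protect
    ~finally:(fun () -> Mutex.unlock t.lock)
    (fun () -> f t.scratch_buf)

let () =
  let t = v 8 in
  let s =
    with_scratch t (fun buf ->
        Bytes.blit_string "aaaaaaaa" 0 buf 0 8;
        Bytes.to_string buf)
  in
  assert (s = "aaaaaaaa")
```

Every reader serialises on the same lock here, whereas allocating a buffer per call keeps readers independent of each other.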

samoht · 2021-10-10T13:06:39Z

What's the performance impact of the mutex? Wouldn't it be better to have one scratch buffer per thread?

craigfe · 2021-10-10T13:36:09Z

@samoht: I took some measurements of that :-) Yes, that mutex has a ~5% performance impact on very write-intensive workloads like the replay benchmarks when the log size is large (but I couldn't see any impact on other workloads).

I've changed the implementation to allocate new buffers on entry to `Log_file`. It might indeed be better to do things per-thread, but we'd need an LRU to avoid leaking memory there. It's not sufficient to just keep separate buffers for the writer + merge thread, because readers in different threads can contend for the buffer as well.
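A rough sketch of why a naive per-thread scheme leaks: the table `buffers`, the lock `table_lock` and the function `scratch_for_current_thread` below are hypothetical, and the code needs the `threads` library. Buffers are keyed by thread id and never removed, so they outlive the threads that used them.

```ocaml
(* Hypothetical per-thread scratch buffers, keyed by thread id. *)
let buffers : (int, bytes) Hashtbl.t = Hashtbl.create 8

(* [Hashtbl] is not thread-safe, so the table itself still needs a lock
   (held only briefly, not for the duration of a read). *)
let table_lock = Mutex.create ()

let scratch_for_current_thread size =
  let id = Thread.id (Thread.self ()) in
  Mutex.lock table_lock;
  let buf =
    match Hashtbl.find_opt buffers id with
    | Some b -> b
    | None ->
        let b = Bytes.create size in
        Hashtbl.add buffers id b;
        b
  in
  Mutex.unlock table_lock;
  buf

let () =
  let workers =
    List.init 4 (fun _ ->
        Thread.create (fun () -> ignore (scratch_for_current_thread 8)) ())
  in
  List.iter Thread.join workers;
  (* All four threads have exited, but their buffers are still reachable. *)
  assert (Hashtbl.length buffers = 4)
```

Bounding the table (for instance with an LRU, as suggested above) or removing entries when threads exit would be needed to keep memory use stable.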

craigfe force-pushed the guard-unsafe-scratch-buffer branch from 6771e45 to 0a7b05c Compare October 10, 2021 11:40

craigfe force-pushed the guard-unsafe-scratch-buffer branch from 0a7b05c to 5154202 Compare October 10, 2021 13:26

craigfe force-pushed the guard-unsafe-scratch-buffer branch from 5154202 to 4e3ba1e Compare October 10, 2021 13:38

craigfe changed the title ~~index: guard scratch buffer in Log_file with a mutex~~ index: avoid unsafe buffer sharing in Log_file Oct 10, 2021

Ngoguey42 approved these changes Oct 11, 2021

craigfe merged commit f359eb8 into mirage:main Oct 11, 2021

icristescu mentioned this pull request Oct 11, 2021

Add test for interleaved reads #361

Open

craigfe added a commit to craigfe/irmin that referenced this pull request Oct 12, 2021

irmin-pack: use latest Index

e05c991

This contains mirage/index#358, which doesn't change the API but is an important bug-fix.

craigfe mentioned this pull request Oct 12, 2021

Adapt to mirage/repr#81 (improved binary decoder type) mirage/irmin#1547

Merged

craigfe merged 1 commit into mirage:main from craigfe:guard-unsafe-scratch-buffer
