change file save buffer size and use external buffer to avoid neighbor buffer allocation for each vertex when saving #944

SkyInTheSea wants to merge 2 commits into main.
Conversation
Pull request overview
Improves DiskANN provider serialization performance by increasing buffered write capacity and enabling adjacency list serialization without per-vertex allocations.
Changes:
- Introduces a 16 MB `BufWriter` capacity for the `.bin` data and graph save paths.
- Extends `GetAdjacencyList` with `get_adjacency_list_into` to support caller-provided reusable buffers (with optimized impls for async providers).
- Updates `save_graph` to reuse a single neighbor buffer across vertices and perform bulk writes.
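As a minimal sketch of the first change, a `BufWriter` can be constructed with an explicit capacity so that many small vertex records coalesce into few large writes. The 16 MB constant mirrors the value named above; the in-memory `Vec<u8>` sink is illustrative (the real code targets a file):

```rust
use std::io::{BufWriter, Write};

fn main() -> std::io::Result<()> {
    // Hypothetical constant matching the 16 MB capacity described in the PR.
    const SAVE_BUF_CAPACITY: usize = 16 * 1024 * 1024;

    // Write into an in-memory sink here; the real save path wraps a file.
    let mut out = BufWriter::with_capacity(SAVE_BUF_CAPACITY, Vec::new());
    out.write_all(&42u32.to_le_bytes())?;

    // `into_inner` flushes any buffered bytes and returns the sink.
    let inner = out.into_inner().expect("flush failed");
    assert_eq!(inner, 42u32.to_le_bytes());
    println!("wrote {} bytes", inner.len());
    Ok(())
}
```

Small writes land in the buffer and only reach the underlying writer when the buffer fills or is flushed, which is what makes per-vertex writes cheap.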
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| `diskann-providers/src/storage/bin.rs` | Adds save buffer sizing, extends adjacency-list trait with buffer-reuse API, and updates graph/data save routines to use larger buffered writes and reusable neighbor buffers. |
| `diskann-providers/src/model/graph/provider/async_/simple_neighbor_provider.rs` | Implements `get_adjacency_list_into` for `SimpleNeighborProviderAsync` and its `DiskAdaptor` to avoid intermediate `AdjacencyList` allocations during graph saves. |
```diff
  out.write_all(&num_neighbors.to_le_bytes())?;

- // Write all the neighbors, applying transformation if provided.
- neighbors
-     .iter()
-     .copied()
-     .try_for_each(|n| out.write_all(&n.to_le_bytes()))?;
+ // Bulk write using bytemuck for zero-copy conversion
+ let neighbor_bytes: &[u8] = bytemuck::must_cast_slice(&neighbor_buffer);
+ out.write_all(neighbor_bytes)?;
```
`save_graph` writes neighbor IDs by casting `&[u32]` to bytes and writing them directly. This relies on the host being little-endian, but the file format docs (and `load_graph`, which uses `ReadBytesExt::<LittleEndian>`) require that the neighbor IDs be encoded as little-endian `u32`. On big-endian targets this will silently produce corrupt graph files. Consider bulk-encoding to little-endian (e.g., via a reusable `[u8]` scratch buffer and byteorder little-endian writes, or by converting `u32` values to LE before writing) while still keeping the per-vertex allocation avoidance.
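One endian-safe variant of the bulk write the comment suggests, sketched with std only (the helper name `encode_neighbors_le` is illustrative, not the PR's actual API): each `u32` is converted with `to_le_bytes` into a reusable scratch buffer, then flushed in a single `write_all` per vertex, so the per-vertex allocation avoidance is preserved without assuming host endianness.

```rust
use std::io::Write;

// Encode neighbor IDs as little-endian u32 into a caller-owned scratch
// buffer. `to_le_bytes` is a byte swap on big-endian hosts and free on
// little-endian ones, so the output format is identical everywhere.
fn encode_neighbors_le(neighbors: &[u32], scratch: &mut Vec<u8>) {
    scratch.clear();
    scratch.reserve(neighbors.len() * 4);
    for &n in neighbors {
        scratch.extend_from_slice(&n.to_le_bytes());
    }
}

fn main() -> std::io::Result<()> {
    let mut scratch = Vec::new(); // reused across vertices
    let mut out: Vec<u8> = Vec::new(); // stand-in for the BufWriter sink

    encode_neighbors_le(&[1, 2, 0x0102_0304], &mut scratch);
    out.write_all(&scratch)?; // one bulk write per vertex

    // The third ID is stored little-endian regardless of host byte order.
    assert_eq!(out[8..12], [0x04, 0x03, 0x02, 0x01]);
    Ok(())
}
```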
Codecov Report

❌ Patch coverage is

Additional details and impacted files:

```
@@ Coverage Diff @@
##             main     #944      +/-   ##
==========================================
- Coverage   89.41%   89.29%   -0.13%
==========================================
  Files         448      447       -1
  Lines       84856    83918     -938
==========================================
- Hits        75878    74937     -941
- Misses       8978     8981       +3
```
hildebrandmw left a comment:
A couple of quick comments on how we can do slightly better without introducing a new method. Longer term, though, this API was always intended as a stopgap until a more disciplined approach to the in-memory providers and their serialization is worked out.
```rust
/// before being populated with the adjacency list data.
///
/// Default implementation falls back to `get_adjacency_list` and copies.
fn get_adjacency_list_into(&self, i: usize, buffer: &mut Vec<Self::Element>) -> ANNResult<()>
```
You can make this even more efficient by not adding `get_adjacency_list_into` and instead changing the `Item` to something like this:

```rust
struct Guarded<'a, T> {
    guard: RwLockReadGuard<'a, ()>,
    slice: &'a [T],
}

impl<T> Deref for Guarded<'_, T> {
    type Target = [T];
    fn deref(&self) -> &[T] {
        self.slice
    }
}
```

This bypasses the copy into the temporary buffer and lets you copy directly from the underlying provider into the destination.
This doesn't work for the `DiskAdaptor`, but presumably you aren't using that? Another option would be to change `get_adjacency_list` to take `&mut self` so that `DiskAdaptor` can allocate a local buffer itself.
I don't quite follow your point here. The optimization is to avoid a per-node memory allocation when flattening each neighbor list into the index file; with a global fixed-size buffer we save the allocation time. If we don't want a new interface that passes a reference to the global buffer, we could allocate it beforehand and make it referenceable from `get_adjacency_list`.
Right. With your API, you first copy the data into the externally allocated buffer, then copy it out of that buffer into the final stream. With the change I showed, using `Item<'a> = Guarded<'a, u32>`, you avoid ever copying into the allocated buffer and instead use the `Deref` implementation of `Guarded` to copy straight from the underlying store. This does not allocate and removes the extra copy step.
I think I see your meaning now: you want to expose the vertex's neighbor array directly to the final `BufWriter` to copy from, which saves one memcpy. But that copy into the `BufWriter` then happens under the read lock, and the `BufWriter` is a black box: we don't know when a flush will happen (part of the copy could fill the buffer and trigger one), and flushing to disk is a blocking operation. Compared with a single memcpy, that could hold the read lock for a long time and block other operations, such as updates to the node.

A memcpy into a small external buffer is very cheap, so in general I prefer to copy, which guarantees the read lock is held only briefly.
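The trade-off described above can be sketched as the "copy under a short lock, write outside it" pattern, using std types only (the `RwLock`-guarded adjacency store here is a hypothetical stand-in for the provider's internals):

```rust
use std::io::Write;
use std::sync::RwLock;

fn main() -> std::io::Result<()> {
    // Hypothetical guarded adjacency store: one neighbor list per vertex.
    let adjacency: RwLock<Vec<Vec<u32>>> = RwLock::new(vec![vec![1, 2, 3]]);

    let mut buffer: Vec<u32> = Vec::new(); // reusable external buffer
    {
        // Hold the read lock only long enough for a cheap memcpy.
        let guard = adjacency.read().unwrap();
        buffer.clear();
        buffer.extend_from_slice(&guard[0]);
    } // lock released before any (potentially blocking) disk I/O

    // Serialize outside the lock; a file-backed BufWriter in the real code.
    let mut out: Vec<u8> = Vec::new();
    for &n in &buffer {
        out.write_all(&n.to_le_bytes())?;
    }
    assert_eq!(out.len(), 12); // 3 neighbors x 4 bytes
    Ok(())
}
```

Writing directly from the guarded slice would skip the `extend_from_slice` copy, but then the loop's `write_all` calls, and any flush they trigger, would run inside the scoped block while the lock is held, which is the concern raised above.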
I sympathize with the desire to minimize the duration under which a lock is held, but this leads me to a bigger concern: why do we need to minimize the duration the lock is held in this API? The only reason I can think of is because doing so would prevent other concurrent writes to the graph, but if writes are happening concurrently to a save using this API, isn't the save going to be incorrect/not a faithful snapshot? If writes are not happening concurrently, then it doesn't matter if we hold onto the lock?
Yes, we need to consider that write traffic may try to update a vertex's neighbors and be blocked by the stream write holding the read lock.

Your concern is a real issue: if a save takes 5 minutes (as in our production case for a large DiskANN index), the neighbor data dumped to disk is, strictly speaking, not a complete graph but an in-flight one, and the impact is hard to evaluate. That means a single ANN object cannot handle this save-during-write case by itself; it has to be resolved by an upper layer.

For example, our ANN service manages multiple indices to avoid this snapshot issue. It hosts more than one index and aggregates search results across them; one index is read/write and all others are read-only, with writes applied only to the read/write index. When a snapshot or save command arrives, the current read/write index is atomically sealed as read-only and a new index is created to serve writes. Deletes are applied not only to the new read/write index but also recorded in a delete list that is replayed against the read-only indices, and a background thread keeps merging read-only indices with the deletion list.
Please do not submit PRs with no description and a blank template. Write an appropriate description of what this PR does and why.
Reference Issues/PRs
What does this implement/fix? Briefly explain your changes.
Any other comments?