
Performance improvements to use RWLock to access LRUQueryCache#13306

Merged
benwtrent merged 15 commits into apache:main from boicehuang:readlock
May 10, 2024

Conversation

@boicehuang (Contributor) commented Apr 15, 2024

Elasticsearch (which is based on Lucene) can automatically infer field types for users with its dynamic mapping feature. When users index low-cardinality fields such as gender, age, or status, they often use numbers to represent the values; ES then infers these fields as long, and indexes long fields with BKD trees.

As #541 notes, when the data volume grows, building the result set for low-cardinality fields drives CPU usage and load very high, even when the query is a boolean query with filter clauses on those low-cardinality fields.

One reason is that LRUQueryCache uses a single ReentrantLock to guard all access. When the QPS and cost of these queries are both high, attempts to acquire the lock frequently fail when consulting the cache, resulting in low concurrency on cache access.

So I replaced the ReentrantLock with a ReentrantReadWriteLock, taking only the read lock when looking up the cached result for a query.
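The read/write split described above can be sketched as follows. This is a minimal hypothetical illustration of the pattern, not the actual LRUQueryCache code: readers share the read lock, so concurrent lookups no longer contend, while mutations still take the exclusive write lock.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.locks.ReadWriteLock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch of an RWLock-guarded cache (names are illustrative).
class RWLockCache<K, V> {
  private final Map<K, V> map = new HashMap<>();
  private final ReadWriteLock lock = new ReentrantReadWriteLock();

  V get(K key) {
    lock.readLock().lock(); // many readers may hold this concurrently
    try {
      return map.get(key);
    } finally {
      lock.readLock().unlock();
    }
  }

  void put(K key, V value) {
    lock.writeLock().lock(); // exclusive while the cache is mutated
    try {
      map.put(key, value);
    } finally {
      lock.writeLock().unlock();
    }
  }
}
```

Because cached queries are read far more often than the cache is updated, the read lock is the hot path and the write lock is rarely contended.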

I benchmarked this optimization by indexing random LongPoint values and querying them with a PointInSetQuery inside a boolean filter clause.

| doc count | field cardinality | query terms count | baseline QPS | candidate QPS | diff percentage |
|---|---|---|---|---|---|
| 30000000 | 10 | 1 | 2481 | 5102 | 105.6% |
| 30000000 | 1000000 | 1 | 6396 | 6596.48 | 3.1% |

I think this change can help filter queries on low-cardinality fields.

@boicehuang (Contributor, Author) commented:

This optimization also benefits high-cost queries, such as a terms query with 10000 terms, by serving results from the cache more often instead of searching the inverted index:

| doc count | field cardinality | query terms count | baseline QPS | candidate QPS | diff percentage |
|---|---|---|---|---|---|
| 30000000 | 1000000 | 10000 | 160 | 473 | 191.9% |

```diff
  LeafCache(Object key) {
    this.key = key;
-   cache = new IdentityHashMap<>();
+   cache = Collections.synchronizedMap(new IdentityHashMap<>());
```
Member:

I don't understand this. Aren't all accesses to LeafCache protected by the LRUQueryCache read/write locks?

Since LeafCache isn't a static class, it should have access to the enclosing class's lock.

For testing safety, putIfAbsent, remove, onDocIdSetCache, and onDocIdSetEviction should all do an assert writeLock.isHeldByCurrentThread();
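The suggested invariant check can be sketched like this (hypothetical class and method names, not the actual Lucene code): each mutator asserts that the caller already holds the enclosing write lock, so an unguarded call fails fast when assertions are enabled instead of silently corrupting the map.

```java
import java.util.IdentityHashMap;
import java.util.Map;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch: mutators assert the write lock is held by the caller.
class GuardedLeafCache {
  private final ReentrantReadWriteLock rwl = new ReentrantReadWriteLock();
  private final ReentrantReadWriteLock.WriteLock writeLock = rwl.writeLock();
  private final Map<Object, Object> cache = new IdentityHashMap<>();

  void putIfAbsent(Object query, Object docIdSet) {
    // Fails under -ea if a caller forgot to take the write lock first.
    assert writeLock.isHeldByCurrentThread();
    cache.putIfAbsent(query, docIdSet);
  }

  // Helper that runs a mutation under the write lock.
  void withWriteLock(Runnable r) {
    writeLock.lock();
    try {
      r.run();
    } finally {
      writeLock.unlock();
    }
  }

  int size() {
    return cache.size();
  }
}
```

ReentrantReadWriteLock.WriteLock exposes isHeldByCurrentThread(), which makes this kind of assertion cheap to add.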

@boicehuang (Contributor, Author) replied Apr 30, 2024:
Sorry, I misunderstood this before. Since reads are guarded by the read lock, concurrent access to the IdentityHashMap does not require an additional synchronized wrapper. I optimized the code per your suggestion.

@benwtrent (Member) commented:

@boicehuang what are the new numbers for your benchmarks for the current iteration? Indeed we will be synchronizing more, so I wonder if we will still see improvement.

@boicehuang (Contributor, Author) commented May 6, 2024:

> @boicehuang what are the new numbers for your benchmarks for the current iteration? Indeed we will be synchronizing more, so I wonder if we will still see improvement.

| doc count | field cardinality | query point | baseline QPS | candidate QPS | diff percentage | diff |
|---|---|---|---|---|---|---|
| 30000000 | 10 | 1 | 2481 | 4428 | 78% | using LongAdder; uniqueQueries as Collections.synchronizedMap; cache as IdentityHashMap |

It still has benefits for high-frequency queries with low cardinality fields.

@jpountz (Contributor) left a comment:

Query caching in Lucene is designed to update the content of the cache infrequently (e.g. because we wait until we've seen a query multiple times before caching it), so it makes sense that switching to a RWLock reduced contention.

The change looks correct, I just left minor comments.

```diff
- private volatile long hitCount;
- private volatile long missCount;
+ private volatile LongAdder hitCount;
+ private volatile LongAdder missCount;
```
Contributor:
These fields could be made final and non-volatile now.
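The point of this comment can be sketched as follows (an illustrative example, not the actual Lucene fields): once the counters are LongAdder objects, the reference itself never changes, so the field can be final and needs no volatile; the LongAdder handles concurrent increments internally without a lock.

```java
import java.util.concurrent.atomic.LongAdder;

// Hypothetical sketch: LongAdder counters replacing volatile long fields.
class CacheStats {
  // final (never reassigned), non-volatile: thread safety lives inside LongAdder.
  private final LongAdder hitCount = new LongAdder();
  private final LongAdder missCount = new LongAdder();

  void onHit()  { hitCount.increment(); }  // lock-free, contention-friendly
  void onMiss() { missCount.increment(); }

  long hits()   { return hitCount.sum(); }
  long misses() { return missCount.sum(); }
}
```

LongAdder is preferable to AtomicLong here because under heavy contention it stripes updates across cells and only combines them in sum(), which suits counters that are incremented often and read rarely.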

Contributor Author:
ok


```diff
  // If the lock is already busy, prefer using the uncached version than waiting
- if (lock.tryLock() == false) {
+ if (writeLock.tryLock() == false) {
```
Contributor:
Why are we using the write lock here? It looks like we're only reading so we could use the read lock?

Contributor Author:
Yes, the read lock can also be used here.
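The "skip the cache if the lock is busy" pattern under discussion can be sketched like this (hypothetical names, not the actual LRUQueryCache code): a pure lookup only needs the read lock, and tryLock on the read lock only fails while a writer holds or is acquiring the lock, so readers rarely fall back to the uncached path.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch of the busy-lock fallback for cache lookups.
class BusyAwareLookup {
  private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();

  String lookup() {
    // If the lock is already busy, prefer the uncached path over waiting.
    if (lock.readLock().tryLock() == false) {
      return "uncached";
    }
    try {
      return "cached"; // consult the cache while holding the read lock
    } finally {
      lock.readLock().unlock();
    }
  }
}
```

With the original single ReentrantLock, any concurrent reader made tryLock fail; with the read lock, only writers do, which is the source of the reduced contention.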

@boicehuang boicehuang changed the title Performance improvements to use read lock to access LRUQueryCache Performance improvements to use RWLock to access LRUQueryCache May 8, 2024
@boicehuang (Contributor, Author) commented May 8, 2024:

Improvement numbers for the currently submitted version of the code:

| doc count | field cardinality | query point | baseline QPS | candidate QPS | diff percentage | diff |
|---|---|---|---|---|---|---|
| 30000000 | 10 | 1 | 2481 | 4408 | 78% | using LongAdder; uniqueQueries as Collections.synchronizedMap; cache as IdentityHashMap |

@benwtrent (Member) left a comment:

I think this is ready for merging. I can do the merging, but won't backport to 9x until we see nightlies. They might catch something we missed.

@boicehuang could you add a changes entry to 9.11 under optimizations?

@boicehuang (Contributor, Author) replied:

> I think this is ready for merging. I can do the merging, but won't backport to 9x until we see nightlies. They might catch something we missed.
>
> @boicehuang could you add a changes entry to 9.11 under optimizations?

Finished

@benwtrent benwtrent merged commit 5ac88c7 into apache:main May 10, 2024
benwtrent pushed a commit that referenced this pull request May 14, 2024