[Searchable Snapshot] Design file caching mechanism for block based files

Currently searchable snapshots download Lucene files using a chunking approach to only download the data that is needed to service a query. It should use a node-level LRU cache that will use up to a configurable amount of local disk space to avoid re-downloading the same parts of frequently-accessed files. All shards on the node should share the same logical cache, meaning that if one shard is queried exclusively then it should use up to the entire cache space configured for the node.

Open questions:

* How is the cache size configured for a node? Is there a reasonable default if no configuration is provided?
* Where on disk should the data be cached? i.e. inside the same directory structure as the rest of the index data? Or are there use cases where the cache would want to be a dedicated disk or mount that would require a separate top-level directory?
* How should a node report cache statistics and/or utilization?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Searchable Snapshot] Design file caching mechanism for block based files #4964

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Searchable Snapshot] Design file caching mechanism for block based files #4964

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions