[Remote Store] Update index settings once all shard copies of an index move to remote enabled nodes #13253
Conversation
❌ Gradle check result for 498ec5c: FAILURE. Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
    // Let allocation service mark the incoming `RELOCATING` shard copies as `STARTED`
    maybeUpdatedState = allocationService.applyStartedShards(currentState, shardRoutingsToBeApplied);
    // Run remote store migration based tasks
    if (ongoingDocrepToRemoteMigration(currentState.metadata().settings())) {
Can reuse `RemoteStoreNodeService#isMigratingToRemoteStore` here.
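For context, a self-contained model of what such a migration check looks at: the cluster's compatibility mode must be `mixed` and the migration direction must point at the remote store. The setting keys follow the OpenSearch remote-store migration docs, but the class, method, and map-based signature here are illustrative assumptions, not the actual `RemoteStoreNodeService` code.

```java
import java.util.Map;

// Hypothetical sketch of a "migrating to remote store" check. Keys mirror the
// documented cluster settings but should be treated as assumptions here.
class MigrationCheck {
    static boolean isMigratingToRemoteStore(Map<String, String> clusterSettings) {
        // Both conditions must hold: mixed-mode cluster AND remote_store direction
        return "mixed".equals(clusterSettings.get("remote_store.compatibility_mode"))
            && "remote_store".equals(clusterSettings.get("migration.direction"));
    }
}
```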
    && (indexRoutingTable.shardsMatchingPredicateCount(ShardRouting::started) == indexRoutingTable.shardsMatchingPredicateCount(
        shardRouting -> discoveryNodes.get(shardRouting.currentNodeId()).isRemoteStoreNode()
    ));
    return allStartedShardsOnRemote && IndexMetadata.INDEX_REMOTE_STORE_ENABLED_SETTING.get(indexMetadata.getSettings()) == false;
We can short-circuit on `IndexMetadata.INDEX_REMOTE_STORE_ENABLED_SETTING.get(indexMetadata.getSettings()) == false` at the start of this function.
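A minimal sketch of that refactor, with invented names: evaluate the cheap "remote store already enabled" condition first and return before any routing-table work runs. The `Supplier` stands in for the expensive started-shards check.

```java
import java.util.function.Supplier;

// Hypothetical illustration of the suggested short-circuit; not the actual
// OpenSearch method.
class ShortCircuitCheck {
    static boolean needsRemoteSettingsUpdate(boolean remoteStoreAlreadyEnabled,
                                             Supplier<Boolean> allStartedShardsOnRemote) {
        if (remoteStoreAlreadyEnabled) {
            return false; // short circuit: settings are already applied, skip routing work
        }
        return allStartedShardsOnRemote.get();
    }
}
```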
    && (indexRoutingTable.shardsMatchingPredicateCount(ShardRouting::started) == indexRoutingTable.shardsMatchingPredicateCount(
        shardRouting -> discoveryNodes.get(shardRouting.currentNodeId()).isRemoteStoreNode()
    ));
    return allStartedShardsOnRemote && IndexMetadata.INDEX_REMOTE_STORE_ENABLED_SETTING.get(indexMetadata.getSettings()) == false;
The check comparing `indexRoutingTable.shardsMatchingPredicateCount(ShardRouting::started)` against `indexRoutingTable.shardsMatchingPredicateCount(shardRouting -> discoveryNodes.get(shardRouting.currentNodeId()).isRemoteStoreNode())` is not accurate, because the remote-node predicate does not restrict itself to started shards. We should instead check that every started shard is on a remote node.
We also need to make sure that all the primary shards are assigned. There can be a case where an index with one shard and zero replicas has that shard unassigned, while its copy lives on a docrep node that is temporarily down.
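The two review points above can be sketched as a small self-contained model: the index is ready for remote settings only when every primary is assigned and every started shard sits on a remote-store node. The `ShardCopy` class and all names here are illustrative stand-ins for the real routing-table types, not the actual OpenSearch code.

```java
import java.util.List;

// Hypothetical flattened view of one shard copy from the routing table.
class ShardCopy {
    final boolean primary;
    final boolean started;
    final boolean assigned;
    final boolean onRemoteNode;

    ShardCopy(boolean primary, boolean started, boolean assigned, boolean onRemoteNode) {
        this.primary = primary;
        this.started = started;
        this.assigned = assigned;
        this.onRemoteNode = onRemoteNode;
    }
}

class RemoteMigrationCheck {
    // All primaries must be assigned (guards against the "copy on a
    // temporarily-down docrep node" case), and every STARTED shard must be
    // on a remote node -- shards that are not started are not counted.
    static boolean readyForRemoteSettings(List<ShardCopy> shards) {
        boolean allPrimariesAssigned = shards.stream()
            .filter(s -> s.primary)
            .allMatch(s -> s.assigned);
        boolean startedShardsOnRemote = shards.stream()
            .filter(s -> s.started)
            .allMatch(s -> s.onRemoteNode);
        return allPrimariesAssigned && startedShardsOnRemote;
    }
}
```

Note that filtering to started shards before the `allMatch` is what fixes the count-comparison flaw: the two counts in the original code could coincide even when a started shard is still on a docrep node.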
     * @param discoveryNodes Current set of {@link DiscoveryNodes} in the cluster
     * @return {@link Tuple} with segment repository name as first element and translog repository name as second element
     */
    public static Tuple<String, String> getRemoteStoreRepositoryNames(DiscoveryNodes discoveryNodes) {
The function doesn't fit well here, and `Tuple` doesn't look like the right data structure for this. I would prefer two functions, one retrieving the translog repo and one retrieving the segment repo.
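A sketch of the suggested split, replacing the `Tuple`-returning helper with two single-purpose accessors. The attribute keys, method names, and map-based signature are illustrative assumptions, not the real OpenSearch identifiers.

```java
import java.util.Map;

// Hypothetical illustration: two narrow accessors instead of one method
// returning a Tuple whose element order the caller must remember.
class RemoteStoreRepoNames {
    static String getSegmentRepoName(Map<String, String> nodeAttributes) {
        return nodeAttributes.get("remote_store.segment.repository");
    }

    static String getTranslogRepoName(Map<String, String> nodeAttributes) {
        return nodeAttributes.get("remote_store.translog.repository");
    }
}
```

The design point is readability at the call site: `getSegmentRepoName(...)` is self-describing, whereas `tuple.v1()` versus `tuple.v2()` encodes meaning only by position.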
❌ Gradle check result for da6b91a: FAILURE. Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

❌ Gradle check result for c11bf32: ABORTED. Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

❌ Gradle check result for 4e15cf9: FAILURE. Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
Force-pushed 4e15cf9 to da6b91a
Force-pushed da6b91a to 8d9c389
❌ Gradle check result for da6b91a: FAILURE. Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
❌ Gradle check result for 8d9c389: FAILURE. Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
Description
Added logic within the `ShardStartedClusterStateTaskExecutor` to apply remote store based index settings and add remote path based index metadata once all shard copies of an index move over to remote store enabled nodes. Currently this logic only executes when the cluster is in `mixed` mode and there are remote enabled nodes present in the cluster.

The `execute` logic of this cluster state executor fetches the index names whose shards have been marked as `STARTED`, then references the current `RoutingTable` from the incoming `ClusterState` to figure out whether all shards of the index are in the `STARTED` state and all those shard copies are on remote store enabled nodes. If so, it mutates the incoming metadata by adding the remote store based settings, which is then persisted and published to the data nodes from the cluster manager.

Related Issues
Resolves: #13252
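The metadata mutation step from the description can be sketched as a small self-contained model: copy the existing index settings and layer the remote store settings on top before the new state is published. The `index.remote_store.*` keys mirror OpenSearch's documented remote-store index settings, but the class and method names, and the plain-map signature, are assumptions of this sketch rather than the actual executor code.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical model of the settings-mutation step; not the real
// ShardStartedClusterStateTaskExecutor logic.
class RemoteSettingsMutator {
    static Map<String, String> withRemoteStoreSettings(Map<String, String> current,
                                                       String segmentRepo,
                                                       String translogRepo) {
        Map<String, String> updated = new HashMap<>(current); // leave the input untouched
        updated.put("index.remote_store.enabled", "true");
        updated.put("index.remote_store.segment.repository", segmentRepo);
        updated.put("index.remote_store.translog.repository", translogRepo);
        return updated;
    }
}
```

Copying rather than mutating in place matches how cluster-state updates generally work: the incoming state is treated as immutable and a modified copy is what gets published.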
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.