Skip to content

chore: add host tag to persistence metrics#7530

Merged
fimanishi merged 3 commits intocadence-workflow:masterfrom
fimanishi:add-host-to-metered-metrics
Dec 17, 2025
Merged

chore: add host tag to persistence metrics#7530
fimanishi merged 3 commits intocadence-workflow:masterfrom
fimanishi:add-host-to-metered-metrics

Conversation

@fimanishi
Copy link
Member

@fimanishi fimanishi commented Dec 11, 2025

What changed?
Added host and datastore tag to persistence metrics. Add quota metric with host and datastore tags.

Why?
Provide more dimensions to the persistence metrics to help debug

How did you test it?
Unit tests

Potential risks

Release notes

Documentation Changes

Signed-off-by: fimanishi <fimanishi@gmail.com>
Signed-off-by: fimanishi <fimanishi@gmail.com>
Signed-off-by: fimanishi <fimanishi@gmail.com>
@fimanishi fimanishi merged commit 0e62c5a into cadence-workflow:master Dec 17, 2025
42 checks passed
arzonus added a commit to arzonus/cadence that referenced this pull request Dec 17, 2025
eleonoradgr added a commit to eleonoradgr/cadence that referenced this pull request Dec 17, 2025
eleonoradgr added a commit to eleonoradgr/cadence that referenced this pull request Dec 17, 2025
arzonus added a commit to arzonus/cadence that referenced this pull request Dec 18, 2025
arzonus added a commit to arzonus/cadence that referenced this pull request Dec 18, 2025
fimanishi added a commit that referenced this pull request Feb 4, 2026
<!-- 1-2 line summary of WHAT changed technically:
- Always link the relevant projects GitHub issue, unless it is a minor
bugfix
- Good: "Modified FailoverDomain mapper to allow null ActiveClusterName
#320"
- Bad: "added nil check" -->
**What changed?**
Reverted the addition of host tag to persistence calls.


<!-- Your goal is to provide all the required context for a future
maintainer
to understand the reasons for making this change (see
https://cbea.ms/git-commit/#why-not-how).
How did this work previously (and what was wrong with it)? What has
changed, and why did you solve it
this way?
- Good: "Active-active domains have independent cluster attributes per
region. Previously,
modifying cluster attributes required spedifying the default
ActiveClusterName which
updates the global domain default. This prevents operators from updating
regional
configurations without affecting the primary cluster designation. This
change allows
  attribute updates to be independent of active cluster selection."
- Bad: "Improves domain handling" -->
**Why?**
The addition caused metrics to have a very high cardinality. The metric
is not really necessary and was added to add more visibility to
persistence requests.


<!-- Include specific test commands and setup. Please include the exact
commands such that
another maintainer or contributor can reproduce the test steps taken. 
- e.g Unit test commands with exact invocation
`go test -v ./common/types/mapper/proto -run TestFailoverDomainRequest`
- For integration tests include setup steps and test commands
Example: "Started local server with `./cadence start`, then ran `make
test_e2e`"
- For local simulation testing include setup steps for the server and
how you ran the tests
- Good: Full commands that reviewers can copy-paste to verify
- Bad: "Tested locally" or "Added tests" -->
**How did you test it?**
Updated all the unit tests related to the metric added.


<!-- If there are risks that the release engineer should know about
document them here.
For example:
- Has an API/IDL been modified? Is it backwards/forwards compatible? If
not, what are the repecussions?
- Has a schema change been introduced? Is it possible to roll back?
- Has a feature flag been re-used for a new purpose? 
- Is there a potential performance concern? Is the change modifying core
task processing logic?
- If truly N/A, you can mark it as such -->
**Potential risks**
This actually address the risk of metrics with very high cardinality


<!-- If this PR completes a user facing feature or changes functionality
add release notes here.
Your release notes should allow a user and the release engineer to
understand the changes with little context.
Always ensure that the description contains a link to the relevant
GitHub issue. -->
**Release notes**
This revert addresses the issue of high metric cardinality introduced by
#7530

<!-- Consider whether this change requires documentation updates in the
Cadence-Docs repo
- If yes: mention what needs updating (or link to docs PR in
cadence-docs repo)
- If in doubt, add a note about potential doc needs
- Only mark N/A if you're certain no docs are affected -->
**Documentation Changes**


---

## Reviewer Validation

**PR Description Quality** (check these before reviewing code):

- [ ] **"What changed"** provides a clear 1-2 line summary
  - [ ] Project Issue is linked
- [ ] **"Why"** explains the full motivation with sufficient context
- [ ] **Testing is documented:**
- [ ] Unit test commands are included (with exact `go test` invocation)
- [ ] Integration test setup/commands included (if integration tests
were run)
  - [ ] Canary testing details included (if canary was mentioned)
- [ ] **Potential risks** section is thoughtfully filled out (or
legitimately N/A)
- [ ] **Release notes** included if this completes a user-facing feature
- [ ] **Documentation** needs are addressed (or noted if uncertain)

Signed-off-by: fimanishi <fimanishi@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants