fix(custom-resources): waiter state machine retry fails with ExecutionAlreadyExists by newlinedeveloper · Pull Request #35988 · aws/aws-cdk

newlinedeveloper · 2025-11-08T11:14:23Z

Description

Fixes an issue where retrying a CloudFormation deployment that uses a custom resource with an async waiter fails with ExecutionAlreadyExists error.

Root Cause

The custom resource provider framework uses CloudFormation's RequestId as the Step Functions execution name when starting the waiter state machine. When CloudFormation retries a failed deployment, it reuses the same RequestId. Since Step Functions execution names must be unique for 90 days, subsequent retry attempts fail with ExecutionAlreadyExists.

Solution

Removed the name parameter from the startExecution call, allowing Step Functions to auto-generate unique execution names. This is the recommended approach per the AWS Step Functions StartExecution API Reference, where the name parameter is optional and Step Functions will automatically generate a universally unique identifier (UUID) as the execution name if not provided.

Changes

Removed name: resourceEvent.RequestId from the waiter state machine execution call in framework.ts
Updated log statement to remove the name field
Added unit test to verify that name is not included in the startExecution call

Testing

Added unit test waiter state machine execution does not include name field (allows retries) to verify the fix
All existing unit tests pass
Verified that the mock assertion checks for name being undefined

Related Issue

Fixes #35957

Verification

The fix was verified by:

Running unit tests to ensure the name field is not included
Confirming that existing tests continue to pass
The change aligns with AWS Step Functions best practices for execution naming

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license

aws-cdk-automation

(This review is outdated)

newlinedeveloper · 2025-11-08T11:20:43Z

Exemption Request

This fix is in runtime code (Lambda function execution) and does not change CloudFormation templates or infrastructure. The existing integration tests verify infrastructure creation, which is unaffected by this change. Unit tests provide comprehensive coverage of the runtime behavior change.

…e execution

vvigilante · 2025-11-11T09:00:27Z

alternatively we could forward the request id from the lambda. That should never repeat.

packages/aws-cdk-lib/custom-resources/test/provider-framework/runtime.test.ts

Abogical · 2025-11-13T14:00:01Z

I have confirmed that this PR fixes the issue.

✅ Updated pull request passes all PRLinter validations. Dismissing previous PRLinter review.

…ustom-resources-waiter-retry-execution-name

Pull request has been modified.

mrgrain · 2025-11-17T09:27:03Z

Integration test failure are expected due to the changed asset. They are not caused by the new integ-runner engine. You'll need to work with your PR reviewer to update all snapshots. For framework changes like this, I'd typically recommend that a CDK team member is doing this for you.

…ustom-resources-waiter-retry-execution-name

newlinedeveloper · 2025-11-18T07:11:19Z

Hi @Abogical @pahud , Need review and approval for this PR to be closed . Thanks

Abogical

the linter has passed but you'll need to update the integ test snapshots. See https://github.com/aws/aws-cdk/blob/main/INTEGRATION_TESTS.md#running-integration-tests

yarn integ-runner --directory packages/@aws-cdk --update-on-failed

…name

Abogical · 2025-12-10T12:49:40Z

I've updated the snapshots to your PR branch directly.

Abogical · 2025-12-10T14:21:39Z

@newlinedeveloper
There are other snapshots to be uploaded but I don't have access to push LFS files to your fork. Can you merge this PR which will push the changes to your fork? newlinedeveloper#1

update snapshots again

…name

Pull request has been modified.

Abogical

For the record, only 2 snapshots failed to deploy with the new snapshot changes by the Gtihub workflow. Deploying them locally however works:

packages/@aws-cdk-testing/framework-integ/test/aws-codebuild/test/integ.project-fleet.js
packages/@aws-cdk-testing/framework-integ/test/aws-dynamodb/test/integ.global.js

mergify · 2025-12-11T10:57:00Z

Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork).

mergify · 2025-12-11T10:57:06Z

github-actions · 2025-12-11T10:57:14Z

Comments on closed issues and PRs are hard for our team to see.
If you need help, please open a new issue that references this one.

github-actions bot added beginning-contributor [Pilot] contributed between 0-2 PRs to the CDK bug This issue is a bug. effort/medium Medium work item – several days of effort p1 labels Nov 8, 2025

aws-cdk-automation requested a review from a team November 8, 2025 11:14

aws-cdk-automation previously requested changes Nov 8, 2025

View reviewed changes

aws-cdk-automation added the pr-linter/exemption-requested The contributor has requested an exemption to the PR Linter feedback. label Nov 8, 2025

fix(custom-resources): remove name parameter from waiter state machin…

6d329d8

…e execution

newlinedeveloper force-pushed the fix/custom-resources-waiter-retry-execution-name branch from 2a0d935 to 6d329d8 Compare November 8, 2025 13:29

vvigilante approved these changes Nov 11, 2025

View reviewed changes

Abogical self-assigned this Nov 12, 2025

Abogical previously requested changes Nov 13, 2025

View reviewed changes

packages/aws-cdk-lib/custom-resources/test/provider-framework/runtime.test.ts Outdated Show resolved Hide resolved

newlinedeveloper added 2 commits November 15, 2025 12:23

Merge branch 'main' of github.com:newlinedeveloper/aws-cdk into fix/c…

a894ece

…ustom-resources-waiter-retry-execution-name

fixed the trailing space issue

2971d1c

newlinedeveloper requested a review from Abogical November 15, 2025 07:04

This was referenced Nov 15, 2025

CI Pending Approval - 41 PR(s) pahud/aws-cdk#11

Open

Failed CodeBuild CI - 32 PR(s) pahud/aws-cdk#17

Open

Merge branch 'main' of github.com:newlinedeveloper/aws-cdk into fix/c…

f0feb08

…ustom-resources-waiter-retry-execution-name

Abogical previously requested changes Nov 18, 2025

View reviewed changes

Abogical mentioned this pull request Nov 18, 2025

fix(eks): inconsistent fargateProfileName handling causes deletion failure when physicalResourceId exceeds 100 characters #36075

Closed

3 tasks

Merge branch 'main' into fix/custom-resources-waiter-retry-execution-…

e01b3f0

…name

Abogical had a problem deploying to deployment-integ-test December 10, 2025 10:51 — with GitHub Actions Error

update snapshot

44f73cd

Abogical had a problem deploying to deployment-integ-test December 10, 2025 12:48 — with GitHub Actions Error

Merge branch 'main' into fix/custom-resources-waiter-retry-execution-…

3857487

…name

Abogical had a problem deploying to deployment-integ-test December 10, 2025 12:49 — with GitHub Actions Failure

update snapshots again

850851f

Merge pull request #1 from aws/waiter-state

ea88702

update snapshots again

newlinedeveloper had a problem deploying to deployment-integ-test December 10, 2025 14:50 — with GitHub Actions Error

aws-cdk-automation added the pr/needs-maintainer-review This PR needs a review from a Core Team Member label Dec 10, 2025

aws-cdk-automation had a problem deploying to deployment-integ-test December 10, 2025 15:23 — with GitHub Actions Error

Merge branch 'main' into fix/custom-resources-waiter-retry-execution-…

3ccec41

…name

Abogical had a problem deploying to deployment-integ-test December 10, 2025 16:07 — with GitHub Actions Failure

Abogical previously approved these changes Dec 10, 2025

View reviewed changes

aws-cdk-automation removed the pr/needs-maintainer-review This PR needs a review from a Core Team Member label Dec 10, 2025

Merge branch 'main' into fix/custom-resources-waiter-retry-execution-…

551220d

…name

Abogical had a problem deploying to deployment-integ-test December 11, 2025 10:05 — with GitHub Actions Error

aws-cdk-automation added the pr/needs-maintainer-review This PR needs a review from a Core Team Member label Dec 11, 2025

aws-cdk-automation had a problem deploying to deployment-integ-test December 11, 2025 10:35 — with GitHub Actions Failure

Abogical removed the pr/needs-integration-tests-deployment Requires the PR to deploy the integration test snapshots. label Dec 11, 2025

Abogical approved these changes Dec 11, 2025

View reviewed changes

aws-cdk-automation removed the pr/needs-maintainer-review This PR needs a review from a Core Team Member label Dec 11, 2025

mergify bot merged commit 36ea606 into aws:main Dec 11, 2025
40 of 46 checks passed

github-actions bot locked as resolved and limited conversation to collaborators Dec 11, 2025

Conversation

newlinedeveloper commented Nov 8, 2025

Description

Root Cause

Solution

Changes

Testing

Related Issue

Verification

Uh oh!

aws-cdk-automation left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

newlinedeveloper commented Nov 8, 2025

Uh oh!

vvigilante commented Nov 11, 2025

Uh oh!

Uh oh!

Abogical commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mrgrain commented Nov 17, 2025

Uh oh!

newlinedeveloper commented Nov 18, 2025

Uh oh!

Abogical left a comment

Choose a reason for hiding this comment

Uh oh!

Abogical commented Dec 10, 2025

Uh oh!

Abogical commented Dec 10, 2025

Uh oh!

Abogical left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mergify bot commented Dec 11, 2025

Uh oh!

Uh oh!

mergify bot commented Dec 11, 2025

Merge Queue Status

Uh oh!

github-actions bot commented Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

aws-cdk-automation left a comment •

edited

Loading

Abogical commented Nov 13, 2025 •

edited

Loading

Abogical left a comment •

edited

Loading