fix(pipelines): ensure assets with same hash but different destinations are published separately #34790
Conversation
@@ -119,7 +119,7 @@ test('Policy sizes do not exceed the maximum size', () => {

  // WHEN
  const regions = ['us-east-1', 'us-east-2', 'eu-west-1', 'eu-west-2', 'somethingelse1', 'somethingelse-2', 'yapregion', 'more-region'];
- for (let i = 0; i < 70; i++) {
+ for (let i = 0; i < 60; i++) {
?
I was facing a validation error when testing this after the change. The resources in the stack exceeded the 500 limit:
ValidationError: Number of resources in stack 'PipelineStack': 515 is greater than allowed maximum of 500: AWS::KMS::Key (1), AWS::KMS::Alias (1), AWS::S3::Bucket (1), AWS::S3::BucketPolicy (1), AWS::IAM::Role (145), AWS::IAM::Policy (145), AWS::IAM::ManagedPolicy (7), AWS::CodePipeline::Pipeline (1), AWS::CodePipeline::Webhook (1), AWS::CodeBuild::Project (212)
I'm struggling to understand the problem and the proposed solution. The linked bug report says:
What does "packaging" mean in this context? The PR body says:
This means both are being published in the same CodeBuild project?
I'm not sure I follow why it would only be published once? Can you go one level deeper and explain what you've diagnosed is happening?
@rix0rrr thank you for taking a look:
I meant packaging for the asset publishing step in the pipeline. The pipeline creates CodeBuild projects that run the cdk-assets publish commands for each asset. The current behaviour: consider a pipeline with 2 stages where the only difference between the 2 stages is the qualifier, so the asset publishing role ARN differs between them. Because publishing is keyed only by the asset hash, the asset ends up being published for only one of the two destinations. So the change creates a composite key (asset id plus publishing role ARN), since the publishing role is different in the above scenario.
let assetNode = this.assetNodes.get(stackAsset.assetId);
// Create a unique key that includes both asset ID and destination information
// This ensures assets with the same hash but different destinations are published separately
const assetKey = `${stackAsset.assetId}:${stackAsset.assetPublishingRoleArn || 'default'}`;
If I understand correctly, the problem is that two assets in two different manifests have fully the same identifiers, a la 35b88d91656eb5908:111111-us-east-1, but they have different properties. In your case, they have different role ARNs to publish under.
This specific fix would work if the only difference between two assets was the role ARN. But it ignores other things that could be different between them. For example, the destination bucketName could be different. Are you also going to mix the bucket name into this identifier? Or the objectKey? You should mix in the objectKey as well. But that is only for file assets, don't forget to consider container image assets.
I think a simpler and more scalable approach would probably be to make sure the destination identifier of an asset depends on the destination's properties, not just their account and region. For example, by hashing the destination object. In that case, if the role between 2 otherwise identical assets is different, the identifier will be unique and all subsequent code will just automatically pick that up.
In this example, the asset identifiers would be something like
35b88d91656eb5908:111111-us-east-1-2b6edb
35b88d91656eb5908:111111-us-east-1-84a92f
^^^ purposely keeping the hash part here short to avoid overwhelming
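Roughly, a sketch of what that could look like (the destination field names below are illustrative, not the exact cdk-assets schema):

import * as crypto from 'crypto';

// Illustrative destination shape only; the real file-asset destination has its own schema.
interface DestinationLike {
  readonly bucketName?: string;
  readonly objectKey?: string;
  readonly assumeRoleArn?: string;
}

// Derive a short, stable suffix from the destination's own properties, so that two
// destinations in the same account and region that differ only in role ARN (or bucket,
// or object key) still get distinct identifiers.
function destinationIdSuffix(dest: DestinationLike): string {
  return crypto.createHash('sha256')
    .update(JSON.stringify(dest))
    .digest('hex')
    .slice(0, 6);
}

// e.g. `${account}-${region}-${destinationIdSuffix(dest)}`
//      -> '111111-us-east-1-2b6edb' vs '111111-us-east-1-84a92f'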
You would do that here:
Thank you, I'll test this out and revert.
Just to be clear, we would still expect there to be one publishing entry per asset, just that its destination list would contain both targets (both synthesizer buckets). As such, I believe you would need to make sure the value in the assetSelector has the hash mentioned above.
Yes, with the change suggested by @rix0rrr, this is the behavior now. For the same asset, I see one file asset stage created but multiple destinations:
{
  "version": "0.2",
  "phases": {
    "install": {
      "commands": [
        "npm install -g cdk-assets@latest"
      ]
    },
    "build": {
      "commands": [
        "cdk-assets --path \"assembly-pipeline-asset-stack-Staging/pipelineassetstackStagingdevlambdastackEC748226.assets.json\" --verbose publish \"a26bd817a0dac44954b5caf83f5880a96f831e43b56157224e073b49f236eb4e:current_account-us-east-1-2519ad1f\"",
        "cdk-assets --path \"assembly-pipeline-asset-stack-Production/pipelineassetstackProductionprdlambdastack4E5ABBC0.assets.json\" --verbose publish \"a26bd817a0dac44954b5caf83f5880a96f831e43b56157224e073b49f236eb4e:current_account-us-east-1-0b44228e\""
      ]
    }
  }
}
for the following stack:
import * as path from 'path';
import { Stack, StackProps } from 'aws-cdk-lib';
import * as lambda from 'aws-cdk-lib/aws-lambda';
import { Construct } from 'constructs';

export class LambdaStack extends Stack {
  constructor(scope: Construct, id: string, props?: StackProps) {
    super(scope, id, props);
    new lambda.Function(this, 'LambdaFN', {
      runtime: lambda.Runtime.PYTHON_3_10,
      handler: 'index.handler',
      code: props?.env?.account == 'xxxxxxxxxxx'
        ? lambda.Code.fromAsset(path.join(__dirname, 'testhelpers', 'assets', 'test-docker-asset'))
        : lambda.Code.fromAsset(path.join(__dirname, 'testhelpers', 'assets')),
    });
  }
}
There are 2 PipelineAssetsFileAsset CodeBuild projects.
I think if we tweak the destination ID to not accidentally be the same as a logically different destination in the same account and region (by means of appending a hash), no other changes will be necessary. All the rest will sort itself out automatically.
It's just that right now we are deduplicating based on false information.
I am testing by appending a hash derived from the stack name, and it is working as expected:
private manifestEnvName(stack: Stack): string {
  const account = resolvedOr(stack.account, 'current_account');
  const region = resolvedOr(stack.region, 'current_region');

  const destinationProps = {
    account,
    region,
    stackName: stack.stackName,
  };

  const destinationHash = crypto.createHash('sha256')
    .update(JSON.stringify(destinationProps))
    .digest('hex')
    .slice(0, 8);

  return `${account}-${region}-${destinationHash}`;
}
Cool! If you don't mix this into manifestEnvName but do the hash calculation based on destinationProps in the place where manifestEnvName is called, I think we're done.
const destinationProps = {
  account,
  region,
  stackName: stack.stackName,
Mixing in the stackName will make it so that assets that are duplicated between stacks must be uploaded twice. That is a stronger guarantee than we need. Try the actual destination's properties instead.
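As a reference point, here is a minimal sketch of hashing the destination's own properties at the call site; the property names are assumptions, not the exact cdk-assets schema:

import * as crypto from 'crypto';

// Sketch only: the property names here are assumptions, not the exact cdk-assets schema.
// Hashing the destination's properties instead of stack.stackName keeps an identical asset
// that is shared between stacks deduplicated, while destinations that differ only in role
// ARN or bucket still get distinct environment names.
interface AssumedDestinationProps {
  readonly assumeRoleArn?: string;
  readonly bucketName?: string;
  readonly objectKey?: string;
}

function destinationEnvName(account: string, region: string, dest: AssumedDestinationProps): string {
  const destinationHash = crypto.createHash('sha256')
    .update(JSON.stringify({ account, region, ...dest }))
    .digest('hex')
    .slice(0, 8);
  return `${account}-${region}-${destinationHash}`;
}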
Issue
Closes #31070
Reason for this change
Assets fail to be published in CDK pipelines when stacks with different synthesizers are used for the same account and region. When assets have identical content hashes but need to be published to different destinations (different publishing role ARNs), they were incorrectly grouped together, causing each asset to be published to only one destination instead of all required destinations.
Description of changes
• Modified the publishAsset() method in packages/aws-cdk-lib/pipelines/lib/helpers-internal/pipeline-graph.ts
• Changed the asset tracking key from only stackAsset.assetId to a composite key:
${stackAsset.assetId}:${stackAsset.assetPublishingRoleArn || 'default'}
• This ensures assets with the same content hash but different destinations are treated as separate publishing jobs
Describe any new or updated permissions being added
NA
Description of how you validated changes
Checked with the code in #31070 and made sure there are 2 asset stages; locally ran the asset commands and verified that they are being deployed to the right buckets.
Checklist
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license