[Codegen][Tuner] merge the default td specs by bangtianliu · Pull Request #20127 · iree-org/iree · GitHub


Merged · 13 commits · Mar 21, 2025

Conversation

@bangtianliu (Contributor) commented Feb 28, 2025:

This PR implements merging of all the default td specs (when every inner module is a default td spec) in the LinkTuningSpecsPass. It merges the multiple transform.foreach_match operations found across the different inner modules into a single consolidated operation inside a newly created top-level __kernel_config NamedSequenceOp.
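As a rough illustration of the transformation (the attribute and entry-point names follow IREE's tuning-spec conventions; the matcher/action names and module names are hypothetical, and bodies are abbreviated):

```mlir
// Before merging: two nested default td specs, each with its own entry point.
module @td_module attributes { transform.with_named_sequence } {
  module @spec_a attributes { transform.with_named_sequence,
                              iree_codegen.tuning_spec_with_default_entrypoint } {
    transform.named_sequence @__kernel_config(%arg0: !transform.any_op {transform.consumed})
        -> !transform.any_op attributes { iree_codegen.tuning_spec_entrypoint } {
      %res = transform.foreach_match in %arg0
          @match_a -> @apply_config_a : (!transform.any_op) -> !transform.any_op
      transform.yield %res : !transform.any_op
    }
  }
  // @spec_b has the same shape, with @match_b -> @apply_config_b.
}

// After merging: one top-level __kernel_config with a single consolidated
// foreach_match combining the matcher/action pairs from both inner modules.
transform.named_sequence @__kernel_config(%arg0: !transform.any_op {transform.consumed})
    -> !transform.any_op attributes { iree_codegen.tuning_spec_entrypoint } {
  %res = transform.foreach_match in %arg0
      @match_a -> @apply_config_a,
      @match_b -> @apply_config_b : (!transform.any_op) -> !transform.any_op
  transform.yield %res : !transform.any_op
}
```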

Issue: nod-ai/shark-ai#810

@bangtianliu bangtianliu requested a review from hanhanW as a code owner February 28, 2025 01:01
@bangtianliu bangtianliu marked this pull request as draft February 28, 2025 01:01
@bangtianliu bangtianliu requested review from kuhar and Max191 February 28, 2025 01:01
@bangtianliu bangtianliu force-pushed the merge_default_specs branch 4 times, most recently from 5e55989 to 30e7c43 Compare February 28, 2025 17:40
@bangtianliu bangtianliu marked this pull request as ready for review February 28, 2025 18:09
@bangtianliu bangtianliu force-pushed the merge_default_specs branch from eeaefb0 to 668fad2 Compare March 3, 2025 00:27
@kuhar (Member) left a comment:

To thoroughly test this code, we can start by assuming that tuning specs with default entrypoints should always link, and then emit a warning whenever we notice they can't be linked. This is something we can test with verify-diagnostics.

The way we usually do this in compilers is that we try to have two phases:

  1. Analysis that determines transformation legality
  2. Transformation that cannot bail out

@bangtianliu bangtianliu force-pushed the merge_default_specs branch from 665cf37 to 0004cc1 Compare March 4, 2025 21:14
@bangtianliu bangtianliu requested a review from kuhar March 4, 2025 21:21
@Max191 (Contributor) left a comment:

Leaving initial comments. Coming back to look at the big emitLinkedDefaultTuningSpec later this morning.

@Max191 (Contributor) left a comment:

High level question: Do we need to be checking for the default tuning spec attr before merging? This seems useful for non-default specs too, and it is already matching for an expected format of the nested modules.

@bangtianliu (Contributor, Author):

> High level question: Do we need to be checking for the default tuning spec attr before merging? This seems useful for non-default specs too, and it is already matching for an expected format of the nested modules.

Yes, I think using the attribute as a hint to simplify parts of the implementation could be another TODO item. CC @kuhar.

@kuhar (Member) commented Mar 5, 2025:

@Max191 note that this checks for attributes that say that the tuning spec has a default entry point, not that it's a default spec in the compiler. A default spec is a spec with a default entrypoint that we decide to ship with the compiler.

So to directly answer your question: yes, I think it makes sense to restrict this to a subset of transform scripts and also switch the tuner to use that format.

@Max191 (Contributor) commented Mar 5, 2025:

> @Max191 note that this checks for attributes that say that the tuning spec has a default entry point, not that it's a default spec in the compiler. A default spec is a spec with a default entrypoint that we decide to ship with the compiler.
>
> So to directly answer your question: yes, I think it makes sense to restrict this to a subset of transform scripts and also switch the tuner to use that format.

I see, thanks for the clarification!

@kuhar kuhar requested a review from andfau-amd March 5, 2025 16:08
@Max191 (Contributor) commented Mar 5, 2025:

Actually, followup question: Do we need the extra attribute on the module? We could just check that there is a named_sequence with iree_codegen.tuning_spec_entrypoint inside the module, right?

Seems like it's just an extra hint for the linking pass that isn't really necessary to me. Am I missing something?

@kuhar (Member) commented Mar 5, 2025:

We use it to trigger the attribute verifier over the whole module.

@kuhar (Member) commented Mar 5, 2025:

> We could just check that there is a named_sequence with iree_codegen.tuning_spec_entrypoint inside the module, right?

There's more than that: we also verify the entry point function signature (name and types) and expect a single foreach_match op in the body. And that there's exactly one entry point like that.
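Putting those requirements together, a spec that passes this verification looks roughly like the following (attribute and entry-point names per IREE's tuning-spec conventions; the matcher/action pair is illustrative and their bodies are elided):

```mlir
module attributes { transform.with_named_sequence,
                    iree_codegen.tuning_spec_with_default_entrypoint } {
  // Exactly one entry point, with the expected name, argument, and result types.
  transform.named_sequence @__kernel_config(%arg0: !transform.any_op {transform.consumed})
      -> !transform.any_op attributes { iree_codegen.tuning_spec_entrypoint } {
    // Exactly one foreach_match op in the body.
    %res = transform.foreach_match in %arg0
        @match -> @apply_config : (!transform.any_op) -> !transform.any_op
    transform.yield %res : !transform.any_op
  }
}
```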

@Max191 (Contributor) commented Mar 5, 2025:

> We use it to trigger the attribute verifier over the whole module

I see, this is what I was missing, thanks!

> There's more than that: we also verify the entry point function signature (name and types) and expect a single foreach_match op in the body. And that there's exactly one entry point like that.

Sorry, this is what I had meant (i.e., run the verification of the full module), but I see now that we have this attribute plumbed through in verifyOperationAttribute. It all makes sense to me now!

@bangtianliu (Contributor, Author):

I've just submitted a commit addressing all of Max's comments. I'll follow up with another PR to add verification for iree_codegen.tuning_spec_entrypoint, ensuring that a single foreach_match op is present in the body. Once that verification PR is merged, this PR can be further simplified.

@bangtianliu bangtianliu force-pushed the merge_default_specs branch from ab6039e to f97aeda Compare March 6, 2025 21:07
@bangtianliu (Contributor, Author):

I will further simplify this PR after #20173 is landed.

@kuhar (Member) commented Mar 6, 2025:

> I will further simplify this PR after #20173 is landed.

You can rebase this PR on top of #20173 before it lands; that will make code review easier.

@bangtianliu bangtianliu requested a review from kuhar March 18, 2025 22:38
@bangtianliu bangtianliu force-pushed the merge_default_specs branch from e9351ef to 8c50308 Compare March 18, 2025 23:33
A reviewer (Member) commented on this snippet:

```cpp
llvm::DenseMap<NamedSequenceOp, ForeachMatchOp>
    &namedSequenceToForeachMatch) {
  StringRef specName = op.getSymName();
  std::string newSpecName = getUniqueSpecName(specName, specNameCounts);
```

I'd like to avoid setting the suffix in one place only to forget it, try to recover this information in getBaseName, and update it again in getUniqueSpecName. This is bad design.

I'd rather do one of:

  1. only set it under a flag in emitLinkedTuningSpec and decide it in the code that does merging if it has different naming requirements
  2. add a different name-uniquing mode to emitLinkedTuningSpec to do the right thing for merging
  3. update all names as a pre-processing step that's common across both code paths (linking and merging)
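For context, the naming issue exists because different inner specs can define clashing symbol names that linking must uniquify, and every reference to a renamed symbol must be updated. A hypothetical example (the counter-suffix scheme is illustrative, and the named_sequence bodies are elided):

```mlir
// Both @spec_a and @spec_b originally define @apply_op_config. After merging,
// the second definition is renamed with a suffix, and the consolidated
// foreach_match must refer to the renamed symbol.
transform.named_sequence @apply_op_config(...)    // from @spec_a, keeps its name
transform.named_sequence @apply_op_config_1(...)  // from @spec_b, renamed

%res = transform.foreach_match in %arg0
    @match_a -> @apply_op_config,
    @match_b -> @apply_op_config_1 : (!transform.any_op) -> !transform.any_op
```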

@kuhar (Member) commented Mar 19, 2025:

Overall looks good, the only remaining issue I see is the way we handle names.

Signed-off-by: Bangtian Liu <liubangtian@gmail.com>
@bangtianliu bangtianliu force-pushed the merge_default_specs branch from 8c50308 to 4aa2fff Compare March 19, 2025 21:26
@bangtianliu bangtianliu requested a review from kuhar March 19, 2025 21:26
@bangtianliu bangtianliu requested a review from kuhar March 20, 2025 19:50
@Max191 (Contributor) left a comment:

Overall LGTM, please address the comments before landing, though.

@Max191 (Contributor) commented Mar 20, 2025:

(Also wait for Jakub's review before landing)


@bangtianliu bangtianliu force-pushed the merge_default_specs branch from c3c599c to bb276c2 Compare March 20, 2025 21:22
@bangtianliu bangtianliu requested a review from kuhar March 20, 2025 21:29
@kuhar (Member) left a comment:

LGTM

@kuhar (Member) commented Mar 21, 2025:

Please update the PR description before merging; this PR does not fix the issue:

> fixes nod-ai/shark-ai#810

You can replace this with "Issue: <url>" to link to the issue without closing it after merging.

@kuhar kuhar merged commit 8e4ce9c into iree-org:main Mar 21, 2025
212 of 228 checks passed
4 participants