[Im2Col] Support converting group convs to im2col #20611

rkayaith · 2025-04-23T03:40:53Z

This adds support for converting group convs to im2col, allowing them to go down the IGEMM path.

Group dimensions are parallel iterator dims that index into the image, filter, and output. For im2col they are treated as a batch dimension.

This also fixes #20498

rkayaith · 2025-04-23T03:46:03Z

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/ConvertConvToIm2ColOp.cpp

-  collectDimExprs(inputMap.getResults(), inputDimsSet);
-  collectDimExprs(filterMap.getResults(), filterDimsSet);
-
-  // Get shared dims from input and filter in order of appearance.


The previous logic included dimensions shared between the input and filter, but group dimensions are included in that and we don't want them here.

rkayaith · 2025-04-23T04:02:28Z

On MI300X this results in a ~90% reduction in execution time on a number of configurations (benchmarked using boo_driver):

convbfp16 -n 2 -c 896 -H 59 -W 91 -k 896 -y 3 -x 3 -p 1 -q 1 -u 1 -v 1 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 16 -F 1 -t 1              -5033.66 (-94.3%)
convbfp16 -n 2 -c 448 -H 118 -W 182 -k 448 -y 3 -x 3 -p 1 -q 1 -u 1 -v 1 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 8 -F 1 -t 1             -9760.99 (-91.4%)
convbfp16 -n 2 -c 224 -H 470 -W 725 -k 224 -y 3 -x 3 -p 1 -q 1 -u 2 -v 2 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 4 -F 1 -t 1             -11450.01 (-93.5%)
convbfp16 -n 2 -c 224 -H 235 -W 363 -k 224 -y 3 -x 3 -p 1 -q 1 -u 1 -v 1 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 4 -F 1 -t 1             -11380.67 (-94.2%)
convbfp16 -n 2 -c 448 -H 235 -W 363 -k 448 -y 3 -x 3 -p 1 -q 1 -u 2 -v 2 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 8 -F 1 -t 1             -9761.35 (-90.8%)
convbfp16 -n 2 -c 896 -H 118 -W 182 -k 896 -y 3 -x 3 -p 1 -q 1 -u 2 -v 2 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 16 -F 1 -t 1            -5039.59 (-93.7%)
convbfp16 -n 2 -c 2016 -H 59 -W 91 -k 2016 -y 3 -x 3 -p 1 -q 1 -u 2 -v 2 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 36 -F 1 -t 1            -1504.16 (-84.7%)

nirvedhmeshram

Nice work! I have a few minor questions and comments.

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/ConvertConvToIm2ColOp.cpp

nirvedhmeshram · 2025-04-23T16:47:23Z

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/test/conv_to_im2col.mlir

+// CHECK:      %[[IM2COL:.+]] = iree_linalg_ext.im2col
+// CHECK-SAME:   strides = [1, 1] dilations = [1, 1] kernel_size = [3, 3]
+// CHECK-SAME:   m_offset = [0, 0] * [8, 1] k_offset = [0] * [1]
+// CHECK-SAME:   batch_pos = [3, 0] m_pos = [1, 2] k_pos = [4]


I am wondering why the group dimension is bubbled out like this to the front, to me doing [0,3] seems more natural.

It's trivial to change, I can put the batch dims first if preferred. Does this materially affect the code generation? I was curious if this affects performance but in my earlier tests it didn't.

I think it creates a transpose access pattern when writing the results back, it might matter if we start doing something fancy there but you are right might not make a difference now but lets change to [0,3] so atleast we tried to keep the dimensions in the same sequence within the limits of what im2col needs which is (batch,M,reductions)

I've changed the ordering here: 97e70b2

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/test/conv_to_im2col.mlir

nirvedhmeshram · 2025-04-23T16:59:42Z

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/ConvertConvToIm2ColOp.cpp

+      int64_t igemmInputDim = igemmConvDetails.getIgemmInputImageMap()
+                                  .getResultPosition(dimExpr)
+                                  .value();
+      batchPos[igemmInputDim] = im2colInputDim;


Is this relying on say a dim like "d3" in the original map stays d3 in the new map. Wondering if this is a fairly safe assumption and if not should we guard against a wrong mapping if that doesnt happen.

good point, I think with linalg.generic it should be possible to come up with a case where they don't match. let me try and come up with a test+fix

I've fixed this here by including the dimension mapping in igemmConvDetails and using that to map to the correct dimension: 9f5a51c

yzhang93

Overall LGTM. Just some minor comments.

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/ConvertConvToIm2ColOp.cpp

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/test/conv_to_im2col.mlir

yzhang93 · 2025-04-23T17:12:26Z

compiler/src/iree/compiler/Dialect/LinalgExt/Transforms/test/conv_to_im2col.mlir

+// CHECK-SAME:   %[[IMG:.+]]: [[IMG_T:tensor<1x10x10x7x4xf32>]]
+// CHECK-SAME:   %[[FIL:.+]]: [[FIL_T:tensor<7x16x3x3x4xf32>]]
+// CHECK-SAME:   %[[OUT:.+]]: [[OUT_T:tensor<1x8x8x7x16xf32>]]
+// CHECK:      %[[EMPTY:.+]] = tensor.empty() : [[LHS_T:tensor<7x1x8x8x36xf32>]]


Does it require the g dimension to be in front? Does it work when the n dimension is larger than 1? It's better to modify either of these two tests to cover a case that n>1.

the dimension order and size doesn't matter. I've updated the tests to use a non-unit batch size

nirvedhmeshram

LGTM

yzhang93

Thanks for the changes and improvement! LGTM.

This adds support for converting group convs to im2col, allowing them to go down the IGEMM path. Group dimensions are parallel iterator dims that index into the image, filter, and output. For im2col they are treated as a batch dimension. This also fixes iree-org#20498

[Im2Col] Support converting group convs to im2col

19e28c6

rkayaith commented Apr 23, 2025

View reviewed changes

rkayaith marked this pull request as ready for review April 23, 2025 04:03

rkayaith requested review from hanhanW and MaheshRavishankar as code owners April 23, 2025 04:03

rkayaith requested a review from nirvedhmeshram April 23, 2025 04:04

nirvedhmeshram requested a review from yzhang93 April 23, 2025 16:20

nirvedhmeshram reviewed Apr 23, 2025

View reviewed changes

yzhang93 reviewed Apr 23, 2025

View reviewed changes

rkayaith added 3 commits April 23, 2025 10:32

address minor comments

f29a2c6

Order batch dims before group dims in im2col

97e70b2

Fix missing dim remapping when computing 'batchPos'

9f5a51c

nirvedhmeshram approved these changes Apr 23, 2025

View reviewed changes

yzhang93 approved these changes Apr 23, 2025

View reviewed changes

rkayaith merged commit b86ed92 into iree-org:main Apr 23, 2025
41 checks passed

rkayaith deleted the group-conv-im2col branch April 23, 2025 21:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Im2Col] Support converting group convs to im2col #20611

[Im2Col] Support converting group convs to im2col #20611

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[Im2Col] Support converting group convs to im2col #20611

[Im2Col] Support converting group convs to im2col #20611

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!