Fix smem swizzle for matmul #588

zasdfgbnm · 2023-07-13T15:32:58Z

In #387, we have to do a

    int64_t swizzle_period =
        std::gcd(n_rows / repeated_pattern_size, tile_size_y / n_cols);

in order to make our swizzling algorithm work for epilogue. This looks more like an empirical hack whose only goal is to creates a square block. Although it empirically worked, I struggled to find a first-principle explanation for this approach. So I read through my original PR #155 multiple times and think through things carefully. But the more I read and think, the more I feel that the original implementation in #155 does not make sense. The problem is, #155 tries to interleave the entire ldmatrix_rows / repeated_pattern_size with an equal size split on tile y dimension. This is overkill, because we just need to evenly distribute rows on different megabanks, and as long as we do so, the number of rows can be arbitrarily large and we can still be bank-conflict free. So we should be swizzling on a (g, g) block instead of a (potentially much larger) (ldmatrix_rows / repeated_pattern_size, ldmatrix_rows / repeated_pattern_size) block.

zasdfgbnm · 2023-07-13T15:36:49Z

!build

liqiangxl · 2023-07-13T21:51:52Z

I tested the change in this PR with #387, it looks good.

liqiangxl

LGTM.

Fix smem swizzle for matmul

67ccf54

zasdfgbnm requested review from naoyam, liqiangxl and drzejan2 July 13, 2023 15:36

zasdfgbnm mentioned this pull request Jul 13, 2023

add epilogue to store MMA results in shared memory before write to #387

Merged

3 tasks

liqiangxl approved these changes Jul 13, 2023

View reviewed changes

Merge branch 'main' into smem-swizzle-fix

34615d2

zasdfgbnm merged commit 2e4c1e5 into main Jul 13, 2023

zasdfgbnm deleted the smem-swizzle-fix branch July 13, 2023 23:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix smem swizzle for matmul #588

Fix smem swizzle for matmul #588

zasdfgbnm commented Jul 13, 2023 •

edited

Loading

zasdfgbnm commented Jul 13, 2023

liqiangxl commented Jul 13, 2023

liqiangxl left a comment

Fix smem swizzle for matmul #588

Fix smem swizzle for matmul #588

Conversation

zasdfgbnm commented Jul 13, 2023 • edited Loading

zasdfgbnm commented Jul 13, 2023

liqiangxl commented Jul 13, 2023

liqiangxl left a comment

Choose a reason for hiding this comment

zasdfgbnm commented Jul 13, 2023 •

edited

Loading