xe: increase concat lws[0] size w/new optimal calculation #2315

syurkevi · 2024-12-24T16:48:14Z

Description

This PR adresses performance regressions discovered as part of regular monitoring by increasing the lws[0] as much as possible. The new get_optimal_lws() logic is biased towards a more n-dimensional lws. Removing the requested vectorization axis (-1) will create a lws[0] as large as possible. If maximizing lws[0] will cause new slowdowns an updated reverted get_optimal_lws() will be used instead.

General

Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
Have you formatted the code using clang-format?

Performance improvements

Have you submitted performance data that demonstrates performance improvements?

syurkevi · 2024-12-24T16:49:20Z

make test perf-gpu
set primitive=concat

xe: increase concat lws[0] size w/new optimal calculation

899bb1e

syurkevi requested a review from a team as a code owner December 24, 2024 16:48

github-actions bot added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Dec 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xe: increase concat lws[0] size w/new optimal calculation #2315

xe: increase concat lws[0] size w/new optimal calculation #2315

syurkevi commented Dec 24, 2024

syurkevi commented Dec 24, 2024

xe: increase concat lws[0] size w/new optimal calculation #2315

Are you sure you want to change the base?

xe: increase concat lws[0] size w/new optimal calculation #2315

Conversation

syurkevi commented Dec 24, 2024

Description

General

Performance improvements

syurkevi commented Dec 24, 2024