-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CPU] enable brdgmm_dw_conv kernels with non f32 bias #28076
base: master
Are you sure you want to change the base?
[CPU] enable brdgmm_dw_conv kernels with non f32 bias #28076
Conversation
d5b88c4
to
28e92a0
Compare
Hi @luweizhou2016 could you review the PR? Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As we discussed, update the JIRA ticket with the perf gain reason as track.
- SPR: jit_uni_dw_conv on avx512_core_bf16 -> brdg conv on avx512_core_bf16
- GNR : brgconv on AMX_fp16 -> brdg conv on avx512_fp16
@@ -980,6 +980,10 @@ void Convolution::createDescriptor(const std::vector<MemoryDescPtr>& inputDesc, | |||
memory::data_type bdt = outDnnlDesc.get_data_type(); | |||
#else | |||
memory::data_type bdt = memory::data_type::f32; | |||
// brdgmm_dw_conv supports only bia_type the same as src_type or dst_type |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe add more comments here:
/* brdgmm_dw_conv has more perf gain on bf16/fp16 inference.
brdgmm_dw_conv supports only bia_type the same as src_type or dst_type.
dw convolution support in onednn 3.5.
BF16:
kernel type | brgdconv | jit_uni_dw_convolution_fwd_t
support impl type | native bf16 ISA without AMX | avx512_core_bf16 or avx512_core
bias dt | oneof(src,dest) | oneof(src, dest, f32)
FP16:
kernel type | brgdconv | brgemm_convolution_fwd_t
impl type | native FP16 ISA without AMX | native FP16 ISA
bias type | oneof(src,dest) | oneof(src, dest, f32)
*/
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
28e92a0
to
12133d1
Compare
Details:
Tickets: