OPENCL_MEMORY_CONFIG_FILE环境变量如何使用？按照文档中的配置conv2d在device:cpu上不生效 #10469

lg920810 · 2024-03-15T10:16:17Z

我在使用PaddleLite-generic-demo的时候，添加了自己的模型，使用opencl后端进行推理加速，使用paddle_lite_opt工具进行模型转换成功，cpu上的推理均正确，在进行推理是报错为：
[3 4/ 8 13:36:25.947 ...-Lite/lite/backends/opencl/cl_context.cc:42 AddKernel] --- end get program ---
[3 4/ 8 13:36:25.947 ...-Lite/lite/backends/opencl/cl_context.cc:43 AddKernel] --- to create kernel: conv2d_1x1_h1w4c1 ---
[3 4/ 8 13:36:25.947 ...-Lite/lite/backends/opencl/cl_context.cc:49 AddKernel] --- end create kernel ---
[3 4/ 8 13:36:25.947 ...-Lite/lite/backends/opencl/cl_context.cc:59 GetKernel] --- kernel count: 71 ---
dir: ../assets/configs
width: 512
height: 512
mean: 0.500000,0.500000,0.500000
std: 0.500000,0.500000,0.500000
draw_weight: 0.800000
malloc image is out of max image size(w,h):16384,16384, need image size(w,h):32768,64
[F 4/ 8 13:36:25.947 ...e/lite/backends/opencl/target_wrapper.cc:98 ImageValid] malloc image is out of max image size(w,h):16384,16384, need image size(w,h):32768,64
[F 4/ 8 13:36:25.947 ...e/lite/backends/opencl/target_wrapper.cc:98 ImageValid] malloc image is out of max image size(w,h):16384,16384, need image size(w,h):32768,64

[4 4/ 8 13:36:25. 2 ...e-Lite/lite/backends/opencl/cl_context.h:43 ~CLContext] release cl::Program, cl::Kernel finished.
[I 4/ 8 13:36:26. 13 ...-Lite/lite/backends/opencl/cl_runtime.cc:41 ~CLRuntime] is_cl_runtime_initialized_:1
An internal error occurred in PaddleLite(mobile config).

查阅文档后添加以下内容到kernel.txt中，使用环境变量export OPENCL_MEMORY_CONFIG_FILE=./kernel.txt 进行模型转化和推，但都没有work。
device:cpu
conv2d

MuShangCC · 2024-03-15T11:50:20Z

OPENCL_MEMORY_CONFIG_FILE 文件内容不全，格式如下：
device:cpu
op type:input_var name:output_var name
需要给出具体的输入输出名称以匹配算子。

sprouteer · 2024-03-17T10:04:14Z

把算子的名字，输入输出写具体些试试。

MuShangCC · 2024-03-22T01:32:59Z

@lg920810 请问问题是否已经解决？

lg920810 · 2024-04-03T08:36:46Z

试过了，这个模型onnx共有600多层，拿onnx推理看了下每层输出的shape，有几十层的output都超过16384，如何查找onnx和paddlelite种对应的层的名字啊？或者是否可以限制opencl使用Image2D的格式计算呢？

lg920810 · 2024-04-08T03:24:24Z

@lg920810 请问问题是否已经解决？

hello~ 可以帮忙再看下这个问题吗？

MuShangCC · 2024-04-08T14:08:10Z

试过了，这个模型onnx共有600多层，拿onnx推理看了下每层输出的shape，有几十层的output都超过16384，如何查找onnx和paddlelite种对应的层的名字啊？或者是否可以限制opencl使用Image2D的格式计算呢？

没办法去将 onnx 模型和 paddlelite 模型的层去一一对应。
限制opencl使用Image2D就是通过OPENCL_MEMORY_CONFIG_FILE来设置的。
所以你的模型是动态shape的？

lg920810 · 2024-04-09T02:13:14Z

试过了，这个模型onnx共有600多层，拿onnx推理看了下每层输出的shape，有几十层的output都超过16384，如何查找onnx和paddlelite种对应的层的名字啊？或者是否可以限制opencl使用Image2D的格式计算呢？

没办法去将 onnx 模型和 paddlelite 模型的层去一一对应。限制opencl使用Image2D就是通过OPENCL_MEMORY_CONFIG_FILE来设置的。所以你的模型是动态shape的？

不是动态shape的，那看起来也没什么好办法了，我本来是想问问有没有全局限制Image2D的方法，我发现paddle模型中有一个layout算子，这个算子会做nchw->ImageDefault的转化，通过config也无法限制，有什么办法吗？

无法上传图片，算子执行顺序如下
reshape2 host/any/any
reshape2 host/any/any
io_copy opencl/any/any
layout opencl/any/ImageDefault N/A
transpose2 opencl/float16/ImageDefault N/A
layout opencl/any/NCHW
io_copy opencl/any/any

MuShangCC · 2024-04-09T06:56:13Z

试过了，这个模型onnx共有600多层，拿onnx推理看了下每层输出的shape，有几十层的output都超过16384，如何查找onnx和paddlelite种对应的层的名字啊？或者是否可以限制opencl使用Image2D的格式计算呢？

没办法去将 onnx 模型和 paddlelite 模型的层去一一对应。限制opencl使用Image2D就是通过OPENCL_MEMORY_CONFIG_FILE来设置的。所以你的模型是动态shape的？

不是动态shape的，那看起来也没什么好办法了，我本来是想问问有没有全局限制Image2D的方法，我发现paddle模型中有一个layout算子，这个算子会做nchw->ImageDefault的转化，通过config也无法限制，有什么办法吗？

无法上传图片，算子执行顺序如下 reshape2 host/any/any reshape2 host/any/any io_copy opencl/any/any layout opencl/any/ImageDefault N/A transpose2 opencl/float16/ImageDefault N/A layout opencl/any/NCHW io_copy opencl/any/any

对于 OpenCL 的内存使用有新的需求，可以在该issue下新增：#10452

lg920810 · 2024-04-10T05:44:17Z

把算子的名字，输入输出写具体些试试。

请问一下如果concate算子有4个输入1个输出，如何在config的txt中表示？以下这样不生效，且把逗号换成空格也不生效
device:cpu
concat:bilinear_interp_v2_51.tmp_0,bilinear_interp_v2_50.tmp_0,bilinear_interp_v2_49.tmp_0,bilinear_interp_v2_48.tmp_0:concat_81.tmp_0

MuShangCC · 2024-04-11T06:11:39Z

把算子的名字，输入输出写具体些试试。

请问一下如果concate算子有4个输入1个输出，如何在config的txt中表示？以下这样不生效，且把逗号换成空格也不生效 device:cpu concat:bilinear_interp_v2_51.tmp_0,bilinear_interp_v2_50.tmp_0,bilinear_interp_v2_49.tmp_0,bilinear_interp_v2_48.tmp_0:concat_81.tmp_0

Paddle-Lite/lite/core/optimizer/mir/opencl_memory_object_config_pass.cc

Line 57 in ddd8fd8

in_vars_name = Split(node_info.at(1), ",");

确实是用逗号分隔的啊，你再检查一下输入输出？

lg920810 · 2024-04-11T07:10:07Z

把算子的名字，输入输出写具体些试试。

请问一下如果concate算子有4个输入1个输出，如何在config的txt中表示？以下这样不生效，且把逗号换成空格也不生效 device:cpu concat:bilinear_interp_v2_51.tmp_0,bilinear_interp_v2_50.tmp_0,bilinear_interp_v2_49.tmp_0,bilinear_interp_v2_48.tmp_0:concat_81.tmp_0

Paddle-Lite/lite/core/optimizer/mir/opencl_memory_object_config_pass.cc

Line 57 in ddd8fd8

in_vars_name = Split(node_info.at(1), ",");

确实是用逗号分隔的啊，你再检查一下输入输出？

paddle-bot bot assigned shentanyue Mar 15, 2024

lg920810 mentioned this issue Apr 9, 2024

Paddle-Lite新需求收集(call for new feature request) #10452

Open

lg920810 closed this as completed Apr 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OPENCL_MEMORY_CONFIG_FILE环境变量如何使用？按照文档中的配置conv2d在device:cpu上不生效 #10469

OPENCL_MEMORY_CONFIG_FILE环境变量如何使用？按照文档中的配置conv2d在device:cpu上不生效 #10469

lg920810 commented Mar 15, 2024

MuShangCC commented Mar 15, 2024

sprouteer commented Mar 17, 2024

MuShangCC commented Mar 22, 2024

lg920810 commented Apr 3, 2024

lg920810 commented Apr 8, 2024

MuShangCC commented Apr 8, 2024

lg920810 commented Apr 9, 2024 •

edited

Loading

MuShangCC commented Apr 9, 2024

lg920810 commented Apr 10, 2024 •

edited

Loading

MuShangCC commented Apr 11, 2024 •

edited

Loading

lg920810 commented Apr 11, 2024

OPENCL_MEMORY_CONFIG_FILE环境变量如何使用？按照文档中的配置conv2d在device:cpu上不生效 #10469

OPENCL_MEMORY_CONFIG_FILE环境变量如何使用？按照文档中的配置conv2d在device:cpu上不生效 #10469

Comments

lg920810 commented Mar 15, 2024

MuShangCC commented Mar 15, 2024

sprouteer commented Mar 17, 2024

MuShangCC commented Mar 22, 2024

lg920810 commented Apr 3, 2024

lg920810 commented Apr 8, 2024

MuShangCC commented Apr 8, 2024

lg920810 commented Apr 9, 2024 • edited Loading

MuShangCC commented Apr 9, 2024

lg920810 commented Apr 10, 2024 • edited Loading

MuShangCC commented Apr 11, 2024 • edited Loading

lg920810 commented Apr 11, 2024

lg920810 commented Apr 9, 2024 •

edited

Loading

lg920810 commented Apr 10, 2024 •

edited

Loading

MuShangCC commented Apr 11, 2024 •

edited

Loading